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Preface  to  the  Sixth  Edition 


As  the  title  suggests,  this  book  is  about  fundamental  things  that  one  might 
expect  to  remain  fairly  the  same.  Yet  astronomy  has  evolved  enormously  over 
the  last  few  years,  and  only  a  few  chapters  of  this  book  have  been  left  unmod¬ 
ified. 

Since  the  book  is  used  also  by  many  amateurs,  the  introductory  chapter 
has  been  extended  to  give  a  brief  summary  of  different  celestial  objects  to 
“soften”  the  jump  to  rather  technical  topics. 

The  chapter  on  the  solar  system  was  very  long.  It  has  now  been  split  into 
two  separate  chapters.  Chapter  7  deals  with  general  properties  of  the  solar 
system.  Individual  objects  are  discussed  in  Chap.  8,  which  is  more  prone 
to  change  when  new  data  will  accumulate.  Also,  new  data  on  exoplanets  is 
obtained  at  an  increasing  rate.  Therefore  exoplanets  are  given  a  chapter  of 
their  own;  it  is  at  the  end  of  the  book,  since  it  is  closely  related  to  astrobiology, 
already  included  in  the  previous  edition.  These  last  chapters  may  change  more 
than  the  rest  of  the  book  in  the  future. 

These  changes  mean  that  the  numbering  of  formulas  and  figures  has 
changed  quite  extensively  after  the  previous  version  of  the  book. 

Cosmology  and  galactic  astronomy  have  still  been  evolving  rapidly.  There¬ 
fore  there  are  many  revisions  to  the  chapters  on  the  Milky  Way,  galaxies,  and 
cosmology. 

In  addition,  several  other  chapters  contain  smaller  revisions  and  many  of 
the  previous  images  have  been  replaced  with  newer  ones. 

Helsinki,  Finland  Hannu  Karttunen 

April  2016  Pekka  Kroger 

Heikki  Oja 
Markku  Poutanen 
Karl  Johan  Donner 
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Preface  to  the  First  Edition 


The  main  purpose  of  this  book  is  to  serve  as  a  university  textbook  for  a  first 
course  in  astronomy.  However,  we  believe  that  the  audience  will  also  include 
many  serious  amateurs,  who  often  find  the  popular  texts  too  trivial.  The  lack 
of  a  good  handbook  for  amateurs  has  become  a  problem  lately,  as  more  and 
more  people  are  buying  personal  computers  and  need  exact,  but  comprehen¬ 
sible,  mathematical  formalism  for  their  programs.  The  reader  of  this  book  is 
assumed  to  have  only  a  standard  high-school  knowledge  of  mathematics  and 
physics  (as  they  are  taught  in  Finland);  everything  more  advanced  is  usually 
derived  step  by  step  from  simple  basic  principles.  The  mathematical  back¬ 
ground  needed  includes  plane  trigonometry,  basic  differential  and  integral 
calculus,  and  (only  in  the  chapter  dealing  with  celestial  mechanics)  some  vec¬ 
tor  calculus.  Some  mathematical  concepts  the  reader  may  not  be  familiar  with 
are  briefly  explained  in  the  appendices  or  can  be  understood  by  studying  the 
numerous  exercises  and  examples.  However,  most  of  the  book  can  be  read 
with  very  little  knowledge  of  mathematics,  and  even  if  the  reader  skips  the 
mathematically  more  involved  sections,  (s)he  should  get  a  good  overview  of 
the  field  of  astronomy. 

This  book  has  evolved  in  the  course  of  many  years  and  through  the  work 
of  several  authors  and  editors.  The  first  version  consisted  of  lecture  notes  by 
one  of  the  editors  (Oja).  These  were  later  modified  and  augmented  by  the 
other  editors  and  authors.  Hannu  Karttunen  wrote  the  chapters  on  spherical 
astronomy  and  celestial  mechanics;  Vilppu  Piirola  added  parts  to  the  chapter 
on  observational  instruments,  and  Goran  Sandell  wrote  the  part  about  radio 
astronomy;  chapters  on  magnitudes,  radiation  mechanisms  and  temperature 
were  rewritten  by  the  editors;  Markku  Poutanen  wrote  the  chapter  on  the  so¬ 
lar  system;  Juhani  Kyrolainen  expanded  the  chapter  on  stellar  spectra;  Timo 
Rahunen  rewrote  most  of  the  chapters  on  stellar  structure  and  evolution;  Ilkka 
Tuominen  revised  the  chapter  on  the  Sun;  Kalevi  Mattila  wrote  the  chapter 
on  interstellar  matter;  Tapio  Markkanen  wrote  the  chapters  on  star  clusters 
and  the  Milky  Way;  Karl  Johan  Donner  wrote  the  major  part  of  the  chap¬ 
ter  on  galaxies;  Mauri  Valtonen  wrote  parts  of  the  galaxy  chapter,  and,  in 
collaboration  with  Pekka  Teerikorpi,  the  chapter  on  cosmology.  Finally,  the 
resulting,  somewhat  inhomogeneous,  material  was  made  consistent  by  the 
editors. 

The  English  text  was  written  by  the  editors,  who  translated  parts  of  the 
original  Finnish  text,  and  rewrote  other  parts,  updating  the  text  and  correcting 
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Preface  to  the  First  Edition 


errors  found  in  the  original  edition.  The  parts  of  text  set  in  smaller  print  are 
less  important  material  that  may  still  be  of  interest  to  the  reader. 

For  the  illustrations,  we  received  help  from  Veikko  Sinkkonen,  Mirva 
Vuori  and  several  observatories  and  individuals  mentioned  in  the  figure  cap¬ 
tions.  In  the  practical  work,  we  were  assisted  by  Arja  Kyrolainen  and  Merja 
Karsma.  A  part  of  the  translation  was  read  and  corrected  by  Brian  Skiff.  We 
want  to  express  our  warmest  thanks  to  all  of  them. 

Financial  support  was  given  by  the  Finnish  Ministry  of  Education  and  Suo- 
malaisen  kirjallisuuden  edistamisvarojen  valtuuskunta  (a  foundation  promot¬ 
ing  Finnish  literature),  to  whom  we  express  our  gratitude. 

Helsinki,  Finland  Hannu  Karttunen 

June  1987  Pekka  Kroger 

Heikki  Oja 
Markku  Poutanen 
Karl  Johan  Donner 
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On  a  dark,  cloudless  night,  at  a  distant  location 
far  away  from  the  city  lights,  the  starry  sky  can 
be  seen  in  all  its  splendour  (Fig.  1.1).  It  is  easy  to 
understand  how  these  thousands  of  lights  in  the 
sky  have  affected  people  throughout  the  ages. 

As  long  as  human  beings  have  existed,  they 
have  certainly  wondered  the  sky.  In  the  sky, 
ancient  people  saw  figures  related  to  religious 
myths  and  omens  sent  by  the  gods.  However, 
already  a  couple  of  millennia  ago  the  real  as¬ 
tronomy  started  to  evolve,  separating  itself  from 
religions  and  astrological  superstitions.  People 
started  to  study  the  sky  for  its  own  sake. 


1 .1  Celestial  Objects 

In  the  17th  century  people  started  to  realise  that 
the  Earth  is  not  the  centre  of  the  Universe.  About 
the  same  time  emerged  the  current  view  that  stars 
are  celestial  bodies  similar  to  our  Sun.  The  seem 
to  be  faint  dots  only  due  to  their  huge  distances. 
We  now  know  that  the  Sun  and  stars  are  hot  glow¬ 
ing  balls  of  gas,  producing  energy  when  fusion 
reactions  convert  hydrogen  to  helium  and  also  to 
other  heavier  elements  (Chap.  11). 

Although  stars  actually  move  at  enormous 
speeds,  the  sky  does  not  seem  to  change  even 
in  thousands  of  years,  due  to  the  vast  distances 
of  the  stars.  In  addition  to  the  Sun  and  the  Moon 
there  are  some  other  objects  that  move  with  re¬ 
spect  to  the  stars.  Since  the  antiquity,  these  mov¬ 
ing  objects  have  been  called  planets ,  from  the 
Greek  word  meaning  a  wanderer. 


The  rapid  motions  of  the  planets  reveal  that 
they  are  much  closer  than  the  stars.  Indeed  they 
are  objects  orbiting  the  Sun.  According  the  cur¬ 
rent  definition  (Chap.  7)  there  are  eight  planets 
orbiting  the  Sun:  Mercury,  Venus,  Earth,  Mars, 
Jupiter,  Saturn,  Uranus  and  Neptune.  In  addition 
to  these  relatively  big  bodies  a  lot  of  different 
smaller  objects  move  around  the  Sun:  dwarf  plan¬ 
ets,  asteroids,  comets  and  meteoroids  (Chap.  8). 
Mosts  planets  also  have  their  own  satellites  or 
moons.  Planets,  moons  and  minor  bodies  do  not 
produce  light  by  nuclear  fusion;  instead  they 
shine  just  by  reflecting  the  sunlight. 

At  the  centre  of  the  solar  system  shines  the 
Sun,  producing  energy  by  fusion  reactions 
(Chap.  13).  It  is  the  nearest  star,  and  studying  its 
properties  reveals  also  a  lot  about  other  stars. 

A  few  thousand  stars  can  be  seen  by  the  naked 
eye,  but  even  a  small  telescope  reveals  millions 
of  them.  Based  on  their  properties,  stars  can  be 
divided  into  different  categories.  A  great  major¬ 
ity  of  them  is  main-sequence  stars ,  like  our  Sun. 
Some  of  them,  though,  are  much  bigger,  giants 
or  supergiants ,  and  some  are  much  smaller,  white 
dwarfs.  Different  stars  are  usually  related  to  dif¬ 
ferent  evolutionary  stages  in  the  lives  of  stars. 
Many  stars  are  variable  stars ,  whose  brightness 
varies  with  time. 

Rather  recently  found  objects  are  compact 
stars',  neutron  stars  and  black  holes  (Chap.  15). 
Their  material  is  squeezed  into  such  a  com¬ 
pressed  form  and  their  gravitational  field  is  so 
strong  that  Einstein’s  general  theory  of  relativity 
must  be  used  to  describe  their  matter  and  space 
around  them. 
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1  Introduction 


Fig.  1.1  The  starry  sky  in 
all  its  splendour  can  be 
seen  only  far  away  from 
the  light  pollution  of  cities. 
(Pekka  Parviainen) 


Fig.  1 .2  The  Pleiades  is 
one  of  the  best-known  open 
star  clusters.  The  six 
brightest  stars  can  easily  be 
seen  with  the  naked  eye. 
Photographs  reveal  also 
interstellar  gas  reflecting 
the  light  of  the  stars. 
(NASA,  ESA, 
AURA/Caltech,  Palomar 
Observatory) 


The  Sun  is  a  solitary  star.  Many  stars  appear  in 
pairs,  they  are  binary  stars ,  orbiting  around  their 
common  centre  of  mass  (Chap.  10).  Also  systems 
of  several  stars  are  relatively  common. 

Bigger  groups  of  stars  are  star  clusters 
(Chap.  17).  Open  clusters  (Fig.  1.2)  usually  con¬ 
tain  a  few  tens  or  hundreds  of  stars,  that  were  bom 
in  the  same  area,  usually  quite  recently.  Eventu¬ 
ally  the  stars  will  diverge  to  their  own  paths. 

Globular  cluster  (Fig.  1.3),  on  the  other  hand, 
may  contain  hundreds  of  thousands  or  millions  of 
stars,  which  are  usually  very  old. 


The  interstellar  space  corresponds  pretty  well 
to  our  idea  about  a  perfect  vacuum.  However,  it 
is  not  totally  empty  but  contains  interstellar  mat¬ 
ter,  mainly  hydrogen  and  helium,  but  also  minute 
amounts  of  heavier  elements,  molecules  and  dust 
(Chap.  16).  The  interstellar  medium  does  not 
fill  the  space  as  a  uniform  mist,  but  forms  huge 
clouds  (Fig.  1.4). 

New  stars  are  bom  by  condensing  from  the  in¬ 
terstellar  matter.  When  the  density,  pressure  and 
temperature  of  the  condensing  cloud  have  risen 
high  enough,  fusion  reactions  start  and  a  new  star 
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Fig.  1 .3  The  globular 
cluster  Ml 3  in  the 
Hercules  constellation 
contains  over  a  million 
stars.  The  cluster  can  even 
be  seen  with  the  naked  eye 
as  a  small  nebulous  spot. 
(Palomar  Observatory) 


1 .2  The  Role  of  Astronomy 

Already  a  long  time  ago  man  was  interested  in 
celestial  phenomena.  Several  bone  carvings  made 
by  the  Cro-Magnon  men  as  early  as  30,000  years 
ago  have  been  found,  possibly  recording  Lunar 
phases.  In  that  case  these  calendars  would  be 
the  oldest  astronomical  documents,  predating  the 
skill  of  writing  by  some  25,000  years. 

Agriculture  required  a  good  knowledge  of 
the  seasons.  Religious  rituals  and  prognostica¬ 
tion  were  based  on  the  locations  of  the  celestial 
bodies.  Thus  time  reckoning  became  more  and 
more  accurate,  and  people  learned  to  calculate 
the  movements  of  celestial  bodies  in  advance. 

During  the  rapid  development  of  seafaring, 
when  voyages  extended  farther  and  farther  from 
home  ports,  position  determination  presented 
a  problem  for  which  astronomy  offered  a  prac¬ 
tical  solution.  Solving  these  problems  of  navi¬ 
gation  were  the  most  important  tasks  of  astron¬ 
omy  in  the  17th  and  18th  centuries,  when  the  first 
precise  tables  on  the  movements  of  the  planets 
and  on  other  celestial  phenomena  were  published. 
The  basis  for  these  developments  was  the  discov¬ 
ery  of  the  laws  governing  the  motions  of  the  plan¬ 
ets  by  Copernicus ,  Tycho  Brahe ,  Kepler ,  Galilei 
and  Newton. 


begins  to  radiate  the  energy  released  in  the  re¬ 
actions  (Chap.  12).  After  millions  or  billions  of 
years  the  energy  resources  will  be  exhausted.  The 
evolution  then  depends  on  the  mass  of  the  star. 
The  smallest  stars  just  cool  down  and  fade  away, 
but  more  massive  ones  either  eject  part  of  their 
mass  back  to  space  as  planetary  nebula  or  ex¬ 
plode  as  supernovas.  Thus  matter  converted  by 
the  nuclear  reactions  of  the  stars  is  mixed  with 
the  interstellar  matter. 

All  stars  visible  as  separate  objects  to  the 
naked  eye  or  with  binoculars  belong  to  the  Milky 
Way  (Fig.  1.5,  Chap.  18).  The  Milky  Way  is  a 
system  containing  a  couple  of  billion  stars,  a 
galaxy  (Figs.  1.6  and  1.7,  Chap.  19).  It  takes 
about  100,000  years  to  travel  across  the  Milky 
Way  with  the  speed  of  light. 

Milky  Way  is  not  the  only  galaxy,  but  just 
one  of  very  many  similar  systems.  Galaxies  are 
the  basic  building  blocks  of  the  Universe.  They 
do  not  spread  out  evenly  but  form  small  galaxy 
groups ,  bigger  galaxy  clusters  and  even  bigger 
superclusters. 

Galaxies  are  observed  close  to  the  edge  of  the 
visible  universe.  Nuclei  of  some  galaxies  are  seen 
as  quasars ;  the  most  distant  of  them  have  radiated 
the  light  we  detect  now  when  the  age  of  the  Uni¬ 
verse  was  only  one  tenth  of  the  current  value. 
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Fig.  1.4  The  North  America  nebula  in  the  constellation 
of  Cygnus  is  a  large  cloud  of  interstellar  gas.  The  nebula 
appears  brighter  than  the  background  because  the  radia¬ 
tion  from  the  nearby  stars  makes  it  shine.  The  nebula  is, 

Astronomical  research  has  changed  man’s 
view  of  the  world  from  geocentric,  anthropocen¬ 
tric  conceptions  to  the  modern  view  of  a  vast  uni¬ 
verse  where  man  and  the  Earth  play  an  insignifi¬ 
cant  role.  Astronomy  has  taught  us  the  real  scale 
of  the  nature  surrounding  us. 

Modern  astronomy  is  fundamental  science, 
motivated  mainly  by  man’s  curiosity,  his  wish  to 
know  more  about  Nature  and  the  Universe.  As¬ 
tronomy  has  a  central  role  in  forming  a  scien¬ 
tific  view  of  the  world.  “A  scientific  view  of  the 
world”  means  a  model  of  the  universe  based  on 
observations,  thoroughly  tested  theories  and  log¬ 
ical  reasoning.  Observations  are  always  the  ulti¬ 
mate  test  of  a  model:  if  the  model  does  not  fit 
the  observations,  it  has  to  be  changed,  and  this 


however,  very  faint  and  difficult  to  observe  visually.  The 
brightest  star  on  the  right  is  a  Cygni  or  Deneb.  (Photo 
M.  Poutanen  and  H.  Virtanen) 


process  must  not  be  limited  by  any  philosophical, 
political  or  religious  conceptions  or  beliefs. 


1 .3  Astronomical  Objects  of 
Research 

Modern  astronomy  explores  the  whole  Universe 
and  its  different  forms  of  matter  and  energy.  As¬ 
tronomy  can  be  divided  into  different  branches  in 
several  ways,  based  e.g.  on  the  object  of  research 
or  the  method  used. 

The  Earth  (Fig.  E10)  is  of  interest  to  astron¬ 
omy  for  many  reasons.  Nearly  all  observations 
must  be  made  through  the  atmosphere  (Fig.  E9), 
and  the  phenomena  of  the  upper  atmosphere  and 
magnetosphere  reflect  the  state  of  interplanetary 
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Fig.  1 .5  The  Milky  Way 
appears  as  a  nebulous  band 
stretching  across  the  sky. 

A  telescope  reveals  that  it 
consists  of  myriads  of  stars 
as  observed  already  by 
Galileo  Galilei  400  years 
ago.  The  Milky  Way  is  a 
flat  disclike  stellar  system. 
Our  solar  system  is  close  to 
the  plane  of  the  disc,  and 
looking  in  the  direction  of 
the  plane  we  see  a  lot  of 
stars.  But  if  we  look  away 
from  the  disc  the  stellar 
density  is  much  lower.  The 
disc  contains  also  unevenly 
distributed  interstellar  gas 
and  dust,  which  obscures 
the  view  in  some 
directions.  Near  the  lower 
edge  of  the  picture  the 
Milky  Way  seems  to  split 
into  two  branches  because 
the  distant  stars  are  behind 
the  intervening  obscuring 
matter.  (Pekka  Parviainen) 


space.  The  Earth  is  also  the  most  important  ob¬ 
ject  of  comparison  for  planetologists. 

The  Moon  is  still  studied  by  astronomical 
methods,  although  spacecraft  and  astronauts  have 
visited  its  surface  and  brought  samples  back  to 
the  Earth.  To  amateur  astronomers,  the  Moon 
is  an  easy  and  interesting  object  for  observa¬ 
tions. 

Space  probes  have  already  studied  all  plan¬ 
ets,  many  of  their  satellites,  some  asteroids  and 


comets.  The  most  distant  planets,  Uranus  and 
Neptune,  have  been  observed  only  by  fly-bys,  but 
all  the  other  ones  also  by  orbiters.  Spacecraft  have 
softlanded  on  Mars,  Venus,  Saturn’s  moon  Ti¬ 
tan  and  some  minor  bodies.  Exploration  by  such 
probes  has  tremendously  added  to  our  knowledge 
about  the  conditions  of  these  objects.  Continuous 
monitoring  of  the  planets,  however,  can  still  only 
be  made  from  the  Earth,  and  many  small  bodies 
of  the  solar  system  still  await  their  spacecraft. 
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Fig.  1 .6  The  galaxy  M3 1 
in  the  Andromeda 
constellation  is  a  star 
system  resembling  our  own 
Milky  Way.  Stars  and 
interstellar  matter 
concentrate  in  spiral  arms. 
The  shape  of  M3  lisa 
round,  flat  disc,  but  due  to 
the  oblique  view  it  looks 
oval.  In  good  conditions 
the  centre  of  the  galaxy  can 
be  seen  even  with  the 
naked  eye  as  a  faint 
nebulous  spot.  M31  has 
two  small  elliptic 
neighbour  galaxies  seen 
here  as  bright  ellipses.  M32 
is  below  the  centre  of  M3 1 
and  M110  towards 
northeast  from  the  centre. 
North  is  up.  The  dots  are 
stars  of  the  Milky  Way. 

(Bill  Schoening,  Vanessa 
Harvey/REU  pro- 
gram/N  OAO/AURA/NSF) 


Fig.  1.7  In  addition  to  the 
big  galaxies  like  the  Milky 
Way  there  are  numerous 
much  smaller  dwarf 
galaxies,  which  are  often 
irregular  in  shape.  One  of 
them  is  the  Large 
Magellanic  Cloud,  our 
nearest  neighbour  galaxy. 

It  is  easily  seen  with  the 
naked  eye,  but  it  is  close  to 
the  southern  pole  of  the 
sky.  (NOAO/Cerro  Tololo 
Inter- American 
Observatory) 
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Fig.  1 .8  The  deep-field 
picture  of  the  Hubble  space 
telescope  is  a  combination 
of  several  images  exposed 
altogether  over  1 1  days. 
The  picture  shows  several 
galaxies  that  are  the  most 
distant  ones  known.  When 
we  are  looking  far  to  space 
we  are  also  looking  far  to 
the  past  since  the  light 
proceeds  at  a  finite  speed. 
Thus  many  of  the  galaxies 
in  the  picture  are  also 
among  the  oldest  known 
objects.  When  we  compare 
them  with  the  objects  in 
our  neighbourhood  we  can 
deduce  how  the  galaxies 
evolved  during  billions  of 
years.  (NASA) 


Fig.  1 .9  Although  space 
probes  and  satellites  have 
gathered  remarkable  new 
information,  a  great 
majority  of  astronomical 
observations  is  still 
Earth-based.  The  most 
important  observatories  are 
usually  located  at  high 
altitudes  far  from  densely 
populated  areas.  One  such 
observatory  is  on  Mt 
Paranal  in  Chile,  which 
houses  the  European  VLT 
telescopes.  (Photo  ESO) 


An  astronomer  can  also  specialise  in  study¬ 
ing  several  different  fields  like  the  Sun,  differ¬ 
ent  kinds  of  stars,  star  clusters,  the  Milky  Way 
or  galaxies  (Fig.  1.11). 


The  largest  object  of  research  is  the  whole 
Universe.  Earlier  this  field,  cosmology ,  belonged 
to  theologists  and  philosophers,  but  in  the  20th 
century  it  became  an  object  of  physical  theories 
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and  eventually  of  concrete  astronomical  observa¬ 
tions. 

Spherical  astronomy  is  an  old  field  of  astron¬ 
omy  studying  the  coordinate  systems  of  the  ce¬ 
lestial  sphere  and  apparent  positions  and  motions 
of  the  celestial  objects.  Until  the  17th  century  as¬ 
tronomy  was  mainly  spherical  astronomy. 

When  Isaac  Newton  published  the  fundamen¬ 
tal  laws  of  mechanics  in  1687  in  his  Principia 
mathematical  the  motions  of  celestial  objects  got 
a  physical  explanation.  That  was  the  beginning 
of  celestial  mechanics ,  studying  the  motions  from 
the  planets  of  the  Solar  System  and  satellites  or¬ 
biting  the  Earth  to  distant  galaxies  and  galaxy 
clusters. 


Halfway  the  19th  century  it  was  found  out  how 
spectra  can  reveal  physical  properties  of  celestial 
objects.  This  was  the  beginning  of  astrophysics , 
studying  the  physical  phenomena  of  the  stars.  Re¬ 
sults  from  astrophysics  are  utilised  particularly  in 
the  research  of  the  Sun,  stars  and  interstellar  mat¬ 
ter. 

Astronomy  can  be  divided  into  different  bran¬ 
ches  also  by  the  wavelengths  used.  We  can  talk 
about  radio,  infrared,  optical,  ultraviolet,  X-ray  or 
gamma-ray  astronomy,  depending  on  the  wave¬ 
length  used  in  the  observations. 

Astronomers  study  also  particles  coming  from 
the  space,  like  neutrinos  and  cosmic  rays.  Grav- 


Fig.  1.10  The  Earth  as 
seen  from  the  Moon. 
Thanks  to  spaceflights  we 
have  seen  clearly  the 
planetary  status  of  the 
Earth.  The  picture  was 
taken  by  the  Japanese 
Kaguya  lunar  orbiter  in 
2007.  Currently  the  Moon 
is  the  only  celestial  object 
outside  the  Earth  visited  by 
human  beings,  on  the 
Apollo  flights  in 
1969-1972.  (JAXA) 


—  -  Planets  and  solar  system 

Active  galactic  nuclei 

—  -  -  Sun 

—  Interstellar  matter,  Milky  Way 
Origin  of  stars  and  planets 
Observational  instruments 


Year 


Fig.  1.11  Astronomy  in  the  change.  Although  the  num¬ 
bers  of  astronomical  articles  have  increased  in  all  sub¬ 
fields  in  the  last  few  decades,  the  relative  proportions  have 
changed.  Cosmology  and  galaxies  are  the  greatest  winners 
whereas  the  share  of  stellar  research  has  decreased.  The 


graph  illustrates  the  relative  numbers  of  articles  in  differ¬ 
ent  fields  of  astronomy  in  the  most  influential  journals  in 
1981-2009.  (Adapted  from  the  New  Worlds,  New  Hori¬ 
zons  in  Astronomy  and  Astrophysics ,  2010,  p.  120.)  Pub¬ 
lished  by  the  US  National  Science  Academy 
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Size  Mass  Density  Temperature 

|  metres  I  [kilograms]  [kg/m3]  [ke  Ivins] 
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Fig.  1.12  The  dimensions  of  the  Universe 


itational  waves  are  the  most  recent  object  of  re¬ 
search. 

Astronomy  and  space  research  may  seem 
to  be  related,  although  they  are  quite  differ¬ 
ent  things.  Space  research  includes  all  activi¬ 


ties  in  the  space,  but  only  a  minor  fraction  of 
that  is  astronomical  research.  Space  research  is 
mainly  commercial  services,  like  communica¬ 
tion,  weather  observations,  navigation,  remote 
sensing  and  environmental  control,  and  also  mil- 


10 


1  Introduction 


itary  reconnaissance.  Space  astronomy  is  a  field 
of  astronomy  that  utilises  observations  made  by 
satellites  and  space  probes. 


1 .4  The  Scale  of  the  Universe 

The  masses  and  sizes  of  astronomical  objects 
are  usually  enormously  large.  But  to  understand 
their  properties,  the  smallest  parts  of  matter, 
molecules,  atoms  and  elementary  particles,  must 
be  studied.  The  densities,  temperatures  and  mag¬ 
netic  fields  in  the  Universe  vary  within  much 
larger  limits  than  can  be  reached  in  laboratories 
on  the  Earth  (Fig.  1.12). 

The  greatest  natural  density  met  on  the  Earth  is 
22,500  kgm-3  (osmium),  while  in  neutron  stars 
densities  of  the  order  of  1018  kgm-3  are  possi¬ 
ble.  The  density  in  the  best  vacuum  achieved  on 
the  Earth  is  only  10-9  kgm-3,  but  in  interstellar 
space  the  density  of  the  gas  may  be  10-21  kg  m-3 
or  even  less.  Modern  accelerators  can  give  parti¬ 
cles  energies  of  the  order  of  10 13  electron  volts 
(eV).  Cosmic  rays  coming  from  the  sky  may  have 
energies  of  over  102°  eV. 

It  has  taken  man  a  long  time  to  grasp  the  vast 
dimensions  of  space.  Already  Hipparchos  in  the 
second  century  B.C.  obtained  a  reasonably  cor¬ 
rect  value  for  the  distance  of  the  Moon.  The  scale 
of  the  solar  system  was  established  together  with 
the  heliocentric  system  in  the  17th  century.  In 
the  old  geocentric  system  the  distances  of  planets 
did  not  affect  their  apparent  motions  and  could 


be  chosen  arbitrarily.  In  the  heliocentric  system 
this  is  no  more  possible.  Thus  the  distances  of 
the  Solar  System  were  known  reasonably  well  as 
early  as  in  the  15th  century.  Also  serious  attempts 
to  determine  stellar  distances  were  made,  but  the 
first  successful  measurements  were  made  only  in 
the  1830’s,  and  decent  estimates  for  the  distances 
to  the  galaxies  were  obtained  only  in  the  1920’s. 

We  can  get  some  kind  of  picture  of  the  dis¬ 
tances  involved  (Fig.  1.4)  by  considering  the  time 
required  for  light  to  travel  from  a  source  to  the 
retina  of  the  human  eye.  It  takes  8  minutes  for 
light  to  travel  from  the  Sun,  5  ^  hours  from  Nep¬ 
tune  and  4  years  from  the  nearest  star.  We  can¬ 
not  see  the  centre  of  the  Milky  Way,  but  the 
many  globular  clusters  around  the  Milky  Way 
are  at  approximately  similar  distances.  It  takes 
about  20,000  years  for  the  light  from  the  glob¬ 
ular  cluster  of  Fig.  1.5  to  reach  the  Earth.  It 
takes  150,000  years  to  travel  the  distance  from 
the  nearest  galaxy,  the  Magellanic  Cloud  seen  on 
the  southern  sky  (Fig.  1.7).  The  photons  that  we 
see  now  started  their  voyage  when  Neanderthal 
Man  lived  on  the  Earth.  The  light  coming  from 
the  Andromeda  Galaxy  (Fig.  1.6)  in  the  north¬ 
ern  sky  originated  2  million  years  ago.  Around 
the  same  time  the  first  actual  human  using  tools, 
Homo  habilis ,  appeared.  The  most  distant  objects 
known,  the  quasars,  are  so  far  away  that  their  ra¬ 
diation,  seen  on  the  Earth  now,  was  emitted  long 
before  the  Sun  or  the  Earth  were  born  (Fig.  1.8). 


Spherical  Astronomy 


Spherical  astronomy  is  a  science  studying  astro¬ 
nomical  coordinate  frames,  directions  and  appar¬ 
ent  motions  of  celestial  objects,  determination  of 
position  from  astronomical  observations,  obser¬ 
vational  errors,  etc.  We  shall  concentrate  mainly 
on  astronomical  coordinates,  apparent  motions  of 
stars  and  time  reckoning.  Also,  some  of  the  most 
important  star  catalogues  will  be  introduced. 

For  simplicity  we  will  assume  that  the  ob¬ 
server  is  always  on  the  northern  hemisphere.  Al¬ 
though  all  definitions  and  equations  are  easily 
generalised  for  both  hemispheres,  this  might  be 
unnecessarily  confusing.  In  spherical  astronomy 
all  angles  are  usually  expressed  in  degrees;  we 
will  also  use  degrees  unless  otherwise  mentioned. 


2.1  Spherical  Trigonometry 

For  the  coordinate  transformations  of  spherical 
astronomy,  we  need  some  mathematical  tools, 
which  we  present  now. 

If  a  plane  passes  through  the  centre  of  a  sphere, 
it  will  split  the  sphere  into  two  identical  hemi¬ 
spheres  along  a  circle  called  a  great  circle 
(Fig.  2.1).  A  line  perpendicular  to  the  plane  and 
passing  through  the  centre  of  the  sphere  inter¬ 
sects  the  sphere  at  the  poles  P  and  P' .  If  a  sphere 
is  intersected  by  a  plane  not  containing  the  cen¬ 
tre,  the  intersection  curve  is  a  small  circle.  There 
is  exactly  one  great  circle  passing  through  two 
given  points  Q  and  Q'  on  a  sphere  (unless  these 
points  are  antipodal,  in  which  case  all  circles 
passing  through  both  of  them  are  great  circles). 
The  arc  QQ'  of  this  great  circle  is  the  shortest 


Fig.  2.1  A  great  circle  is  the  intersection  of  a  sphere  and 
a  plane  passing  through  its  centre.  P  and  P'  are  the  poles 
of  the  great  circle.  The  shortest  path  from  Q  to  Q'  follows 
the  great  circle 

path  on  the  surface  of  the  sphere  between  these 
points. 

A  spherical  triangle  is  not  just  any  three- 
cornered  figure  lying  on  a  sphere;  its  sides  must 
be  arcs  of  great  circles.  The  spherical  trian¬ 
gle  A B C  in  Fig.  2.2  has  the  arcs  AB,  BC  and  AC 
as  its  sides.  If  the  radius  of  the  sphere  is  r,  the 
length  of  the  arc  A  B  is 

\AB\=rc,  [c]=rad, 

where  c  is  the  angle  subtended  by  the  arc  A  B 
as  seen  from  the  centre.  This  angle  is  called  the 
central  angle  of  the  side  A  B.  Because  lengths  of 
sides  and  central  angles  correspond  to  each  other 
in  a  unique  way,  it  is  customary  to  give  the  central 
angles  instead  of  the  sides.  In  this  way,  the  radius 

11 
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of  the  sphere  does  not  enter  into  the  equations 
of  spherical  trigonometry.  An  angle  of  a  spheri¬ 
cal  triangle  can  be  defined  as  the  angle  between 
the  tangents  of  the  two  sides  meeting  at  a  vertex, 
or  as  the  dihedral  angle  between  the  planes  in¬ 
tersecting  the  sphere  along  these  two  sides.  We 
denote  the  angles  of  a  spherical  triangle  by  cap¬ 
ital  letters  (A,  B ,  C)  and  the  opposing  sides,  or, 
more  correctly,  the  corresponding  central  angles, 
by  lowercase  letters  (a,  b,  c). 

The  sum  of  the  angles  of  a  spherical  triangle  is 
always  greater  than  180  degrees;  the  excess 

E  =  A  +  B  +  C  -  180°  (2.1) 

is  called  the  spherical  excess.  It  is  not  a  constant, 
but  depends  on  the  triangle.  Unlike  in  plane  ge¬ 
ometry,  it  is  not  enough  to  know  two  of  the  angles 
to  determine  the  third  one.  The  area  of  a  spherical 
triangle  is  related  to  the  spherical  excess  in  a  very 
simple  way: 

Area  =  Er2,  [E]  =  rad.  (2.2) 

This  shows  that  the  spherical  excess  equals  the 
solid  angle  in  steradians  (see  Appendix  A.l),  sub¬ 
tended  by  the  triangle  as  seen  from  the  centre. 

To  prove  (2.2),  we  extend  all  sides  of  the  tri¬ 
angle  A  to  great  circles  (Fig.  2.3).  These  great 
circles  will  form  another  triangle  Ar,  congruent 
with  A  but  antipodal  to  it.  If  the  angle  A  is  ex¬ 
pressed  in  radians,  the  area  of  the  slice  S(A ) 


Fig.  2.2  A  spherical  triangle  is  bounded  by  three  arcs  of 
great  circles,  AB,  BC  and  C A.  The  corresponding  central 
angles  are  c,  a,  and  b 


bounded  by  the  two  sides  of  A  (the  shaded 
area  in  Fig.  2.3)  is  obviously  2 A /In  =  A/n 
times  the  area  of  the  sphere,  4nr2 .  Similarly,  the 
slices  S(B)  and  S(C )  cover  fractions  B/rt  and 
C / it  of  the  whole  sphere. 

Together,  the  three  slices  cover  the  whole  sur¬ 
face  of  the  sphere,  the  equal  triangles  A  and  A 7 
belonging  to  every  slice,  and  each  point  outside 
the  triangles,  to  exactly  one  slice.  Thus  the  area  of 
the  slices  S(A),  S(B)  and  S(C )  equals  the  area  of 
the  sphere  plus  four  times  the  area  of  A,  A(A): 

A  +  B  +  C  o  o 

- 4nr2  =  4jtr2  +  4A(A), 

Tt 

whence 

.4(A)  =  (A  +  B  +  C  -  n)r 2  =  Er2 . 

As  in  the  case  of  plane  triangles,  we  can  de¬ 
rive  relationships  between  the  sides  and  angles  of 
spherical  triangles.  The  easiest  way  to  do  this  is 
by  inspecting  certain  coordinate  transformations. 

Suppose  we  have  two  rectangular  coordinate 
frames  Oxyz  and  Ox'y' z!  (Fig.  2.4),  such  that 
the  x’y’z!  frame  is  obtained  from  the  vyz  frame 
by  rotating  it  around  the  v  axis  by  an  angle  / . 

The  position  of  a  point  P  on  a  unit  sphere  is 
uniquely  determined  by  giving  two  angles.  The 
angle  xj/  is  measured  counterclockwise  from  the 
positive  v  axis  along  the  vy  plane;  the  other  an¬ 
gle  0  tells  the  angular  distance  from  the  vy  plane. 


Fig.  2.3  If  the  sides  of  a  spherical  triangle  are  extended 
all  the  way  around  the  sphere,  they  form  another  trian¬ 
gle  Ar,  antipodal  and  equal  to  the  original  triangle  A.  The 
shaded  area  is  the  slice  S  (A) 
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Fig.  2.4  The  location  of  a  point  P  on  the  surface  of  a  unit 
sphere  can  be  expressed  by  rectangular  xyz  coordinates  or 
by  two  angles,  x/z  and  0.  The  x'y'z'  frame  is  obtained  by 
rotating  the  xyz  frame  around  its  v  axis  by  an  angle  x 

In  an  analogous  way,  we  can  define  the  angles  xj/' 
and  O',  which  give  the  position  of  the  point  P  in 
the  x'y'z'  frame.  The  rectangular  coordinates  of 
the  point  P  as  functions  of  these  angles  are: 

x  =  cos  if/  cos  0 ,  x'  =  cos  x/r'cosO', 

y  =  sin  xjz  cos  0 ,  y'  =  sin  cos  0',  (2.3) 

z  =  sin0,  z' =  sinO'. 

We  also  know  that  the  dashed  coordinates  are  ob¬ 
tained  from  the  undashed  ones  by  a  rotation  in  the 
yz  plane  (Fig.  2.5): 

/ 

v  =  v, 

y'  =  y  COS  x  +  z  sin  x ,  (2.4) 

z  =  -ysinx  +  z  cos  x  ■ 

By  substituting  the  expressions  of  the  rectangular 
coordinates  (2.3)  into  (2.4),  we  have 

cos  x/r'  cos  0'  =  cos  x/r  cos  0, 

sin  xjr'  cos  0'  =  sin  x/s  cos  0  cos  x  +  sin  0  sin  x , 

sin  0'  =  —  sin  \jz  cos  0  sin  x  +  sin  0  cos  x  • 

(2.5) 

In  fact,  these  equations  are  quite  sufficient  for 
all  coordinate  transformations  we  may  encounter. 


Fig.  2.5  The  coordinates  of  the  point  P  in  the  rotated 
frame  are  x’  —  x,  yr  —  y  cosx  +  zsinx,  z'  =  zcosx  — 
ysinx 


Fig.  2.6  To  derive  triangulation  formulas  for  the  spher¬ 
ical  triangle  ABC ,  the  spherical  coordinates  0,  x/s' 
and  O'  of  the  vertex  C  are  expressed  in  terms  of  the  sides 
and  angles  of  the  triangle 

However,  we  shall  also  derive  the  usual  equations 
for  spherical  triangles.  To  do  this,  we  set  up  the 
coordinate  frames  in  a  suitable  way  (Fig.  2.6). 
The  z  axis  points  towards  the  vertex  A  and  the 
z!  axis,  towards  B.  Now  the  vertex  C  corresponds 
to  the  point  P  in  Fig.  2.4.  The  angles  x\r,0,  xjr' ,  0' 
and  x  can  be  expressed  in  terms  of  the  angles  and 
sides  of  the  spherical  triangle: 

x/r  =  A-  90°,  0  =  90°-b, 

xj/'  =  90°  —  B,  0'  =  90°-a,  X=c. 

(2.6) 
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Substitution  into  (2.5)  gives 

cos(90°  -  B )  cos(90°  -  a) 

=  cos(A  —  90°)  cos(90°  —  b), 
sin(90°  —  B)  cos(90°  —  a) 

=  sin(A  —  90°)  cos(90°  —  b)  cos  c 
+  sin(90°  —  b)  sine, 
sin(90°  —  a) 

=  —  sin(A  —  90°)  cos(90°  —  b)  sine 
+  sin(90°  —  b)  cose, 


The  second  equation  in  (2.7)  is  the  sine-cosine 
formula ,  and  the  corresponding  plane  formula  is 
a  trivial  one: 

e  =  b  cos  A  +  a  cos  B. 

This  is  obtained  by  substituting  the  approxima¬ 
tions  of  sine  and  cosine  into  the  sine-cosine  for¬ 
mula  and  ignoring  all  quadratic  and  higher-order 
terms.  In  the  same  way  we  can  use  the  third  equa¬ 
tion  in  (2.7),  the  cosine  formula,  to  derive  the  pla¬ 
nar  cosine  formula: 

a2  =  b2  +  c2  —  2 be  cos  A . 


or 

sin  B  sin  a  =  sin  A  sin/?, 

cos  B  sin  a  =  —  cos  A  sin  b  cos  c  +  cos  b  sin  c, 

cos  a  =  cos  A  sin  b  sin  c  +  cos  b  cos  c. 

(2.7) 

Equations  for  other  sides  and  angles  are  obtained 
by  cyclic  permutations  of  the  sides  a,  b,c  and  the 
angles  A,  B,  C.  For  instance,  the  first  equation 
also  yields 

sin  C  sin  b  =  sin  B  sin  c, 
sin  A  sine  =  sinC  sin  a. 

All  these  variations  of  the  sine  formula  can  be 
written  in  an  easily  remembered  form: 

sin  a  sin  b  sin  c 

- = - = - .  (2.8) 

sin  A  sin  B  sin  C 

If  we  take  the  limit,  letting  the  sides  a,  b 
and  c  shrink  to  zero,  the  spherical  triangle  be¬ 
comes  a  plane  triangle.  If  all  angles  are  expressed 
in  radians,  we  have  approximately 

,  1  2 

sin a^a,  cos  a  ~  1 - a  . 

2 

Substituting  these  approximations  into  the  sine 
formula,  we  get  the  familiar  sine  formula  of  plane 
geometry: 

a  b  c 

sin  A  sin  B  sin  C 


2.2  The  Earth 

A  position  on  the  Earth  is  usually  given  by  two 
spherical  coordinates  (although  in  some  calcu¬ 
lations  rectangular  or  other  coordinates  may  be 
more  convenient).  If  necessary,  also  a  third  coor¬ 
dinate,  e.g.  the  distance  from  the  centre,  can  be 
used. 

The  reference  plane  is  the  equatorial  plane , 
perpendicular  to  the  rotation  axis  and  intersecting 
the  surface  of  the  Earth  along  the  equator .  Small 
circles  parallel  to  the  equator  are  called  paral¬ 
lels  of  latitude.  Semicircles  from  pole  to  pole 
are  meridians.  The  geographical  longitude  is  the 
angle  between  the  meridian  and  the  zero  merid¬ 
ian  passing  through  Greenwich  Observatory.  We 
shall  use  positive  values  for  longitudes  east  of 
Greenwich  and  negative  values  west  of  Green¬ 
wich.  Sign  convention,  however,  varies,  and  neg¬ 
ative  longitudes  are  not  used  in  maps;  so  it  is  usu¬ 
ally  better  to  say  explicitly  whether  the  longitude 
is  east  or  west  of  Greenwich. 

The  latitude  is  usually  supposed  to  mean  the 
geographical  latitude ,  which  is  the  angle  between 
the  plumb  line  and  the  equatorial  plane.  The  lat¬ 
itude  is  positive  in  the  northern  hemisphere  and 
negative  in  the  southern  one.  The  geographical 
latitude  can  be  determined  by  astronomical  ob¬ 
servations  (Fig.  2.7):  the  altitude  of  the  celestial 
pole  measured  from  the  horizon  equals  the  geo¬ 
graphical  latitude.  (The  celestial  pole  is  the  inter¬ 
section  of  the  rotation  axis  of  the  Earth  and  the 
infinitely  distant  celestial  sphere;  we  shall  return 
to  these  concepts  a  little  later.) 
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Celestial  pole 


Fig.  2.7  The  latitude  <p  is  obtained  by  measuring  the  alti¬ 
tude  of  the  celestial  pole.  The  celestial  pole  can  be  imag¬ 
ined  as  a  point  at  an  infinite  distance  in  the  direction  of  the 
Earth’s  rotation  axis 


Because  the  Earth  is  rotating,  it  is  slightly  flat¬ 
tened.  The  exact  shape  is  rather  complicated,  but 
for  most  purposes  it  can  by  approximated  by  an 
oblate  spheroid,  the  short  axis  of  which  coincides 
with  the  rotation  axis  (Sect.  7.6).  In  1979  the 
International  Union  of  Geodesy  and  Geophysics 
(IUGG)  adopted  the  Geodetic  Reference  System 
1980  (GRS-80),  which  is  used  when  global  ref¬ 
erence  frames  fixed  to  the  Earth  are  defined.  The 
GRS-80  reference  ellipsoid  has  the  following  di¬ 
mensions: 

equatorial  radius  a  =  6,378, 137  m, 
polar  radius  b  =  6,356,752  m, 
flattening  f  =  (a  —  b) /a 

=  1/298.25722210. 

The  shape  defined  by  the  surface  of  the  oceans, 
called  the  geoid ,  differs  from  this  spheroid  at 
most  by  about  100  m. 

The  angle  between  the  equator  and  the  nor¬ 
mal  to  the  ellipsoid  approximating  the  true  Earth 
is  called  the  geodetic  latitude.  Because  the  sur¬ 
face  of  a  liquid  (like  an  ocean)  is  perpendicular 
to  the  plumb  line,  the  geodetic  and  geographical 
latitudes  are  practically  the  same. 


Fig.  2.8  Due  to  the  flattening  of  the  Earth,  the  geographic 
latitude  0  and  geocentric  latitude  <p'  are  different 


Because  of  the  flattening,  the  plumb  line  does 
not  point  to  the  centre  of  the  Earth  except  at  the 
poles  and  on  the  equator.  An  angle  corresponding 
to  the  ordinary  spherical  coordinate  (the  angle  be¬ 
tween  the  equator  and  the  line  from  the  centre  to 
a  point  on  the  surface),  the  geocentric  latitude  <p' 
is  therefore  a  little  smaller  than  the  geographic 
latitude  0  (Fig.  2.8). 

We  now  derive  an  equation  between  the  geo¬ 
graphic  latitude  0  and  geocentric  latitude  0r,  as¬ 
suming  the  Earth  is  an  oblate  spheroid  and  the 
geographic  and  geodesic  latitudes  are  equal.  The 
equation  of  the  meridional  ellipse  is 

2  2 
v  yz 

- b  —  =  1 

a2  b2 

The  direction  of  the  normal  to  the  ellipse  at 
a  point  (x,  y )  is  given  by 


tan  0  = 


dx 

d  y 


2 

b2  x 


The  geocentric  latitude  is  obtained  from 


tan  0r  =  y  /x . 


Hence 


b 


tan0r  =  — 2  tan0  =  (l  —  e2)  tan0,  (2.9) 

CL 
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where 

e  —  yj  \  —  b2 /a1 

is  the  eccentricity  of  the  ellipse.  The  difference 
A 0  =  <fi  —  <fi'  has  a  maximum  1 1 .5'  at  the  latitude 
45°. 

Since  the  coordinates  of  celestial  bodies  in  as¬ 
tronomical  almanacs  are  given  with  respect  to  the 
centre  of  the  Earth,  the  coordinates  of  nearby  ob¬ 
jects  must  be  corrected  for  the  difference  in  the 
position  of  the  observer,  if  high  accuracy  is  re¬ 
quired.  This  means  that  one  has  to  calculate  the 
top o centric  coordinates,  centered  at  the  observer. 
The  easiest  way  to  do  this  is  to  use  rectangular 
coordinates  of  the  object  and  the  observer  (Ex¬ 
ample  2.6). 

One  arc  minute  along  a  meridian  is  called 
a  nautical  mile.  Since  the  radius  of  curvature 
varies  with  latitude,  increasing  towards  the  poles, 
the  length  of  the  nautical  mile  so  defined  would 
depend  on  the  latitude  (1843  m  on  the  equator, 
1862  m  at  the  poles).  Therefore  one  nautical  mile 
has  been  defined  to  be  equal  to  one  minute  of  arc 
at  0  =  45°,  whence  1  nautical  mile  =  1852  m. 


2.3  The  Celestial  Sphere 

The  ancient  universe  was  confined  within  a  fi¬ 
nite  spherical  shell.  The  stars  were  fixed  to  this 
shell  and  thus  were  all  equidistant  from  the  Earth, 
which  was  at  the  centre  of  the  spherical  universe. 
This  simple  model  is  still  in  many  ways  as  use¬ 
ful  as  it  was  in  antiquity:  it  helps  us  to  easily  un¬ 
derstand  the  diurnal  and  annual  motions  of  stars, 
and,  more  important,  to  predict  these  motions  in 
a  relatively  simple  way.  Therefore  we  will  assume 
for  the  time  being  that  all  the  stars  are  located  on 
the  surface  of  an  enormous  sphere  and  that  we 
are  at  its  centre.  Because  the  radius  of  this  celes¬ 
tial  sphere  is  practically  infinite,  we  can  neglect 
the  effects  due  to  the  changing  position  of  the  ob¬ 
server,  caused  by  the  rotation  and  orbital  motion 
of  the  Earth.  These  effects  will  be  considered  later 
in  Sects.  2.9  and  2.10. 

Since  the  distances  of  the  stars  are  ignored, 
we  need  only  two  coordinates  to  specify  their  di¬ 
rections.  Each  coordinate  frame  has  some  fixed 
reference  plane  passing  through  the  centre  of 


Fig.  2.9  (a)  The  apparent  motions  of  stars  during  a  night 
as  seen  from  latitude  0  =  45°.  (b)  The  same  stars  seen 
from  latitude  <fi  =  10° 

the  celestial  sphere  and  dividing  the  sphere  into 
two  hemispheres  along  a  great  circle.  One  of  the 
coordinates  indicates  the  angular  distance  from 
this  reference  plane.  There  is  exactly  one  great 
circle  going  through  the  object  and  intersecting 
this  plane  perpendicularly;  the  second  coordinate 
gives  the  angle  between  that  point  of  intersection 
and  some  fixed  direction. 


2.4  The  Horizontal  System 

The  most  natural  coordinate  frame  from  the  ob¬ 
server’s  point  of  view  is  the  horizontal  frame 
(Fig.  2.9).  Its  reference  plane  is  the  tangent  plane 
of  the  Earth  passing  through  the  observer;  this 
horizontal  plane  intersects  the  celestial  sphere 
along  the  horizon.  The  point  just  above  the  ob¬ 
server  is  called  the  zenith  and  the  antipodal  point 
below  the  observer  is  the  nadir.  (These  two  points 
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are  the  poles  corresponding  to  the  horizon.)  Great 
circles  through  the  zenith  are  called  verticals.  All 
verticals  intersect  the  horizon  perpendicularly. 

By  observing  the  motion  of  a  star  over  the 
course  of  a  night,  an  observer  finds  out  that  it  fol¬ 
lows  a  track  like  one  of  those  in  Fig.  2.9.  Stars  rise 
in  the  east,  reach  their  highest  point,  or  culminate , 
on  the  vertical  NZS,  and  set  in  the  west.  The  ver¬ 
tical  NZS  is  called  the  meridian.  North  and  south 
directions  are  defined  as  the  intersections  of  the 
meridian  and  the  horizon. 

One  of  the  horizontal  coordinates  is  the  alti¬ 
tude  or  elevation ,  a ,  which  is  measured  from  the 
horizon  along  the  vertical  passing  through  the  ob¬ 
ject.  The  altitude  lies  in  the  range  [—90°,  +90°]; 
it  is  positive  for  objects  above  the  horizon  and 
negative  for  the  objects  below  the  horizon.  The 
zenith  distance ,  or  the  angle  between  the  object 
and  the  zenith,  is  obviously 

z  =  90  °-a.  (2.10) 

The  second  coordinate  is  the  azimuth ,  A;  it 
is  the  angular  distance  of  the  vertical  of  the  ob¬ 
ject  from  some  fixed  direction.  Unfortunately, 
in  different  contexts,  different  fixed  directions 
are  used;  thus  it  is  always  advisable  to  check 
which  definition  is  employed.  The  azimuth  is 
usually  measured  from  the  north  or  south,  and 
though  clockwise  is  the  preferred  direction,  coun¬ 
terclockwise  measurements  are  also  occasionally 
used.  In  geography,  navigation  and  many  other 
fields,  the  azimuth  is  measured  from  the  north. 
In  this  book  we  have  adopted  a  fairly  common 
astronomical  convention,  measuring  the  azimuth 
clockwise  from  the  south.  Its  values  are  usually 
normalised  between  0°  and  360°  (or  —180°- 
180°).  The  reason  for  this  definition  is  that  some 
other  important  angles  are  also  measured  from 
the  south. 

In  Fig.  2.9a  we  can  see  the  altitude  and  az¬ 
imuth  of  three  stars  at  some  instant.  As  a  star 
moves  along  its  daily  track,  both  of  its  coordi¬ 
nates  will  change.  Another  difficulty  with  this  co¬ 
ordinate  frame  is  its  local  character.  In  Fig.  2.9b 
we  have  the  same  stars,  but  the  observer  is  now 
further  south.  We  can  see  that  the  coordinates  of 
the  same  star  at  the  same  moment  are  different 


for  different  observers.  Since  the  horizontal  co¬ 
ordinates  are  time  and  position  dependent,  they 
cannot  be  used,  for  instance,  in  star  catalogues. 


2.5  The  Equatorial  System 

The  direction  of  the  rotation  axis  of  the  Earth  re¬ 
mains  almost  constant  and  so  does  the  equato¬ 
rial  plane  perpendicular  to  this  axis.  Therefore  the 
equatorial  plane  is  a  suitable  reference  plane  for 
a  coordinate  frame  that  has  to  be  independent  of 
time  and  the  position  of  the  observer. 

The  intersection  of  the  celestial  sphere  and 
the  equatorial  plane  is  a  great  circle,  which  is 
called  the  equator  of  the  celestial  sphere.  The 
north  pole  of  the  celestial  sphere  is  one  of  the 
poles  corresponding  to  this  great  circle.  It  is  also 
the  point  in  the  northern  sky  where  the  exten¬ 
sion  of  the  Earth’s  rotational  axis  meets  the  ce¬ 
lestial  sphere.  The  celestial  north  pole  is  at  a  dis¬ 
tance  of  about  one  degree  (which  is  equivalent  to 
two  full  moons)  from  the  moderately  bright  star 
Polaris.  The  meridian  always  passes  through  the 
north  pole;  it  is  divided  by  the  pole  into  north  and 
south  meridians. 

The  angular  separation  of  a  star  from  the  equa¬ 
torial  plane  is  not  affected  by  the  rotation  of  the 
Earth.  This  angle  is  called  the  declination  8. 

Stars  seem  to  revolve  around  the  pole  once  ev¬ 
ery  day  (Fig.  2.10).  To  define  the  second  coordi¬ 
nate,  we  must  again  agree  on  a  fixed  direction, 
unaffected  by  the  Earth’s  rotation.  From  a  mathe¬ 
matical  point  of  view,  it  does  not  matter  which 
point  on  the  equator  is  selected.  However,  for 
later  purposes,  it  is  more  appropriate  to  employ 
a  certain  point  with  some  valuable  properties, 
which  will  be  explained  in  the  next  section.  This 
point  is  called  the  vernal  equinox.  Because  it  used 
to  be  in  the  constellation  Aries  (the  Ram),  it  is 
also  called  the  first  point  of  Aries  ant  denoted 
by  the  sign  of  Aries,  T.  Now  we  can  define  the 
second  coordinate  as  the  angle  from  the  vernal 
equinox  measured  along  the  equator.  This  angle 
is  the  right  ascension  a  (or  R.A.)  of  the  object, 
measured  counterclockwise  from  T. 

Since  declination  and  right  ascension  are  in¬ 
dependent  of  the  position  of  the  observer  and  the 
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motions  of  the  Earth,  they  can  be  used  in  star 
maps  and  catalogues.  As  will  be  explained  later, 
in  many  telescopes  one  of  the  axes  (the  hour  axis) 
is  parallel  to  the  rotation  axis  of  the  Earth.  The 
other  axis  (declination  axis)  is  perpendicular  to 
the  hour  axis.  Declinations  can  be  read  imme¬ 
diately  on  the  declination  dial  of  the  telescope. 
But  the  zero  point  of  the  right  ascension  seems 
to  move  in  the  sky,  due  to  the  diurnal  rotation  of 
the  Earth.  So  we  cannot  use  the  right  ascension  to 
find  an  object  unless  we  know  the  direction  of  the 
vernal  equinox. 

Since  the  south  meridian  is  a  well-defined  line 
in  the  sky,  we  use  it  to  establish  a  local  coordinate 
corresponding  to  the  right  ascension.  The  hour 
angle  h  is  measured  clockwise  from  the  meridian. 
The  hour  angle  of  an  object  is  not  a  constant,  but 
grows  at  a  steady  rate,  due  to  the  Earth’s  rotation. 
The  hour  angle  of  the  vernal  equinox  is  called  the 


Fig.  2.10  At  night,  stars  seem  to  revolve  around  the  ce¬ 
lestial  pole.  The  altitude  of  the  pole  from  the  horizon 
equals  the  latitude  of  the  observer.  (Photo  Aimo  Sillanpaa 
and  Pasi  Nurmi) 


sidereal  time  0.  Figure  2.11  shows  that  for  any 
object, 

0=h  +  a,  (2.11) 

where  h  is  the  object’s  hour  angle  and  a  its  right 
ascension. 

Since  hour  angle  and  sidereal  time  change 
with  time  at  a  constant  rate,  it  is  practical  to  ex¬ 
press  them  in  units  of  time.  Also  the  closely  re¬ 
lated  right  ascension  is  customarily  given  in  time 
units.  Thus  24  hours  equals  360  degrees,  1  hour  = 
15  degrees,  1  minute  of  time  =15  minutes  of  arc, 
and  so  on.  All  these  quantities  are  in  the  range 
[0  h,  24  h).  This  custom  also  explains  the  name 
hour  circle ;  it  is  a  great  circle  along  which  the 
hour  angle  (or  right  ascension)  is  constant. 

In  practice,  the  sidereal  time  can  be  readily  de¬ 
termined  by  pointing  the  telescope  to  an  easily 
recognisable  star  and  reading  its  hour  angle  on 
the  hour  angle  dial  of  the  telescope.  The  right  as¬ 
cension  found  in  a  catalogue  is  then  added  to  the 
hour  angle,  giving  the  sidereal  time  at  the  moment 
of  observation.  For  any  other  time,  the  sidereal 
time  can  be  evaluated  by  adding  the  time  elapsed 
since  the  observation.  If  we  want  to  be  accurate, 
we  have  to  use  a  sidereal  clock  to  measure  time 
intervals.  A  sidereal  clock  runs  3  min  56.56  s  fast 
a  day  as  compared  with  an  ordinary  solar  time 


Celestial  pole 


Fig.  2.11  The  relation  between  the  right  ascension  a, 
hour  angle  h  and  sidereal  time  (9 .  The  right  ascension  is 
measured  counterclockwise  from  the  vernal  equinox,  the 
hour  angle  and  sidereal  time  clockwise  from  the  South 
meridian.  The  sidereal  time  is  the  hour  angle  the  vernal 
equinox 
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The  inverse  transformation  is  obtained  by  sub¬ 
stituting 

0  =  90°  —  h,  0  =  8 , 

0/  =  9O  °-A,  0'  =  fl,  (2.15) 

X  —  —(90°  —  0), 

whence 

sin  A  cos  a  =  sin  h  cos  8, 

cos  A  cos  a  =  cos  h  cos  8  sin  0  —  sin  8  cos  0 , 

sin  a  =  cos  h  cos  8  cos  0  +  sin  8  sin  0 . 

(2.16) 


Fig.  2.12  The  nautical  triangle  for  deriving  transforma¬ 
tions  between  the  horizontal  and  equatorial  frames 


clock: 

24  h  solar  time 

=  24  h  3  min  56.56  s  sidereal  time.  (2.12) 

The  reason  for  this  is  the  orbital  motion  of  the 
Earth:  stars  seem  to  move  faster  than  the  Sun 
across  the  sky;  hence,  a  sidereal  clock  must  run 
faster.  (This  is  further  discussed  in  Sect.  2.13.) 

Transformations  between  the  horizontal  and 
equatorial  frames  are  easily  obtained  from  spher¬ 
ical  trigonometry.  Comparing  Figs.  2.6  and  2.12, 
we  find  that  we  must  make  the  following  substi¬ 
tutions  into  (2.5): 


Since  the  altitude  and  declination  are  in  the 
range  [—90°,  +90°],  it  suffices  to  know  the  sine 
of  one  of  these  angles  to  determine  the  other  an¬ 
gle  unambiguously.  Azimuth  and  right  ascension, 
however,  can  have  any  value  from  0°  to  360°  (or 
from  0  h  to  24  h),  and  to  solve  for  them,  we  have 
to  know  both  the  sine  and  cosine  to  choose  the 
correct  quadrant. 

The  altitude  of  an  object  is  greatest  when  it 
is  on  the  south  meridian  (the  great  circle  arc  be¬ 
tween  the  celestial  poles  containing  the  zenith). 
At  that  moment  (called  upper  culmination ,  or 
transit )  its  hour  angle  is  0  h.  At  the  lower  culmi¬ 
nation  the  hour  angle  is  h  =  12  h.  When  h  =  Oh, 
we  get  from  the  last  equation  in  (2.16) 

sin  a  =  cos  8  cos  0  +  sin  8  sin  0 

=  cos(0  —  5)  =  sin(90°  —0  +  5). 

Thus  the  altitude  at  the  upper  culmination  is 


0  =  90 °  -  A, 
1/0  =  90 °  -h, 


0  =  a, 
O' =  8 , 


(2.13) 

X  =  90°  —  <p. 


The  angle  0  in  the  last  equation  is  the  altitude  of 
the  celestial  pole,  or  the  latitude  of  the  observer. 
Making  the  substitutions,  we  get 


sin  h  cos  8  =  sin  A  cos  a, 

cos  h  cos  8  =  cos  A  cos  a  sin  0  +  sin  a  cos  0 , 

sin  8  =  —  cos  A  cos  a  cos  0  +  sin  a  sin  0 . 

(2.14) 


90°  —0  +  5,  if  the  object  culminates 

south  of  zenith, 

+nax  —  * 

90°  +0  —  5,  if  the  object  culminates 

north  of  zenith. 

(2.17) 

The  altitude  is  positive  for  objects  with  5  > 
0  —  90°.  Objects  with  declinations  less  than  0  — 
90°  can  never  be  seen  at  the  latitude  0.  On  the 
other  hand,  when  h  =  12  h  we  have 


sin  a  =  —  cos  5  cos  0  +  sin  5  sin  0 

=  —  cos(5  +  0)  =  sin(5  +  0  —  90°), 
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and  the  altitude  at  the  lower  culmination  is 

$min  =  <5  +  0  —  90  .  (2.18) 

Stars  with  8  >  90°  —  0  will  never  set.  For  exam¬ 
ple,  in  Helsinki  (0  ~  60°),  all  stars  with  a  decli¬ 
nation  higher  than  30°  are  such  circumpolar  stars. 
And  stars  with  a  declination  less  than  —30°  can 
never  be  observed  there. 

We  shall  now  study  briefly  how  the  (a,  8)  fra¬ 
me  can  be  established  by  observations.  Suppose 
we  observe  a  circumpolar  star  at  its  upper  and 
lower  culmination  (Fig.  2.13).  At  the  upper  tran¬ 
sit,  its  altitude  is  amSLX  =  90°  —  0  +  5  and  at  the 
lower  transit,  am jn  =  8  +  0  —  90° .  Eliminating  the 
latitude,  we  get 

1 

5  —  ~(amm  +  ^max)-  (2.19) 

Thus  we  get  the  same  value  for  the  declination, 
independent  of  the  observer’s  location.  Therefore 
we  can  use  it  as  one  of  the  absolute  coordinates. 
From  the  same  observations,  we  can  also  deter¬ 
mine  the  direction  of  the  celestial  pole  as  well  as 
the  latitude  of  the  observer.  After  these  prepara¬ 
tions,  we  can  find  the  declination  of  any  object  by 
measuring  its  distance  from  the  pole. 

The  equator  can  be  now  defined  as  the  great 
circle  all  of  whose  points  are  at  a  distance  of  90° 
from  the  pole.  The  zero  point  of  the  second  co¬ 
ordinate  (right  ascension)  can  then  be  defined  as 
the  point  where  the  Sun  seems  to  cross  the  equa¬ 
tor  from  south  to  north. 

In  practice  the  situation  is  more  complicated, 
since  the  direction  of  Earth’s  rotation  axis  changes 


Fig.  2.1 3  The  altitude  of  a  circumpolar  star  at  upper  and 
lower  culmination 


due  to  perturbations.  Therefore  the  equatorial  co¬ 
ordinate  frame  is  nowadays  defined  using  cer¬ 
tain  standard  objects  the  positions  of  which 
are  known  very  accurately.  The  best  accuracy 
is  achieved  by  using  the  most  distant  objects, 
quasars  (Sect.  19.7),  which  remain  in  the  same 
direction  over  very  long  intervals  of  time. 


2.6  Rising  and  Setting  Times 

From  the  last  equation  (2.16),  we  find  the  hour 
angle  h  of  an  object  at  the  moment  its  altitude 
is  a: 

sin  a 

cosh  =  —  tan 5 tan0  -| - .  (2.20) 

cos  8  cos  0 

This  equation  can  be  used  for  computing  rising 
and  setting  times.  Then  a  =  0  and  the  hour  an¬ 
gles  corresponding  to  rising  and  setting  times  are 
obtained  from 

cos  h  =  —  tan  5  tan  0 .  (2.21) 

If  the  right  ascension  a  is  known,  we  can  use 
(2.11)  to  compute  the  sidereal  time  (9.  (Later,  in 
Sect.  2.14,  we  shall  study  how  to  transform  the 
sidereal  time  to  ordinary  time.) 

If  higher  accuracy  is  needed,  we  have  to  cor¬ 
rect  for  the  refraction  of  light  caused  by  the  at¬ 
mosphere  of  the  Earth  (see  Sect.  2.9).  In  that 
case,  we  must  use  a  small  negative  value  for  a 
in  (2.20).  This  value,  the  horizontal  refraction ,  is 
about  —34'. 

The  rising  and  setting  times  of  the  Sun  given 
in  almanacs  refer  to  the  time  when  the  upper 
edge  of  the  Solar  disk  just  touches  the  horizon. 
To  compute  these  times,  we  must  set  a  =  —  50r 
(=  —  34r  -  160. 

Also  for  the  Moon  almanacs  give  rising  and 
setting  times  of  the  upper  edge  of  the  disk.  Since 
the  distance  of  the  Moon  varies  considerably,  we 
cannot  use  any  constant  value  for  the  radius  of  the 
Moon,  but  it  has  to  be  calculated  separately  each 
time.  The  Moon  is  also  so  close  that  its  direction 
with  respect  to  the  background  stars  varies  due  to 
the  rotation  of  the  Earth.  Thus  the  rising  and  set¬ 
ting  times  of  the  Moon  are  defined  as  the  instants 
when  the  altitude  of  the  Moon  is  —  34r  —  s  +  tv. 
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Fig.  2.14  The  ecliptic  or  the  apparent  orbit  of  the  Sun 
resembles  somewhat  a  sine  curve  in  the  equatorial  coordi¬ 
nate  frame.  The  ecliptic  passes  through  13  constellations. 
Since  many  of  them  have  been  named  after  animals,  the 
neighbourhood  of  the  ecliptic  is  called  the  zodiac.  As¬ 
trologers  divide  the  ecliptic  into  12  equal  signs  of  30° 

where  s  is  the  apparent  radius  (15.57  on  the  aver¬ 
age)  and  jt  the  parallax  (57r  on  the  average).  The 
latter  quantity  is  explained  in  Sect.  2.9. 

Finding  the  rising  and  setting  times  of  the 
Sun,  planets  and  especially  the  Moon  is  compli¬ 
cated  by  their  motion  with  respect  to  the  stars. 
We  can  use,  for  example,  the  coordinates  for  the 
noon  to  calculate  estimates  for  the  rising  and  set¬ 
ting  times,  which  can  then  be  used  to  interpolate 
more  accurate  coordinates  for  the  rising  and  set¬ 
ting  times.  When  these  coordinates  are  used  to 
compute  new  times  a  pretty  good  accuracy  can 
be  obtained.  The  iteration  can  be  repeated  if  even 
higher  precision  is  required. 


2.7  The  Ecliptic  System 

The  orbital  plane  of  the  Earth,  the  ecliptic ,  is  the 
reference  plane  of  another  important  coordinate 
frame.  The  ecliptic  can  also  be  defined  as  the 
great  circle  on  the  celestial  sphere  described  by 
the  Sun  in  the  course  of  one  year.  This  frame  is 
used  mainly  for  planets  and  other  bodies  of  the 
solar  system.  The  orientation  of  the  Earth’s  equa¬ 
torial  plane  remains  invariant,  unaffected  by  an¬ 
nual  motion.  In  spring,  the  Sun  appears  to  move 
from  the  southern  hemisphere  to  the  northern  one 
(Fig.  2.14).  The  time  of  this  remarkable  event 
as  well  as  the  direction  to  the  Sun  at  that  mo¬ 
ment  are  called  the  vernal  equinox.  At  the  vernal 


each.  (Ophiuchus  is  unknown  to  them.)  Due  to  the  preces¬ 
sion  of  the  coordinate  frame  the  astrological  signs  and  the 
constellations  with  the  same  name  are  no  more  aligned. 
Also  the  sizes  of  the  actual  constellations  are  quite  differ¬ 
ent 


Fig.  2.15  The  ecliptic  geocentric  (k,  ft)  and  heliocentric 
(k' , /3')  coordinates  are  equal  only  if  the  object  is  very 
far  away.  The  geocentric  coordinates  depend  also  on  the 
Earth’s  position  in  its  orbit 

equinox,  the  Sun’s  right  ascension  and  declina¬ 
tion  are  zero.  The  equatorial  and  ecliptic  planes 
intersect  along  a  straight  line  directed  towards  the 
vernal  equinox.  Thus  we  can  use  this  direction  as 
the  zero  point  for  both  the  equatorial  and  ecliptic 
coordinate  frames.  The  point  opposite  the  vernal 
equinox  is  the  autumnal  equinox ,  it  is  the  point  at 
which  the  Sun  crosses  the  equator  from  north  to 
south. 

The  ecliptic  latitude  p  is  the  angular  distance 
from  the  ecliptic;  it  is  in  the  range  [—90°,  +90°]. 
The  other  coordinate  is  the  ecliptic  longitude  A, 
measured  counterclockwise  from  the  vernal 
equinox  (Fig.  2.15). 

Transformation  equations  between  the  equa¬ 
torial  and  ecliptic  frames  can  be  derived  analo- 
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gously  to  (2.14)  and  (2.16): 

sin  A  cos  p  =  sin  8  sin  s  +  cos  8  cos  s  sin  a, 
cos  A  cos  p  =  cos  8  cos  a,  (2.22) 

sin  P  =  sin  8  cos  s  —  cos  8  sin  s  sin  a , 

sin  a  cos  8  =  —  sin  p  sin  e  +  cos  p  cos  s  sin  A , 
cos  a  cos  5  =  cos  A  cos /3,  (2.23) 

sin  8  =  sin  p  cos  s  +  cos  P  sin  s  sin  A . 

The  angle  s  appearing  in  these  equations  is 
the  obliquity  of  the  ecliptic ,  or  the  angle  between 
the  equatorial  and  ecliptic  planes.  Its  value  is 
roughly  23°26r  (a  more  accurate  value  is  given 
in  Box  2.1). 

Depending  on  the  problem  to  be  solved,  we 
may  encounter  heliocentric  (origin  at  the  Sun), 
geocentric  (origin  at  the  centre  of  the  Earth)  or 
topo centric  (origin  at  the  observer)  coordinates. 
For  very  distant  objects  the  differences  are  neg¬ 
ligible,  but  not  for  bodies  of  the  solar  system. 
To  transform  heliocentric  coordinates  to  geocen¬ 
tric  coordinates  or  vice  versa,  we  must  also  know 
the  distance  of  the  object.  This  transformation  is 
most  easily  accomplished  by  computing  the  rect¬ 
angular  coordinates  of  the  object  and  the  new  ori¬ 
gin,  then  changing  the  origin  and  finally  evaluat¬ 
ing  the  new  latitude  and  longitude  from  the  rect¬ 
angular  coordinates  (see  Examples  2.5  and  2.6). 


2.8  The  Galactic  Coordinates 

For  studies  of  the  Milky  Way  Galaxy,  the  most 
natural  reference  plane  is  the  plane  of  the  Milky 
Way  (Fig.  2.16).  Since  the  Sun  lies  very  close 
to  that  plane,  we  can  put  the  origin  at  the  Sun. 
The  galactic  longitude  l  is  measured  counter¬ 
clockwise  (like  right  ascension)  from  the  direc¬ 
tion  of  the  centre  of  the  Milky  Way  (in  Sagit¬ 
tarius,  o'  =  17  h  45.7  min,  8  =  — 29°00/).  The 


Celestial 

Galactic 

b 

pole 

centre 

Sun 

Fig.  2.16  The  galactic  coordinates  /  and  b 


galactic  latitude  b  is  measured  from  the  galac¬ 
tic  plane,  positive  northwards  and  negative  south¬ 
wards.  This  definition  was  officially  adopted  only 
in  1959,  when  the  direction  of  the  galactic  cen¬ 
tre  was  determined  from  radio  observations  ac¬ 
curately  enough.  The  old  galactic  coordinates  Z1 
and  bl  had  the  intersection  of  the  equator  and  the 
galactic  plane  as  their  zero  point. 

The  galactic  coordinates  can  be  obtained  from 
the  equatorial  ones  with  the  transformation  equa¬ 
tions 

sin(/N  —  /)  cos  b  =  cos  8  sin  (a  —  cq>) , 
cos(/n  —  l)  cos  b  =  —  cos  8  sin  8 p  cos(a  —  o?p) 

+  sin  8  cos  <$p, 

sin  b  =  cos  8  cos  5p  cos(a  —  a?p) 

+  sin  8  sin  <$p, 

(2.24) 

where  the  direction  of  the  Galactic  north  pole  is 
ap  =  12  h  5 1 .4  min,  5p  =  27°08/,  and  the  galactic 
longitude  of  the  celestial  pole,  /n  =  123.0°. 


2.9  Perturbations  of  Coordinates 

Even  if  a  star  remains  fixed  with  respect  to  the 
Sun,  its  coordinates  can  change,  due  to  several 
disturbing  effects.  Naturally  its  altitude  and  az¬ 
imuth  change  constantly  because  of  the  rotation 
of  the  Earth,  but  even  its  right  ascension  and  dec¬ 
lination  are  not  quite  free  from  perturbations. 

Precession  The  Earth  is  not  quite  spherical  but 
slightly  flattened.  Since  most  of  the  members  of 
the  solar  system  orbit  close  to  the  ecliptic,  they 
tend  to  pull  the  equatorial  bulge  of  the  Earth 
towards  it.  Most  of  this  “flattening”  torque  is 
caused  by  the  Moon  and  the  Sun.  But  the  Earth 
is  rotating  and  therefore  the  torque  cannot  change 
the  inclination  of  the  equator  relative  to  the  eclip¬ 
tic.  Instead,  the  rotation  axis  turns  in  a  direction 
perpendicular  to  the  axis  and  the  torque,  thus  de¬ 
scribing  a  cone  once  in  roughly  26,000  years. 
This  slow  turning  of  the  rotation  axis  is  called 
precession  (Fig.  2.17).  Because  of  precession,  the 
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Fig.  2.1 7  Due  to 

precession  the  celestial 
pole  moves  and  makes  a 
full  revolution  in  about 
26,000  years.  For  a  while  it 
is  still  approaching  Polaris, 
but  will  then  begin  to 
recede.  In  about  14,000  it 
will  be  close  to  Vega. 
Somewhat  over  2000  years 
BC  the  pole  was  close  to 
the  a  star  of  Drace,  Thuban 


vernal  equinox  moves  along  the  ecliptic  clock¬ 
wise  about  50  seconds  of  arc  every  year,  thus  in¬ 
creasing  the  ecliptic  longitudes  of  all  objects  at 
the  same  rate.  At  present  the  rotation  axis  points 
about  one  degree  away  from  Polaris,  but  after 
12,000  years,  the  celestial  pole  will  be  roughly  in 
the  direction  of  Vega.  The  changing  ecliptic  lon¬ 
gitudes  also  affect  the  right  ascension  and  decli¬ 
nation.  Thus  we  have  to  know  the  instant  of  time, 
or  epoch ,  for  which  the  coordinates  are  given. 

Currently  most  maps  and  catalogues  use  the 
epoch  J2000.0,  which  means  the  beginning  of 
the  year  2000,  or,  to  be  exact,  the  noon  of  Jan¬ 
uary  1,  2000,  or  the  Julian  date  2,451,545.0  (see 
Sect.  2.15). 

Let  us  now  derive  expressions  for  the  changes 
in  right  ascension  and  declination.  Taking  the  last 
transformation  equation  in  (2.23), 

sin  8  =  cos  £  sin  +  sin  s  cos  sin  A , 

and  differentiating,  we  get 

cos  8  d5  =  sin  s  cos  /3  cos  A  dX . 

Applying  the  second  equation  in  (2.22)  to  the 
right-hand  side,  we  have,  for  the  change  in  dec¬ 


lination, 

d5  =  dA  sin  s  cos  a.  (2.25) 

By  differentiating  the  equation 

cos  a  cos  8  =  cos  /3  cos  A , 

we  get 

— sina  cos  8  da  —  cos  a  sin  8  d5 
=  —  cos  p  sinAdA; 

and,  by  substituting  the  previously  obtained  ex¬ 
pression  for  d5  and  applying  the  first  equation 
(2.22),  we  have 

sin  a  cos  8  da 

=  dA  (cos  (3  sin  A  —  sin  s  cos2  a  sin  5) 

=  dA  (sin  8  sin  s  +  cos  8  cos  s  sin  a 

—sins  cos2  a  sin  5). 

Simplifying  this,  we  get 

da  =  dA(sina  sin£tan<$  +  coss).  (2.26) 

If  dA  is  the  annual  increment  of  the  ecliptic  lon¬ 
gitude  (about  50"),  the  precessional  changes  in 
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Table  2.1  Precession  constants  m  and  n.  Here,  “a”  means 
a  tropical  year 


Epoch 

m 

n 

1800 

3.07048  s/a 

1.33703  s/a  = 

20.05547a 

1850 

3.07141 

1.33674 

20.0511 

1900 

3.07234 

1.33646 

20.0468 

1950 

3.07327 

1.33617 

20.0426 

2000 

3.07419 

1.33589 

20.0383 

right  ascension  and  declination  in  one  year  are 
thus 


Ecliptic  north  pole 


d<$  =  dX  sins  cos  of, 

da  =  dX  (sin  s  sin  a  tan  8  +  cos  s ) . 


(2.27) 


Fig.  2.1 8  Due  to  precession  the  rotation  axis  of  the  Earth 
turns  around  the  ecliptic  north  pole.  Nutation  is  the  small 
wobble  disturbing  the  smooth  precessional  motion.  In  this 
figure  the  magnitude  of  the  nutation  is  highly  exaggerated 


These  expressions  are  usually  written  in  the  form 


d 8  =  n  cos  a, 

da  =  m  +  n  sina  tan 5, 


(2.28) 


where 


m  =  dAcos  s, 
n  =  dX  sin  s 


(2.29) 


are  the  precession  constants.  Since  the  obliq¬ 
uity  of  the  ecliptic  is  not  exactly  a  constant  but 
changes  with  time,  m  and  n  also  vary  slowly  with 
time.  However,  this  variation  is  so  slow  that  usu¬ 
ally  we  can  regard  m  and  n  as  constants  unless 
the  time  interval  is  very  long.  The  values  of  these 
constants  for  some  epochs  are  given  in  Table  2.1. 
For  intervals  longer  than  a  few  decades  a  more 
rigorous  method  should  be  used.  Its  derivation  ex¬ 
ceeds  the  level  of  this  book,  but  the  necessary  for¬ 
mulas  are  given  in  Box  2.1. 


Nutation  The  Moon’s  orbit  is  inclined  with  re¬ 
spect  to  the  ecliptic,  resulting  in  precession  of 
its  orbital  plane.  One  revolution  takes  18.6  years, 
producing  perturbations  with  the  same  period  in 
the  precession  of  the  Earth.  This  effect,  nutation , 
changes  ecliptic  longitudes  as  well  as  the  obliq¬ 
uity  of  the  ecliptic  (Fig.  2.18).  Calculations  are 
now  much  more  complicated,  but  fortunately  nu- 
tational  perturbations  are  relatively  small,  only 
fractions  of  an  arc  minute.  Thus  they  can  often  be 


calculated  using  the  simple  formulas  of  Box  2.1 
or  just  omitted. 

Parallax  If  we  observe  an  object  from  different 
points,  we  see  it  in  different  directions.  The  dif¬ 
ference  of  the  observed  directions  (as  well  as  the 
whole  phenomenon)  is  called  the  parallax.  Since 
the  amount  of  parallax  depends  on  the  distance 
of  the  observer  from  the  object,  we  can  utilise 
the  parallax  to  measure  distances.  Human  stereo¬ 
scopic  vision  is  based  (at  least  to  some  extent) 
on  this  effect.  For  astronomical  purposes  we  need 
much  longer  baselines  than  the  distance  between 
our  eyes  (about  7  cm).  Appropriately  large  and 
convenient  baselines  are  the  radius  of  the  Earth 
and  the  radius  of  its  orbit. 

Distances  to  the  nearest  stars  can  be  deter¬ 
mined  from  the  annual  parallax ,  which  is  the  an¬ 
gle  subtended  by  the  radius  of  the  Earth’s  orbit 
(called  the  astronomical  unit ,  au)  as  seen  from  the 
star.  (We  shall  discuss  this  further  in  Sect.  2.10.) 

By  diurnal  parallax  we  mean  the  change  of 
direction  due  to  the  daily  rotation  of  the  Earth.  In 
addition  to  the  distance  of  the  object,  the  diurnal 
parallax  also  depends  on  the  latitude  of  the  ob¬ 
server.  If  we  talk  about  the  parallax  of  a  body  in 
our  solar  system,  we  always  mean  the  angle  sub¬ 
tended  by  the  Earth’s  equatorial  radius  (6378  km) 
as  seen  from  the  object  (Fig.  2.19).  This  equals 
the  apparent  shift  of  the  object  with  respect  to  the 
background  stars  seen  by  an  observer  at  the  equa¬ 
tor  if  (s)he  observes  the  object  moving  from  the 
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Fig.  2.19  The  horizontal 
parallax  n  of  an  object  is 
the  angle  subtended  by  the 
Earth’s  equatorial  radius  as 
seen  from  the  object 


Fig.  2.20  The  effect  of  aberration  on  the  apparent  direc¬ 
tion  of  an  object,  (a)  Observer  at  rest,  (b)  Observer  in  mo¬ 
tion 

horizon  to  the  zenith.  The  parallax  of  the  Moon, 
for  example,  is  about  57r,  and  that  of  the  Sun 
8.79". 

In  astronomy  parallax  may  also  refer  to  dis¬ 
tance  in  general,  even  if  it  is  not  measured  using 
the  shift  in  the  observed  direction. 

Aberration  Because  of  the  finite  speed  of  light, 
an  observer  in  motion  sees  an  object  shifted  in 
the  direction  of  her/his  motion  (Fig.  2.20).  This 
change  of  apparent  direction  is  called  the  aber¬ 
ration.  To  derive  the  exact  value  we  have  to  use 
the  special  theory  of  relativity,  but  for  practical 
purposes  it  suffices  to  use  the  approximate  value 

a  =  -  sin#,  [a]  =  rad,  (2.30) 
c 


Fig.  2.21  A  telescope  is  pointed  in  the  true  direction  of 
a  star.  It  takes  a  time  t  =  l/c  for  the  light  to  travel  the 
length  of  the  telescope.  The  telescope  is  moving  with  ve¬ 
locity  v,  which  has  a  component  v  sin#,  perpendicular  to 
the  direction  of  the  light  beam.  The  beam  will  hit  the  bot¬ 
tom  of  the  telescope  displaced  from  the  optical  axis  by 
a  distance  x  =  tv  sinO  =  l  (v/c )  sin#.  Thus  the  change  of 
direction  in  radians  is  a  =  x/ 1  =  (v/c)  sin  0 

where  v  is  the  velocity  of  the  observer,  c  is  the 
speed  of  light  and  6  is  the  angle  between  the 
true  direction  of  the  object  and  the  velocity  vector 
of  the  observer  (Fig  2.21).  The  greatest  possible 
value  of  the  aberration  due  to  the  orbital  motion 
of  the  Earth,  v/c,  called  the  aberration  constant , 
is  21".  The  maximal  shift  due  to  the  Earth’s  ro¬ 
tation,  the  diurnal  aberration  constant,  is  much 
smaller,  about  0.3". 

Refraction  Since  light  is  refracted  by  the  at¬ 
mosphere,  the  direction  of  an  object  differs  from 
the  true  direction  by  an  amount  depending  on  the 
atmospheric  conditions  along  the  line  of  sight. 
Since  this  refraction  varies  with  atmospheric 
pressure  and  temperature,  it  is  very  difficult  to 
predict  it  accurately.  However,  an  approximation 
good  enough  for  most  practical  purposes  is  eas¬ 
ily  derived.  If  the  object  is  not  too  far  from  the 
zenith,  the  atmosphere  between  the  object  and 
the  observer  can  be  approximated  by  a  stack  of 
parallel  planar  layers,  each  of  which  has  a  certain 
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index  of  refraction  (Fig.  2.22).  Outside  the  at¬ 
mosphere,  we  have  n  =  1 . 

Let  the  true  zenith  distance  be  z  and  the  appar¬ 
ent  one,  f .  Using  the  notations  of  Fig.  2.22,  we 
obtain  the  following  equations  for  the  boundaries 
of  the  successive  layers: 

sinz  =  nk  sin  Zk, 

ri2  sinz2  =  n\  sinzi, 
n  \  sinzi  =  no  sinf . 

Simplifying  this  set  of  equations  we  get 

sinz  =  no  sin£ .  (2.31) 

When  the  refraction  angle  R  =  z  —  £  is  small  and 
is  expressed  in  radians,  we  have 

no  sin£  =  sinz  =  sin (R  +  f ) 

=  sin  R  cos  £  +  cos  R  sin  f 
~  R  cos  f  +  sinf . 

Thus  we  get 

7?  =  (no  —  1)  tan  f ,  [7?]  =  rad.  (2.32) 

The  index  of  refraction  depends  on  the  density 
of  the  air,  which  further  depends  on  the  pressure 


Fig.  2.22  Refraction  of  a  light  ray  travelling  through  the 
atmosphere.  If  the  altitude  is  not  very  low  we  can  think 
that  the  atmosphere  consists  of  planar  layers  each  of  which 
has  its  own  index  of  refraction.  Outside  the  atmosphere  the 
index  is  n  =  1  and  increases  towards  the  ground 


and  temperature.  When  the  altitude  is  over  15°, 
we  can  use  an  approximate  formula 

R  =  — - - 0.00452°  tan(90°  -  a) ,  (2.33) 

273 +  T 

where  a  is  the  altitude  in  degrees,  T  tempera¬ 
ture  in  degrees  Celsius,  and  P  the  atmospheric 
pressure  in  hectopascals  (or,  equivalently,  in  mil¬ 
libars).  At  lower  altitudes  the  curvature  of  the  at¬ 
mosphere  must  be  taken  into  account.  An  approx¬ 
imate  formula  for  the  refraction  is  then 

P  0.1594  + 0.0196a  +  0.00002a2 

_ _ _ | _ 

“  273  +  r  1  +0.505(3+ 0.0845a2 

(2.34) 

These  formulas  are  widely  used,  although  they 
are  against  the  rules  of  dimensional  analysis. 
To  get  correct  values,  all  quantities  must  be  ex¬ 
pressed  in  correct  units.  Figure  2.23  shows  the 
refraction  under  different  conditions  evaluated 
from  these  formulas. 

Altitude  is  always  (except  very  close  to  zenith) 
increased  by  refraction.  On  the  horizon  the  change 
is  about  34r,  which  is  slightly  more  than  the  di¬ 
ameter  of  the  Sun.  When  the  lower  limb  of  the 
Sun  just  touches  the  horizon,  the  Sun  has  in  real¬ 
ity  already  set. 

Light  coming  from  the  zenith  is  not  refracted 
at  all  if  the  boundaries  between  the  layers  are  hor¬ 
izontal.  Under  some  climatic  conditions,  a  bound¬ 
ary  (e.g.  between  cold  and  warm  layers)  can  be 
slanted,  and  in  this  case,  there  can  be  a  small 


Fig.  2.23  Refraction  at  different  altitudes.  The  refraction 
angle  R  tells  how  much  higher  the  object  seems  to  be  com¬ 
pared  with  its  true  altitude  a.  Refraction  depends  on  the 
density  and  thus  on  the  pressure  and  temperature  of  the 
air.  The  upper  curves  give  the  refraction  at  sea  level  dur¬ 
ing  rather  extreme  weather  conditions.  At  the  altitude  of 
2.5  kilometers  the  average  pressure  is  only  700  hPa,  and 
thus  the  effect  of  refraction  smaller  ( lowest  curve) 
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zenith  refraction,  which  is  of  the  order  of  a  few 
arc  seconds. 

Stellar  positions  given  in  star  catalogues  are 
mean  places ,  from  which  the  effects  of  parallax, 
aberration  and  nutation  have  been  removed.  The 
mean  place  of  the  date  (i.e.  at  the  observing  time) 
is  obtained  by  correcting  the  mean  place  for  the 
proper  motion  of  the  star  (Sect.  2.10)  and  preces¬ 
sion.  The  apparent  place  is  obtained  by  correct¬ 
ing  this  place  further  for  nutation,  parallax  and 
aberration.  There  is  a  catalogue  published  annu¬ 
ally  that  gives  the  apparent  places  of  certain  ref¬ 
erences  stars  at  intervals  of  a  few  days.  These  po¬ 
sitions  have  been  corrected  for  precession,  nuta¬ 
tion,  parallax  and  annual  aberration.  The  effects 
of  diurnal  aberration  and  refraction  are  not  in¬ 
cluded  because  they  depend  on  the  location  of  the 
observer. 


2.1 0  Positional  Astronomy 

The  position  of  a  star  can  be  measured  either  with 
respect  to  some  reference  stars  (relative  astrome¬ 
try)  or  with  respect  to  a  fixed  coordinate  frame 
(absolute  astrometry). 

Absolute  coordinates  are  usually  determined 
using  a  meridian  circle ,  which  is  a  telescope 
that  can  be  turned  only  in  the  meridional  plane 
(Fig.  2.27).  It  has  only  one  axis,  which  is  aligned 
exactly  in  the  east-west  direction.  Since  all  stars 
cross  the  meridian  in  the  course  of  a  day,  they  all 
come  to  the  field  of  the  meridian  circle  at  some 
time  or  other.  When  a  star  culminates,  its  altitude 
and  the  time  of  the  transit  are  recorded.  If  the  time 
is  determined  with  a  sidereal  clock,  the  sidereal 
time  immediately  gives  the  right  ascension  of  the 
star,  since  the  hour  angle  is  h  =  0  h.  The  other 
coordinate,  the  declination  5,  is  obtained  from  the 
altitude: 

8  =  a-( 90° -0), 

where  a  is  the  observed  altitude  and  0  is  the  geo¬ 
graphic  latitude  of  the  observatory. 

Relative  coordinates  are  measured  on  photo¬ 
graphic  plates  (Fig.  2.25)  or  CCD  images  con¬ 
taining  some  known  reference  stars.  The  scale  of 
the  plate  as  well  as  the  orientation  of  the  coordi- 


Fig.  2.24  The  trigonometric  parallax  jt  of  a  star  S  is  the 
angle  subtended  by  the  radius  of  the  orbit  of  the  Earth,  or 
one  astronomical  unit,  as  seen  from  the  star 

nate  frame  can  be  determined  from  the  reference 
stars.  After  this  has  been  done,  the  right  ascen¬ 
sion  and  declination  of  any  object  in  the  image 
can  be  calculated  if  its  coordinates  in  the  image 
are  measured. 

All  stars  in  a  small  field  are  almost  equally  af¬ 
fected  by  the  dominant  perturbations,  precession, 
nutation,  and  aberration.  The  much  smaller  effect 
of  parallax,  on  the  other  hand,  changes  the  rela¬ 
tive  positions  of  the  stars. 

The  shift  in  the  direction  of  a  star  with  respect 
to  distant  background  stars  due  to  the  annual  mo¬ 
tion  of  the  Earth  is  called  the  trigonometric  par¬ 
allax  of  the  star.  It  gives  the  distance  of  the  star: 
the  smaller  the  parallax,  the  farther  away  the  star 
is.  Trigonometric  parallax  is,  in  fact,  the  only  di¬ 
rect  method  we  currently  have  of  measuring  dis¬ 
tances  to  stars.  Later  we  shall  be  introduced  to 
some  other,  indirect  methods,  which  require  cer¬ 
tain  assumptions  on  the  motions  or  structure  of 
stars.  The  same  method  of  triangulation  is  em¬ 
ployed  to  measure  distances  of  earthly  objects. 
To  measure  distances  to  stars,  we  have  to  use  the 
longest  baseline  available,  the  diameter  of  the  or¬ 
bit  of  the  Earth. 

During  the  course  of  one  year,  a  star  will  ap¬ 
pear  to  describe  a  circle  if  it  is  at  the  pole  of  the 
ecliptic,  a  segment  of  line  if  it  is  in  the  ecliptic,  or 
an  ellipse  otherwise.  The  semimajor  axis  of  this 
ellipse  is  called  the  parallax  of  the  star.  It  is  usu¬ 
ally  denoted  by  n .  It  equals  the  angle  subtended 
by  the  radius  of  the  Earth’s  orbit  as  seen  from  the 
star  (Fig.  2.24). 
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The  unit  of  distance  used  in  astronomy  is  par¬ 
sec  (pc).  At  a  distance  of  one  parsec,  one  astro¬ 
nomical  unit  subtends  an  angle  of  one  arc  sec¬ 
ond.  Since  one  radian  is  about  206,265",  1  pc 
equals  206,265  au.  Furthermore,  because  1  au  = 
1.496  x  1011  m,  1  pc  ^  3.086  x  1016  m.  If  the 
parallax  is  given  in  arc  seconds,  the  distance  is 
simply 


r  =  l/n,  [r]=pc,  [tv]  =  ".  (2.35) 

In  popular  astronomical  texts,  distances  are 
usually  given  in  light-years ,  one  light-year  be¬ 
ing  the  distance  light  travels  in  one  year,  or  9.5  x 
10 15  m.  Thus  one  parsec  is  about  3.26  light-years. 

The  first  parallax  measurement  was  accom¬ 
plished  by  Friedrich  Wilhelm  Bessel  (1784-1846) 
in  1838.  He  found  the  parallax  of  61  Cygni  to  be 
0.3".  The  nearest  star  Proxima  Centauri  has  a  par¬ 
allax  of  0.762"  and  thus  a  distance  of  1.31  pc. 

In  addition  to  the  motion  due  to  the  annual  par¬ 
allax,  many  stars  seem  to  move  slowly  in  a  direc¬ 
tion  that  does  not  change  with  time.  This  effect  is 
caused  by  the  relative  motion  of  the  Sun  and  the 
stars  through  space;  it  is  called  the  proper  motion. 
The  appearance  of  the  sky  and  the  shapes  of  the 
constellations  are  constantly,  although  extremely 
slowly,  changed  by  the  proper  motions  of  the  stars 
(Fig.  2.26). 

The  velocity  of  a  star  with  respect  to  the  Sun 
can  be  divided  into  two  components  (Fig.  2.28), 
one  of  which  is  directed  along  the  line  of  sight 
(the  radial  component  or  the  radial  velocity ),  and 
the  other  perpendicular  to  it  (the  tangential  com¬ 
ponent).  The  tangential  velocity  results  in  the 
proper  motion,  which  can  be  measured  by  taking 
pictures  at  intervals  of  several  years  or  decades. 
The  proper  motion  p  has  two  components,  one 
giving  the  change  in  declination  /is  and  the  other, 
in  right  ascension,  pLa  cos  8.  The  coefficient  cos  8 
is  used  to  correct  the  scale  of  right  ascension: 
hour  circles  (the  great  circles  with  a  =  constant) 
approach  each  other  towards  the  poles,  so  the  co¬ 
ordinate  difference  must  be  multiplied  by  cos  8 
to  obtain  the  true  angular  separation.  The  total 
proper  motion  is 


The  greatest  known  proper  motion  belongs  to 
Barnard’s  Star,  which  moves  across  the  sky  at  the 
enormous  speed  of  10.3  arc  seconds  per  year.  It 
needs  less  than  200  years  to  travel  the  diameter 
of  a  full  moon. 

In  order  to  measure  proper  motions,  we  must 
observe  stars  for  a  long  time.  The  radial  compo¬ 
nent,  on  the  other  hand,  is  readily  obtained  from 
a  single  observation,  thanks  to  the  Doppler  effect. 
By  the  Doppler  effect  we  mean  the  change  in  fre¬ 
quency  and  wavelength  of  radiation  due  to  the  ra¬ 
dial  velocity  of  the  radiation  source.  The  same  ef¬ 
fect  can  be  observed,  for  example,  in  the  sound 
of  an  ambulance,  the  pitch  being  higher  when  the 
ambulance  is  approaching  and  lower  when  it  is 
receding. 

The  formula  for  the  Doppler  effect  for  small 
velocities  can  be  derived  as  in  Fig.  2.29.  The 
source  of  radiation  transmits  electromagnetic 
waves,  the  period  of  one  cycle  being  T .  In  time  T, 
the  radiation  approaches  the  observer  by  a  dis¬ 
tance  s  =  cT,  where  c  is  the  speed  of  propa¬ 
gation.  During  the  same  time,  the  source  moves 
with  respect  to  the  observer  a  distance  s'  =  vT, 
where  v  is  the  speed  of  the  source,  positive  for 
a  receding  source  and  negative  for  an  approach¬ 
ing  one.  We  find  that  the  length  of  one  cycle,  the 
wavelength  A,  equals 

X  =  s  s'  =  cT  +  vT. 

If  the  source  were  at  rest,  the  wavelength  of  its 
radiation  would  be  Ao  =  cT.  The  motion  of  the 
source  changes  the  wavelength  by  an  amount 


AA  =  A  —  Ao  =  cT  +  vT  —  cT  =  vT, 


and  the  relative  change  A  A  of  the  wavelength  is 


AX  v 
Xo  C 


(2.37) 


This  is  valid  only  when  v  <<C  c.  For  very  high  ve¬ 
locities,  we  must  use  the  relativistic  formula 


AX  1 1  -\-  v / c 
Xo  y  1  —  v/c 


(2.38) 


li  =  Jpl  cos 28  +  pj. 


These  formulas  are  valid  only  for  electromag- 
(2.36)  netic  radiation.  The  Doppler  shift  of  e.g.  sound 
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Fig.  2.25  (a)  A  plate  photographed  for  the  Carte  du  Ciel 
project  in  Helsinki  on  November  21,  1902.  The  centre  of 
the  held  is  at  a  =  18  h  40  min,  8  =  46°,  and  the  area  is 
2°  x  2°.  Distance  between  coordinate  lines  (exposed  sep¬ 
arately  on  the  plate)  is  5  minutes  of  arc.  (b)  The  framed 
region  on  the  same  plate,  (c)  The  same  area  on  a  plate 


taken  on  November  7,  1948.  The  bright  star  in  the  lower 
right  comer  (SAO  47747)  has  moved  about  12  seconds 
of  arc.  The  brighter,  slightly  drop- shaped  star  to  the  left 
is  a  binary  star  (SAO  47767);  the  separation  between  its 
components  is  8" 
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Fig.  2.26  Proper  motions  of  stars  slowly  change  the  appearance  of  constellations,  (a)  The  Big  Dipper  during  the  last 
ice  age  30,000  years  ago,  (b)  nowadays,  and  (c)  after  30,000  years 


Fig.  2.27  Astronomers  discussing  observations  with  the 
transit  circle  of  Helsinki  Observatory  in  1904 


waves  is  different  depending  on  the  motion  of  the 
source  or  the  observer. 

In  astronomy  the  Doppler  effect  can  be  seen 
in  stellar  spectra,  in  which  the  spectral  lines  are 
often  displaced  towards  the  blue  (shorter  wave¬ 
lengths)  or  red  (longer  wavelengths)  end  of  the 
spectrum.  A  blueshift  means  that  the  star  is  ap¬ 
proaching,  while  a  redshift  indicates  that  it  is  re¬ 
ceding. 

The  displacements  due  to  the  Doppler  effect 
are  usually  very  small.  In  order  to  measure  them, 
a  reference  spectrum  is  exposed  on  the  plate  next 
to  the  stellar  spectrum.  Now  that  CCD-cameras 
have  replaced  photographic  plates,  separate  cal¬ 


Fig.  2.28  The  radial  and  tangential  components,  vr  and 
vt  of  the  velocity  v  of  a  star.  The  latter  component  is  ob¬ 
served  as  proper  motion 


Fig.  2.29  The  wavelength  of  radiation  increases  if  the 
source  is  receding 

ibration  exposures  of  reference  spectra  are  taken 
to  determine  the  wavelength  scale  (Sect.  3.3).  The 
lines  in  the  reference  spectrum  are  produced  by 
a  light  source  at  rest  in  the  laboratory.  If  the  ref¬ 
erence  spectrum  contains  some  lines  found  also 
in  the  stellar  spectrum,  the  displacements  can  be 
measured. 

Also  the  spectra  of  distant  galaxies  show  red- 
shifts,  but  only  a  part  of  that  is  due  to  the  Doppler 
effect.  Instead,  the  redshifts  are  caused  by  the  ex¬ 
pansion  of  the  Universe  (Chap.  20).  Also,  gravi¬ 
tational  fields  cause  redshifts  as  predicted  by  the 
general  relativity  (Appendix  B). 

Displacements  of  spectral  lines  give  the  ra¬ 
dial  velocity  vY  of  the  star,  and  the  proper  mo¬ 
tion  fi  can  be  measured  from  photographic  plates 
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or  CCD  images.  To  find  the  tangential  velocity  nt, 
we  have  to  know  the  distance  r,  obtainable  from 
e.g.  parallax  measurements.  Tangential  velocity 
and  proper  motion  are  related  by 

vt  =  fir.  (2.39) 

If  fi  is  given  in  arc  seconds  per  year  and  r  in 
parsecs  we  have  to  make  the  following  unit  trans¬ 
formations  to  get  V{  in  km/s: 

1  rad  =  206,265",  1  year  =  3.156  x  107  s, 

1  pc  =  3.086  x  1013  km. 

Hence 

V{  =  4.74  fir ,  [vt]  =  km/s, 

(2.40) 

[lA  =  "/a,  [r]  =  pc. 

The  total  velocity  v  of  the  star  is  then 

v  =  y  +  v\.  (2.41) 


2.11  Constellations 

At  any  one  time,  about  1000-1500  stars  can  be 
seen  in  the  sky  (above  the  horizon).  Under  ideal 
conditions,  the  number  of  stars  visible  to  the 
naked  eye  can  be  as  high  as  3000  on  a  hemi¬ 
sphere,  or  6000  altogether.  Some  stars  seem  to 
form  figures  vaguely  resembling  something;  for 
ages  they  have  been  ascribed  to  various  mytho¬ 
logical  and  other  animals.  This  grouping  of  stars 
into  constellations  is  a  product  of  human  imagi¬ 
nation  without  any  physical  basis.  Different  cul¬ 
tures  have  different  constellations,  depending  on 
their  mythology,  history  and  environment. 

About  half  of  the  shapes  and  names  of  the 
constellations  we  are  familiar  with  date  back 
to  Mediterranean  antiquity.  But  the  names  and 
boundaries  were  far  from  unambiguous  as  late  as 
the  19th  century.  Therefore  the  International  As¬ 
tronomical  Union  (IAU)  confirmed  fixed  bound¬ 
aries  at  its  1928  meeting. 

The  official  boundaries  of  the  constellations 
were  established  along  lines  of  constant  right  as¬ 
cension  and  declination  for  the  epoch  1875.  Dur¬ 
ing  the  time  elapsed  since  then,  precession  has 


noticeably  turned  the  equatorial  frame.  However, 
the  boundaries  remain  fixed  with  respect  to  the 
stars.  So  a  star  belonging  to  a  constellation  will 
belong  to  it  forever  (unless  it  is  moved  across  the 
boundary  by  its  proper  motion). 

The  names  of  the  88  constellations  confirmed 
by  the  IAU  are  given  in  Table  C.21  at  the  end  of 
the  book.  The  table  also  gives  the  abbreviation  of 
the  Latin  name,  its  genitive  (needed  for  names  of 
stars)  and  the  English  name. 

In  his  star  atlas  Uranometria  (1603)  Johannes 
Bayer  started  the  current  practice  to  denote  the 
brightest  stars  of  each  constellation  by  Greek  let¬ 
ters.  The  brightest  star  is  usually  a  (alpha),  e.g. 
Deneb  in  the  constellation  Cygnus  is  a  Cygni, 
which  is  abbreviated  as  a  Cyg.  The  second  bright¬ 
est  star  is  /3  (beta),  the  next  one  y  (gamma)  and 
so  on.  There  are,  however,  several  exceptions  to 
this  rule;  for  example,  the  stars  of  the  Big  Dipper 
are  named  in  the  order  they  appear  in  the  con¬ 
stellation.  After  the  Greek  alphabet  has  been  ex¬ 
hausted,  Latin  letters  can  be  employed.  Another 
method  is  to  use  numbers,  which  are  assigned 
in  the  order  of  increasing  right  ascension;  e.g. 
30  Tau  is  a  bright  binary  star  in  the  constellation 
Taurus.  These  numbers  are  based  on  the  star  cata¬ 
logue  Historia  Ccelestis  Britannica  (1725)  by  the 
first  Astronomer  Royal,  John  Flamsteed.  More¬ 
over,  variable  stars  have  their  special  identifiers 
(Sect.  13.1).  About  two  hundred  bright  stars  have 
a  proper  name;  e.g.  the  bright  a  Aur  is  called  also 
Capella. 

As  telescopes  evolved,  more  and  more  stars 
were  seen  and  catalogued.  It  soon  became  im¬ 
practical  to  continue  this  method  of  naming. 
Thus  most  of  the  stars  are  known  only  by  their 
catalogue  index  numbers.  One  star  may  have 
many  different  numbers;  e.g.  the  abovementioned 
Capella  ( a  Aur)  is  number  BD+45°  1077  in  the 
Bonner  Durchmusterung  and  HD  34029  in  the 
Henry  Draper  catalogue. 


2.1 2  Star  Catalogues  and  Maps 

The  first  actual  star  catalogue  was  published  by 
Ptolemy  in  the  second  century;  this  catalogue 
appeared  in  the  book  to  be  known  later  as  Al¬ 
magest  (which  is  a  Latin  corruption  of  the  name 
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of  the  Arabic  translation,  Al-mijisti).  It  had  1025 
entries;  the  positions  of  these  bright  stars  had 
been  measured  by  Hipparchos  250  years  earlier. 
Ptolemy’s  catalogue  was  the  only  widely  used 
one  prior  to  the  17th  century. 

The  first  catalogues  still  being  used  by  as¬ 
tronomers  were  prepared  under  the  direction  of 
Friedrich  Wilhelm  August  Argelander  (1799- 
1875).  Argelander  worked  in  Turku  and  later 
served  as  professor  of  astronomy  in  Helsinki,  but 
he  made  his  major  contributions  in  Bonn.  Using 
a  72  mm  telescope,  he  and  his  assistants  mea¬ 
sured  the  positions  and  estimated  the  magnitudes 
of  320,000  stars.  The  catalogue,  Bonner  Durch- 
musterung ,  contains  nearly  all  stars  brighter  than 
magnitude  9.5  between  the  north  pole  and  dec¬ 
lination  —2°.  (Magnitudes  are  further  discussed 
in  Chap.  4.)  Argelander’s  work  was  later  used  as 
a  model  for  two  other  large  catalogues  covering 
the  whole  sky.  The  total  number  of  stars  in  these 
catalogues  is  close  to  one  million  (Fig.  2.30). 

The  purpose  of  these  Durchmusterungen  or 
general  catalogues  was  to  systematically  list 
a  great  number  of  stars.  In  the  zone  catalogues , 
the  main  goal  is  to  give  the  positions  of  stars 
as  exactly  as  possible.  A  typical  zone  cata¬ 
logue  is  the  German  Katalog  der  Astronomischen 
Gesellschaft  (AGK).  Twelve  observatories,  each 
measuring  a  certain  region  in  the  sky,  contributed 
to  this  catalogue.  The  work  was  begun  in  the 
1870’s  and  completed  at  the  turn  of  the  century. 

General  and  zone  catalogues  were  based  on  vi¬ 
sual  observations  with  a  telescope.  The  evolution 
of  photography  made  this  kind  of  work  unneces¬ 
sary  at  the  end  of  the  19th  century.  Photographic 
plates  could  be  stored  for  future  purposes,  and 
measuring  the  positions  of  stars  became  easier 
and  faster,  making  it  possible  to  measure  many 
more  stars. 

A  great  international  program  was  started  at 
the  end  of  the  19th  century  in  order  to  photograph 
the  entire  sky.  Eighteen  observatories  participated 
in  this  Carte  du  Ciel  project,  all  using  similar  in¬ 
struments  and  plates.  The  positions  of  stars  were 
first  measured  with  respect  to  a  rectangular  grid 
exposed  on  each  plate  (Fig.  2.25a).  These  coor¬ 
dinates  could  then  be  converted  into  declination 
and  right  ascension. 


The  old  plates  have  been  useful  also  much 
later.  By  observing  the  same  areas  again  the 
proper  motions  of  many  stars  can  be  measured. 

Positions  of  stars  in  catalogues  are  measured 
with  respect  to  certain  comparison  stars,  the  co¬ 
ordinates  of  which  are  known  with  high  accuracy. 
The  coordinates  of  these  reference  stars  are  pub¬ 
lished  in  fundamental  catalogues.  The  first  such 
catalogue  was  needed  for  the  AGK  catalogue;  it 
was  published  in  Germany  in  1879.  This  Funda¬ 
mental  Katalog  (FK  1)  gives  the  positions  of  over 
500  stars. 

A  widely  used  catalogue  is  the  SAO  cata¬ 
logue,  published  by  the  Smithsonian  Astrophysi- 
cal  Observatory  in  the  1960’s.  It  contains  the  ex¬ 
act  positions,  magnitudes,  proper  motions,  spec¬ 
tral  classifications,  etc.  of  258,997  stars  brighter 
than  magnitude  9.  The  catalogue  was  accompa¬ 
nied  by  a  star  map  containing  all  the  stars  in  the 
catalogue. 

In  the  1990’s  a  large  astrometric  catalogue, 
PPM  (Positions  and  Proper  Motions),  was  pub¬ 
lished  to  replace  the  AGK  and  SAO  catalogues.  It 
contained  all  stars  brighter  than  7.5  magnitudes, 
and  was  almost  complete  to  magnitude  8.5.  Al¬ 
together,  the  four  volumes  of  the  catalogue  con¬ 
tained  information  on  378,910  stars. 

The  PPM  was  effectively  replaced  by  the  Ty¬ 
cho  catalogue  from  Hipparcos  satellite.  Hippar- 
cos  was  the  first  astrometric  satellite,  and  was 
launched  by  the  European  Space  Agency  (ESA) 
in  1989.  Although  Hipparcos  didn’t  reach  the 
planned  geosynchronous  orbit,  it  gave  exact  posi¬ 
tions  of  over  a  hundred  thousand  stars.  The  Hip¬ 
parcos  catalogue ,  based  on  the  measurements  of 
the  satellite,  contains  astrometric  and  photomet¬ 
ric  data  of  1 18,000  stars.  The  coordinates  are  pre¬ 
cise  to  a  couple  of  milliarcseconds.  The  less  pre¬ 
cise  Tycho  catalogue  contains  the  data  of  about 
one  million  stars. 

In  1999  and  2000,  the  sixth  version  of  the 
Fundamental  Katalog,  the  FK6,  was  published. 
It  combined  the  Hipparcos  data  and  FK5  for 
4150  fundamental  stars.  The  typical  mean  error  in 
proper  motion  was  0.35  milliarcseconds  per  year 
for  the  basic  stars.  With  the  advance  of  the  Inter¬ 
net,  the  printed  versions  of  star  catalogues  were 
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Fig.  2.30  The  representations  in  four  atlases  of  the 
Hyades  cluster  in  the  constellation  Taurus,  (a)  Heis: 
Atlas  Coelestis,  published  in  1872.  (b)  Bonner  Durch- 
musterung.  (c)  SAO,  (d)  Palomar  Sky  Atlas,  red  plate.  The 
big  blob  is  the  brightest  star  of  Taurus,  or  a  Tauri  alias 

discontinued  in  the  first  years  of  the  new  millen¬ 
nium,  and  the  catalogues  were  moved  to  the  In¬ 
ternet. 


Aldebaran.  (e)  All  the  stars  in  the  Tycho  Catalog,  num¬ 
bering  over  one  million,  are  marked  on  an  all-sky  chart. 
The  bright  lane  is  the  Milky  Way.  (Picture  David  Seal, 
NASA/JPL/Caltech) 


In  the  current  ICRS  system  (International  Co¬ 
ordinate  Reference  System)  the  celestial  coordi¬ 
nates  are  fixed  to  distant  quasars,  the  positions 
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of  which  remain  pretty  constant  with  time.  The 
coordinates  of  the  epoch  J2000.0  correspond  to 
the  coordinates  of  this  fundamental  system  quite 
closely. 

With  the  new  media,  the  size  of  the  star  cat¬ 
alogues  exploded.  The  first  Hubble  Guide  Star 
Catalog  from  the  early  1990’s  contained  18  mil¬ 
lion  stars  and  the  second  Guide  Star  Catalog 
from  the  year  2001,  nearly  500  million  stars.  It 
was  surpassed  by  the  U.S.  Naval  Observatory 
USNO-Bl.O  Catalog,  which  contains  entries  for 
1,024,618,261  stars  and  galaxies  from  digitised 
images  of  several  photographic  sky  surveys.  The 
catalogue  presents  right  ascension  and  declina¬ 
tion,  proper  motion  and  magnitude  estimates. 

The  next  step  in  the  accuracy  of  astrometry 
will  be  achieved  in  the  2010’s  with  a  new  Eu¬ 
ropean  astrometric  satellite.  The  Gaia  satellite 
launched  in  2013  has  improved  the  accuracy  to 
about  1(T5  seconds  of  arc. 

The  importance  of  astrometry  is  not  limited 
to  star  maps.  Astrometric  observations  give  ba¬ 
sic  information  with  deeper  physical  content  also. 
When  the  distance  of  a  star  is  known,  its  true  radi¬ 
ation  power  can  be  calculated  from  the  observed 
brightness  (Chap.  4),  giving  information  about 
the  structure  and  evolution  of  the  star  (Chaps.  11 
and  12).  Motions  of  stars  are  related  e.g.  to  the 
mass  distribution  of  the  Milky  Way  (Chap.  18). 

Star  maps  have  been  published  since  ancient 
times,  but  the  earliest  maps  were  globes  show¬ 
ing  the  celestial  sphere  as  seen  from  the  outside. 
At  the  beginning  of  the  17th  century,  a  German, 
Johannes  Bayer,  published  the  first  map  show¬ 
ing  the  stars  as  seen  from  inside  the  celestial 
sphere,  as  we  see  them  in  the  sky.  Constellations 
were  usually  decorated  with  drawings  of  mytho¬ 
logical  figures.  The  Uranometria  Nova  (1843)  by 
Argelander  represents  a  transition  towards  mod¬ 
em  maps:  mythological  figures  are  beginning  to 
fade  away.  The  map  accompanying  the  Bonner 
Durchmusterung  carried  this  evolution  to  its  ex¬ 
treme.  The  sheets  contain  nothing  but  stars  and 
coordinate  lines. 

Most  maps  are  based  on  star  catalogues.  Pho¬ 
tography  made  it  possible  to  produce  star  maps 
without  the  cataloguing  stage.  The  most  impor¬ 
tant  of  such  maps  is  a  photographic  atlas  the 


full  name  of  which  is  The  National  Geographic 
Society — Palomar  Observatory  Sky  Atlas.  The 
plates  for  this  atlas  were  taken  with  the  1.2  m 
Schmidt  camera  on  Mount  Palomar.  The  Palomar 
Sky  Atlas  was  completed  in  the  1950’s.  It  con¬ 
sists  of  935  pairs  of  photographs:  each  region  has 
been  photographed  in  red  and  blue  light.  The  size 
of  each  plate  is  about  35  cm  x  35  cm,  covering 
an  area  of  6.6°  x  6.6°.  The  prints  are  negatives 
(black  stars  on  a  light  background),  because  in 
this  way,  fainter  objects  are  visible.  The  limiting 
magnitude  is  about  19  in  blue  and  20  in  red. 

The  Palomar  atlas  covers  the  sky  down  to 
—30°.  Work  to  map  the  rest  of  the  sky  was  car¬ 
ried  out  later  at  two  observatories  in  the  south¬ 
ern  hemisphere,  at  Siding  Spring  Observatory  in 
Australia,  and  at  the  European  Southern  Obser¬ 
vatory  (ESO)  in  Chile.  The  instruments  and  the 
scale  on  the  plates  are  similar  to  those  used  earlier 
for  the  Palomar  plates,  but  the  atlas  is  distributed 
on  film  transparencies  instead  of  paper  prints. 

For  amateurs  there  are  several  star  maps  of 
various  kinds.  Some  of  them  are  mentioned  in  the 
references. 

Several  star  catalogues  can  nowadays  copied 
freely  from  the  Internet.  Thus  anyone  with  suit¬ 
able  programming  skills  can  make  star  maps  ap¬ 
propriate  for  her  own  purposes. 


2.1 3  Sidereal  and  Solar  Time 

Time  measurements  can  be  based  on  the  rotation 
of  the  Earth,  orbital  motion  around  the  Sun,  or 
on  atomic  clocks.  The  last-mentioned  will  be  dis¬ 
cussed  in  the  next  section.  Here  we  consider  the 
sidereal  and  solar  times  related  to  the  rotation  of 
the  Earth. 

We  defined  the  sidereal  time  as  the  hour  an¬ 
gle  of  the  vernal  equinox.  A  good  basic  unit  is  a 
sidereal  day ,  which  is  the  time  between  two  suc¬ 
cessive  upper  culminations  of  the  vernal  equinox. 
After  one  sidereal  day  the  celestial  sphere  with 
all  its  stars  has  returned  to  its  original  position 
with  respect  to  the  observer.  The  flow  of  sidereal 
time  is  as  constant  as  the  rotation  of  the  Earth. 
The  rotation  rate  is  slowly  decreasing,  and  thus 
the  length  of  the  sidereal  day  is  increasing.  In  ad¬ 
dition  to  the  smooth  slowing  down  irregular  vari- 
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Fig.  2.31  One  sidereal  day  is  the  time  between  two 
successive  transits  or  upper  culminations  of  the  vernal 
equinox.  By  the  time  the  Earth  has  moved  from  A  to  B, 
one  sidereal  day  has  elapsed.  The  angle  A  is  greatly  exag¬ 
gerated;  in  reality,  it  is  slightly  less  than  one  degree 

ations  of  the  order  of  one  millisecond  have  been 
observed. 

Unfortunately,  also  the  sidereal  time  comes  in 
two  varieties,  apparent  and  mean.  The  apparent 
sidereal  time  is  determined  by  the  true  vernal 
equinox,  and  so  it  is  obtained  directly  from  ob¬ 
servations. 

Because  of  the  precession  the  ecliptic  longi¬ 
tude  of  the  vernal  equinox  increases  by  about 
50"  a  year.  This  motion  is  very  smooth.  Nutation 
causes  more  complicated  wobbling.  The  mean 
equinox  is  the  point  where  the  vernal  equinox 
would  be  if  there  were  no  nutation.  The  mean 
sidereal  time  is  the  hour  angle  of  this  mean 
equinox. 

The  difference  of  the  apparent  and  mean  side¬ 
real  time  is  called  the  equation  of  equinoxes: 

(9a  —  @M  =  At/f  cos  £,  (2.42) 

where  s  is  the  obliquity  of  the  ecliptic  at  the  in¬ 
stant  of  the  observation,  and  the  nutation  in 
longitude.  This  value  is  tabulated  for  each  day 
e.g.  in  the  Astronomical  Almanac.  It  can  also  be 
computed  from  the  formulae  given  in  Box  2.1.  It 
is  at  most  about  one  second,  so  it  has  to  be  taken 
into  account  only  in  the  most  precise  calculations. 

Figure  2.31  shows  the  Sun  and  the  Earth  at 
vernal  equinox.  When  the  Earth  is  at  the  point  A, 
the  Sun  culminates  and,  at  the  same  time,  a  new 
sidereal  day  begins  in  the  city  with  the  huge  black 
arrow  standing  in  its  central  square.  After  one 


sidereal  day,  the  Earth  has  moved  along  its  orbit 
almost  one  degree  of  arc  to  the  point  B.  There¬ 
fore  the  Earth  has  to  turn  almost  a  degree  further 
before  the  Sun  will  culminate.  The  solar  or  syn¬ 
odic  day  is  therefore  3  min  56.56  s  (sidereal  time) 
longer  than  the  sidereal  day.  This  means  that  the 
beginning  of  the  sidereal  day  will  move  around 
the  clock  during  the  course  of  one  year.  After 
one  year,  sidereal  and  solar  time  will  again  be  in 
phase.  The  number  of  sidereal  days  in  one  year  is 
one  higher  than  the  number  of  solar  days. 

When  we  talk  about  rotation  periods  of  plan¬ 
ets,  we  usually  mean  sidereal  periods.  The  length 
of  day,  on  the  other  hand,  means  the  rotation  pe¬ 
riod  with  respect  to  the  Sun.  If  the  orbital  period 
around  the  Sun  is  P ,  sidereal  rotation  period  r* 
and  synodic  day  r,  we  now  know  that  the  num¬ 
ber  of  sidereal  days  in  time  P ,  P/r* ,  is  one  higher 
than  the  number  of  synodic  days,  P/r: 

P  P  _  j 


-  =  (2.43) 

r  r*  P 

This  holds  for  a  planet  rotating  in  the  direction  of 
its  orbital  motion  (counterclockwise).  If  the  sense 
of  rotation  is  opposite,  or  retrograde ,  the  number 
of  sidereal  days  in  one  orbital  period  is  one  less 
than  the  number  of  synodic  days,  and  the  equa¬ 
tion  becomes 

1  1  1 

-  =  -  +  (2.44) 

r  r*  P 

For  the  Earth,  we  have  P  =  365.2564  d,  and 
r  =  1  d,  whence  (2.43)  gives  r*  =  0.99727  d  = 
23  h  56  min  4  s,  solar  time. 

Since  our  everyday  life  follows  the  alternation 
of  day  and  night,  it  is  more  convenient  to  base  our 
timekeeping  on  the  apparent  motion  of  the  Sun 
rather  than  that  of  the  stars.  Unfortunately,  the  so¬ 
lar  time  does  not  flow  at  a  constant  rate.  There  are 
two  reasons  for  this.  First,  the  orbit  of  the  Earth  is 
not  exactly  circular,  but  an  ellipse,  which  means 
that  the  velocity  of  the  Earth  along  its  orbit  is 
not  constant.  Second,  the  Sun  moves  along  the 
ecliptic,  not  the  equator.  Thus  its  right  ascension 
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does  not  increase  at  a  constant  rate.  The  change 
is  fastest  at  the  end  of  December  (4  min  27  s  per 
day)  and  slowest  in  mid-September  (3  min  35  s 
per  day).  As  a  consequence,  the  hour  angle  of  the 
Sun  (which  determines  the  solar  time)  also  grows 
at  an  uneven  rate. 

To  find  a  solar  time  flowing  at  a  constant  rate, 
we  define  a  fictitious  mean  sun ,  which  moves 
along  the  celestial  equator  with  constant  angu¬ 
lar  velocity,  making  a  complete  revolution  in  one 
year.  By  year  we  mean  here  the  tropical  year , 
which  is  the  time  it  takes  for  the  Sun  to  move 
from  one  vernal  equinox  to  the  next.  In  one  trop¬ 
ical  year,  the  right  ascension  of  the  Sun  increases 
exactly  24  hours.  The  length  of  the  tropical  year 
is  365  d  5  h  48  min  46  s  =  365.2422  d.  Since  the 
direction  of  the  vernal  equinox  moves  due  to  pre¬ 
cession,  the  tropical  year  differs  from  the  sidereal 
year,  during  which  the  Sun  makes  one  revolution 
with  respect  to  the  background  stars.  One  sidereal 
year  is  365.2564  d. 

Using  our  artificial  mean  sun,  we  now  define 
an  evenly  flowing  solar  time,  the  mean  solar  time 
(or  simply  mean  time)  7m,  which  is  equal  to  the 
hour  angle  Hm  of  the  centre  of  the  mean  sun  plus 
12  hours  (so  that  the  date  will  change  at  midnight, 
to  annoy  astronomers): 

7m  =  hyi  +  12  h.  (2.45) 

The  difference  between  the  true  solar  time  T 
and  the  mean  time  7m  is  called  the  equation  of 
time : 

E.T.  =  T  -  7m.  (2.46) 

(In  spite  of  the  identical  abbreviation,  this  has 
nothing  to  do  with  a  certain  species  of  little  green 
men.)  The  greatest  positive  value  of  E.T.  is  about 
16  minutes  and  the  greatest  negative  value  about 
—  14  minutes  (see  Fig.  2.32).  This  is  also  the  dif¬ 
ference  between  the  true  noon  (the  meridian  tran¬ 
sit  of  the  Sun)  and  the  mean  noon. 

Both  the  true  solar  time  and  mean  time  are 
local  times ,  depending  on  the  hour  angle  of  the 
Sun,  real  or  artificial.  If  one  observes  the  true  so¬ 
lar  time  by  direct  measurement  and  computes  the 
mean  time  from  (2.46),  a  digital  watch  will  prob¬ 
ably  be  found  to  disagree  with  both  of  them.  The 
reason  for  this  is  that  we  do  not  use  local  time  in 


Fig.  2  .32  Equation  of  time.  A  sundial  always  shows  (if 
correctly  installed)  true  local  solar  time.  To  find  the  local 
mean  time  the  equation  of  time  must  be  subtracted  from 
the  local  solar  time 

our  everyday  life;  instead  we  use  the  zonal  time 
of  the  nearest  time  zone . 

In  the  past,  each  city  had  its  own  local  time. 
When  travelling  became  faster  and  more  popu¬ 
lar,  the  great  variety  of  local  times  became  an  in¬ 
convenience.  At  the  end  of  the  19th  century,  the 
Earth  was  divided  into  24  zones,  the  time  of  each 
zone  differing  from  the  neighbouring  ones  by  one 
hour.  On  the  surface  of  the  Earth,  one  hour  in  time 
corresponds  to  15°  in  longitude;  the  time  of  each 
zone  is  determined  by  the  local  mean  time  at  one 
of  the  longitudes  0°,  15°, ... ,  345°. 

The  time  of  the  zero  meridian  going  through 
Greenwich  is  used  as  an  international  reference, 
Universal  Time.  In  most  European  countries,  time 
is  one  hour  ahead  of  this  (Fig.  2.33). 

In  summer,  many  countries  switch  to  daylight 
saving  time ,  during  which  time  is  one  hour  ahead 
of  the  ordinary  time.  The  purpose  of  this  is  to 
make  the  time  when  people  are  awake  coincide 
with  daytime  in  order  to  save  electricity,  particu¬ 
larly  in  the  evening,  when  people  go  to  bed  one 
hour  earlier.  During  daylight  saving  time,  the  dif¬ 
ference  between  the  true  solar  time  and  the  offi¬ 
cial  time  can  grow  even  larger.  It  has  also  been 
criticised  that  the  disadvantages  of  the  daylight 
saving  time  are  much  more  severe  than  its  profits 
in  energy  saving.  Many  people  have  serious  diffi¬ 
culties  to  adapt  to  a  different  daily  rhythm  twice 
every  year.  Also,  confusingly,  different  countries 
use  different  rules  for  the  daylight  saving  time. 

In  the  EU  countries  the  daylight  saving  time 
begins  on  the  last  Sunday  of  March,  at  1  o’clock 
UTC  in  the  morning,  when  the  clocks  are  moved 
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WORLD  MAP  OF  TIME  ZONES 
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forward  by  one  hour,  and  ends  on  the  last  Sunday 
of  October  at  1  o’clock  UTC. 


2.1 4  Astronomical  Time  Systems 


Time  can  be  defined  using  several  different  phe¬ 
nomena: 

1 .  The  solar  and  sidereal  times  are  based  on  the 
rotation  of  the  Earth. 

2.  The  standard  unit  of  time  in  the  current  SI  sys¬ 
tem,  the  second,  is  based  on  quantum  mechan¬ 
ical  atomary  phenomena. 

3.  Equations  of  physics  like  the  ones  describing 
the  motions  of  celestial  bodies  involve  a  time 
variable  corresponding  to  an  ideal  time  run¬ 
ning  at  a  constant  pace.  The  ephemeris  time 
and  dynamical  time  discussed  a  little  later  are 
such  times. 

Observations  give  directly  the  apparent  side¬ 
real  time  as  the  hour  angle  of  the  true  vernal 
equinox.  From  the  apparent  sidereal  time  the 
mean  sidereal  time  can  be  calculated. 

The  universal  time  UT  is  defined  by  the  equa¬ 
tion 


GMST(0  UT)  =  24, 1 10.54841  s 

+  T  x  8,640,184.812866  s 
+  T2  x  0.093104  s 
-  T3  x  0.0000062  s, 

(2.47) 

where  GMST  is  the  Greenwich  mean  sidereal 
time  and  T  the  Julian  century.  The  latter  is  ob¬ 
tained  from  the  Julian  date  /,  which  is  a  running 
number  of  the  day  (Sects.  2.15  and  Box  2.2): 


J  —  2,451,545.0 
““  36,525 


(2.48) 


This  gives  the  time  elapsed  since  January  1,  2000, 
in  Julian  centuries. 

Sidereal  time  and  hence  also  UT  are  related 
to  the  rotation  of  the  Earth,  and  thus  contain  per¬ 
turbations  due  to  the  irregular  variations,  mainly 
slowing  down,  of  the  rotation. 

In  (2.47)  the  constant  8,640,184.812866  s 
tells  how  much  sidereal  time  runs  fast  compared 


to  the  UT  in  a  Julian  century.  As  the  rotation  of 
the  Earth  is  slowing  down  the  solar  day  becomes 
longer.  Since  the  Julian  century  T  contains  a  fixed 
number  of  days,  it  will  also  become  longer.  This 
gives  rise  to  the  small  correction  terms  in  (2.47). 

Strictly  speaking  this  universal  time  is  the  time 
denoted  by  UT1.  Observations  give  UT0,  which 
contains  a  small  perturbation  due  to  the  wander¬ 
ing  of  the  geographical  pole,  or  polar  variation. 
The  direction  of  the  axis  with  respect  to  the  solid 
surface  varies  by  about  0.1"  (a  few  metres  on  the 
surface)  with  a  period  of  about  430  days  ( Chan¬ 
dler  period).  In  addition  to  this,  the  polar  motion 
contains  a  slow  nonperiodic  part. 

The  z  axis  of  the  astronomical  coordinates  is 
aligned  with  the  angular  momentum  vector  of  the 
Earth,  but  the  terrestrial  coordinates  refer  to  the 
axis  at  the  epoch  1903.5.  In  the  most  accurate  cal¬ 
culations  this  has  to  be  taken  into  account. 

Due  to  the  polar  variation  the  UT0  does  not 
grow  at  a  constant  rate,  since  also  the  direction 
of  the  origin,  the  Greenwich  meridian,  is  vary¬ 
ing.  There  are  also  additional  irregular  variations 
of  the  order  of  a  millisecond,  and  the  secular 
slowing-down  caused  by  tidal  forces. 

Nowadays  the  SI  unit  of  time,  the  second,  is 
defined  in  a  way  that  has  nothing  to  do  with  celes¬ 
tial  phenomena.  Periods  of  quantum  mechanical 
phenomena  remain  more  stable  than  the  motions 
of  celestial  bodies  involving  complicated  pertur¬ 
bations. 

In  1967,  one  second  was  defined  as 
9,192,631,770  times  the  period  of  the  light  emit¬ 
ted  by  cesium  133  isotope  in  its  ground  state, 
transiting  from  hyperfine  level  F  =  4  to  F  =  3. 
Later,  this  definition  was  revised  to  include 
small  relativistic  effects  produced  by  gravita¬ 
tional  fields.  The  relative  accuracy  of  this  atomic 
time  is  about  10-12.  The  international  atomic 
time ,  TAI,  was  adopted  as  the  basis  of  time  sig¬ 
nals  in  1972.  The  time  is  maintained  by  the  Bu¬ 
reau  International  des  Poids  et  Mesures  in  Paris, 
and  it  is  the  average  of  several  accurate  atomic 
clocks. 

Even  before  atomic  clocks  there  was  a  need 
for  an  ideal  time  proceeding  at  a  perfectly  con¬ 
stant  rate,  corresponding  to  the  time  variable 
in  the  equations  of  Newtonian  mechanics.  The 
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ephemeris  time  was  such  a  time.  It  was  used 
e.g.  for  tabulating  ephemerides.  The  unit  of 
ephemeris  time  was  the  ephemeris  second ,  which 
is  the  length  of  the  tropical  year  1900  divided  by 
31,556,925.9747.  Ephemeris  time  was  not  known 
in  advance.  Only  afterwards  was  it  possible  to  de¬ 
termine  the  difference  of  ET  and  UT  from  obser¬ 
vational  data. 

In  1984  ephemeris  time  was  replaced  by  dy¬ 
namical  time.  It  comes  in  two  varieties. 

The  terrestrial  dynamical  time  (TDT)  corre¬ 
sponds  to  the  proper  time  of  an  observer  moving 
with  the  Earth.  The  time  scale  is  affected  by  the 
relativistic  time  dilation  due  to  the  orbital  speed 
of  the  Earth.  The  rotation  velocity  depends  on 
the  latitude,  and  thus  in  TDT  it  is  assumed  that 
the  observer  is  not  rotating  with  the  Earth.  The 
zero  point  of  TDT  was  chosen  so  that  the  old  ET 
changed  without  a  jump  to  TDT. 

In  1991  a  new  standard  time,  the  terrestrial 
time  (TT),  was  adopted.  Practically  it  is  equiva¬ 
lent  to  TDT. 

TT  (or  TDT)  is  the  time  currently  used  for  tab¬ 
ulating  ephemerides  of  planets  and  other  celestial 
bodies.  For  example,  the  Astronomical  Almanac 
gives  the  coordinates  of  the  planets  for  each  day 
at  0  TT. 

The  Astronomical  Almanac  also  gives  the  dif¬ 
ference 

AT  =  TDT  -  UT  (2.49) 

for  earlier  years.  For  the  present  year  and  some 
future  years  a  prediction  extrapolated  from  the 
earlier  years  is  given.  Its  accuracy  is  about  0.1  s. 
At  the  beginning  of  1990  the  difference  was 
56.7  s;  it  increases  every  year  by  an  amount  that 
is  usually  a  little  less  than  one  second. 

The  terrestrial  time  differs  from  the  atomic 
time  by  a  constant  offset 

TT  =  TAI  +  32.184  s.  (2.50) 

TT  is  well  suited  for  ephemerides  of  phenom¬ 
ena  as  seen  from  the  Earth.  The  equations  of  mo¬ 
tion  of  the  solar  system,  however,  are  solved  in 
a  frame  the  origin  of  which  is  the  centre  of  mass 
or  barycentre  of  the  solar  system.  The  coordinate 
time  of  this  frame  is  called  the  barycentric  dy¬ 
namical  time ,  TDB.  The  unit  of  TDB  is  defined 


so  that,  on  the  average,  it  runs  at  the  same  rate  as 
TT,  the  difference  containing  only  periodic  terms 
depending  on  the  orbital  motion  of  the  Earth.  The 
difference  can  usually  be  neglected,  since  it  is  at 
most  about  0.002  seconds. 

Which  of  these  many  times  should  we  use 
in  our  alarm-clocks?  None  of  them.  Yet  another 
time  is  needed  for  that  purpose.  This  official  wall- 
clock  time  is  called  the  coordinated  universal 
time ,  UTC.  The  zonal  time  follows  UTC  but  dif¬ 
fers  from  it  usually  by  an  integral  number  of 
hours. 

UTC  is  defined  so  that  it  proceeds  at  the  same 
rate  as  TAI,  but  differs  from  it  by  an  integral  num¬ 
ber  of  seconds.  These  leap  seconds  are  used  to  ad¬ 
just  UTC  so  that  the  difference  from  UT1  never 
exceeds  0.9  seconds  (Fig.  2.34).  A  leap  second 
is  added  either  at  the  beginning  of  a  year  or  the 
night  between  June  and  July. 

The  difference 

AAT  =  TAI  —  UTC  (2.51) 

is  also  tabulated  in  the  Astronomical  Almanac. 
According  to  the  definition  of  UTC  the  difference 
in  seconds  is  always  an  integer.  The  difference 
cannot  be  predicted  very  far  to  the  future. 

From  (2.50)  and  (2.51)  we  get 

TT  =  UTC  +  32.184  s  +  AAT,  (2.52) 

which  gives  the  terrestrial  time  TT  corresponding 
to  a  given  UTC.  Table  2.2  gives  this  correction. 
The  table  is  easy  to  extend  to  the  future.  When 
it  is  told  in  the  news  that  a  leap  second  will  be 
added  the  difference  will  increase  by  one  second. 
In  case  the  number  of  leap  seconds  is  not  known, 
it  can  be  approximated  that  a  leap  second  will  be 
added  every  1.25  years. 

The  units  of  the  coordinated  universal  time 
UTC,  atomic  time  TAI  and  terrestrial  time  TT  are 
the  same  second  of  the  SI  system.  Hence  all  these 
times  proceed  at  the  same  rate,  the  only  differ¬ 
ence  being  in  their  zero  points.  The  difference  of 
the  TAI  and  TT  is  always  the  same,  but  due  to  the 
leap  seconds  the  UTC  will  fall  behind  in  a  slightly 
irregular  way. 

Culminations  and  rising  and  setting  times  of 
celestial  bodies  are  related  to  the  rotation  of  the 
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Fig.  2.34  The  difference  between  the  universal  time 
UT1,  based  on  the  rotation  of  the  Earth,  and  the  coordi¬ 
nated  universal  time  UTC  during  1972-2002.  Because  the 
rotation  of  the  Earth  is  slowing  down,  the  UT1  will  run 


Table  2.2  Differences  of  the  atomic  time  and  UTC 
(AAT)  and  the  terrestrial  time  TT  and  UTC.  The  terres¬ 
trial  time  TT  used  in  ephemerides  is  obtained  by  adding 
AAT  +  32.184  s  to  the  ordinary  time  UTC 


AAT 

TT  -  UTC 

1.1.1972-  30.6.1972 

10  s 

42.184  s 

1.7.1972-31.12.1972 

11  s 

43.184  s 

1.1.1973-31.12.1973 

12  s 

44.184  s 

1.1.1974-31.12.1974 

13  s 

45.184  s 

1.1.1975-31.12.1975 

14  s 

46.184  s 

1.1.1976-31.12.1976 

15  s 

47.184  s 

1.1.1977-31.12.1977 

16  s 

48.184  s 

1.1.1978-31.12.1978 

17  s 

49.184  s 

1.1.1979-31.12.1979 

18  s 

50.184  s 

1.1.1980-  30.6.1981 

19  s 

51.184  s 

1.7.1981-  30.6.1982 

20  s 

52.184  s 

1.7.1982-  30.6.1983 

21  s 

53.184  s 

1.7.1983-  30.6.1985 

22  s 

54.184  s 

1.7.1985-31.12.1987 

23  s 

55.184  s 

1.1.1988-31.12.1989 

24  s 

56.184  s 

1.1.1990-31.12.1990 

25  s 

57.184  s 

1.1.1991-  30.6.1992 

26  s 

58.184  s 

1.7.1992-  30.6.1993 

27  s 

59.184  s 

1.7.1993-  30.6.1994 

28  s 

60.184  s 

1.7.1994-31.12.1995 

29  s 

61.184  s 

1.1.1996-31.6.1997 

30  s 

62.184  s 

1.7.1997-31.12.1998 

31  s 

63.184  s 

1.1.1999-31.12.2005 

32  s 

64.184  s 

1.1.2006- 

33  s 

65.184  s 

slow  of  the  UTC  by  about  0.8  seconds  a  year.  Leap  sec¬ 
onds  are  added  to  the  UTC  when  necessary  to  keep  the 
times  approximately  equal.  In  the  graph  these  leap  sec¬ 
onds  are  seen  as  one  second  jumps  upward 

Earth.  Thus  the  sidereal  time  and  hence  the  UT 
of  such  an  event  can  be  calculated  precisely.  The 
corresponding  UTC  cannot  differ  from  the  UT  by 
more  than  0.9  seconds,  but  the  exact  value  is  not 
known  in  advance.  The  future  coordinates  of  the 
Sun,  Moon  and  planets  can  be  calculated  as  func¬ 
tions  of  the  TT,  but  the  corresponding  UTC  can 
only  be  estimated. 


2.15  Calendars 

Our  calendar  is  a  result  of  long  evolution.  The 
main  problem  it  must  contend  with  is  the  incom¬ 
mensurability  of  the  basic  units,  day,  month  and 
year:  the  numbers  of  days  and  months  in  a  year 
are  not  integers.  This  makes  it  rather  complicated 
to  develop  a  calendar  that  takes  correctly  into  ac¬ 
count  the  alternation  of  seasons,  day  and  night, 
and  perhaps  also  the  lunar  phases. 

Our  calendar  has  its  origin  in  the  Roman  cal¬ 
endar,  which,  in  its  earliest  form,  was  based  on 
the  phases  of  the  Moon.  From  around  700  B.C. 
on,  the  length  of  the  year  has  followed  the  appar¬ 
ent  motion  of  the  Sun;  thus  originated  the  division 
of  the  year  into  twelve  months.  One  month,  how¬ 
ever,  still  had  a  length  roughly  equal  to  the  lunar 
cycle.  Hence  one  year  was  only  354  days  long. 
To  keep  the  year  synchronised  with  the  seasons, 
a  leap  month  had  to  be  added  to  every  other  year. 

Eventually  the  Roman  calendar  got  mixed  up. 
The  mess  was  cleared  by  Julius  Caesar  in  about 
46  B.C. ,  when  the  Julian  calendar  was  developed 
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upon  his  orders  by  the  Alexandrian  astronomer 
Sosigenes.  The  year  had  365  days  and  a  leap  day 
was  added  to  every  fourth  year. 

In  the  Julian  calendar,  the  average  length  of 
one  year  is  365  d  6  h,  but  the  tropical  year  is 
11  min  14  s  shorter.  After  128  years,  the  Julian 
year  begins  almost  one  day  too  late.  The  differ¬ 
ence  was  already  10  days  in  1582,  when  a  calen¬ 
dar  reform  was  carried  out  by  Pope  Gregory  XIII. 
In  the  Gregorian  calendar ,  every  fourth  year  is 
a  leap  year,  the  years  divisible  by  100  being  ex¬ 
ceptions.  Of  these,  only  the  years  divisible  by  400 
are  leap  years.  Thus  1900  was  not  a  leap  year,  but 
2000  was.  The  Gregorian  calendar  was  adopted 
slowly,  at  different  times  in  different  countries. 
Although  the  calendar  reform  was  seen  neces¬ 
sary,  orders  given  by  the  catholic  pope  were  not 
easy  to  accept  in  e.g.  protestant  countries.  The 
transition  period  did  not  end  before  the  20th  cen¬ 
tury. 

Even  the  Gregorian  calendar  is  not  perfect. 
The  differences  from  the  tropical  year  will  accu¬ 
mulate  to  one  day  in  about  3300  years. 

Since  years  and  months  of  variable  length 
make  it  difficult  to  compute  time  differences, 
especially  astronomers  have  employed  various 
methods  to  give  each  day  a  running  number.  The 
most  widely  used  numbers  are  the  Julian  dates. 
In  spite  of  their  name,  they  are  not  related  to  the 
Julian  calendar.  The  only  connection  is  the  length 
of  a  Julian  century  of  36,525  days,  a  quantity  ap¬ 
pearing  in  many  formulas  involving  Julian  dates. 
The  Julian  day  number  0  dawned  in  4713  B.C. 
The  day  number  changes  always  at  12  :  00  UT. 
For  example,  the  Julian  day  2,451,545  began  at 
noon  in  January  1,  2000.  The  Julian  date  can  be 
computed  using  the  formulas  given  in  Box  2.2. 

Julian  dates  are  uncomfortably  big  numbers, 
and  therefore  modified  Julian  dates  are  often 
used.  The  zero  point  can  be  e.g.  January  1,  2000. 
Sometimes  0.5  is  subtracted  from  the  date  to 
make  it  to  coincide  with  the  date  corresponding  to 
the  UTC.  When  using  such  dates,  the  zero  point 
should  always  be  mentioned. 


Box  2.1  (Reduction  of  Coordinates)  Star  cat¬ 
alogues  give  coordinates  for  some  standard 


epoch.  In  the  following  we  give  the  formulas 
needed  to  reduce  the  coordinates  to  a  given  date 
and  time.  The  full  reduction  is  rather  labori¬ 
ous,  but  the  following  simplified  version  is  suf¬ 
ficient  for  most  practical  purposes. 

We  assume  that  the  coordinates  are  given  for 
the  epoch  J2000.0. 

1 .  First  correct  the  place  for  proper  motion  un¬ 
less  it  is  negligible. 

2.  Precess  the  coordinates  to  the  time  of  the 
observation.  First  we  use  the  coordinates  of 
the  standard  epoch  ( ao ,  5o)  to  find  a  unit 
vector  pointing  in  the  direction  of  the  star: 

(COS($o  cosao 

cos<$o  sinao 
sin  <$o 

Precession  changes  the  ecliptic  longitude  of 
the  object.  The  effect  on  right  ascension  and 
declination  can  be  calculated  as  three  rota¬ 
tions,  given  by  three  rotation  matrices.  By 
multiplying  these  matrices  we  get  the  com¬ 
bined  precession  matrix  that  maps  the  pre¬ 
vious  unit  vector  to  its  precessed  equiva¬ 
lent.  A  similar  matrix  can  be  derived  for  the 
nutation.  The  transformations  and  constants 
given  here  are  based  on  the  system  standard¬ 
ised  by  the  IAU  in  1976. 

The  precession  and  nutation  matrices 
contain  several  quantities  depending  on 
time.  The  time  variables  appearing  in  their 
expressions  are 

t  =  J  —  2,451,545.0, 

J  —  2,451,545.0 
““  36,525  ' 

Here  J  is  the  Julian  date  of  the  observa¬ 
tion,  t  the  number  of  days  since  the  epoch 
J2000.0  (i.e.  noon  of  January  1,  2000),  and 
T  the  same  interval  of  time  in  Julian  cen¬ 
turies. 

The  following  three  angles  are  needed 
for  the  precession  matrix 

f  =2306.218l"7’  +  0.30188"7’2 

+  0.017998"r3, 
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z  =2306.218l"r+  1.09468"r2 
+  0.018203"r3, 

e  =  2004.3 109" T  -  0.42665"7’2 
-0.041833"r3. 

The  precession  matrix  is  now 

/Pn  Pn  Ptf\ 

P  =  \p2\  P22  P23  I  • 

\^31  P32  P33/ 

The  elements  of  this  matrix  in  terms  of  the 
abovementioned  angles  are 

Pn  =  cos  z  cos  0  cos  £  —  sin z  sin  f, 

P\2  =  —  cos  z  cos  0  sinf  —  sin  z  cos  f , 

Pi 3  =  —  cos  z  sin  9, 

P21  =  sin  z  cos  0  cos  f  +  cos  z  sin  f, 

P22  =  —  sin  z  cos  0  sin  f  +  cos  z  cos  f , 

P23  =  —  sin  z  sin  0 , 

P31  =  sin  0  cos  f , 

P32  =  —  sin#  sinf , 

P33  =  cos  0 . 

The  new  coordinates  are  now  obtained  by 
multiplying  the  coordinates  of  the  standard 
epoch  by  the  precession  matrix: 

Pi  =  PPo- 

This  is  the  mean  place  at  the  given  time  and 
date. 

If  the  standard  epoch  is  not  J2000.0,  it  is 
probably  easiest  to  first  transform  the  given 
coordinates  to  the  epoch  J2000.0.  This  can 
be  done  by  computing  the  precession  matrix 
for  the  given  epoch  and  multiplying  the  co¬ 
ordinates  by  the  inverse  of  this  matrix.  In¬ 
verting  the  precession  matrix  is  easy:  we 
just  transpose  it,  i.e.  interchange  its  rows 
and  columns.  Thus  coordinates  given  for 
some  epoch  can  be  precessed  to  J2000.0  by 


multiplying  them  by 

/Pn  P21  P3A 

P-1  =  I  Pi 2  P22  P32  I  • 

\Pl3  P23  P33/ 

In  case  the  required  accuracy  is  higher  than 
about  one  minute  of  arc,  we  have  to  do  the 
following  further  corrections. 

3.  The  full  nutation  correction  is  rather  com¬ 
plicated.  The  nutation  used  in  astronomical 
almanacs  involves  series  expansions  con¬ 
taining  over  a  hundred  terms.  Very  often, 
though,  the  following  simple  form  is  suffi¬ 
cient.  We  begin  by  finding  the  mean  obliq¬ 
uity  of  the  ecliptic  at  the  observation  time: 

so  =  23°26/21.448//  -  46. 81 50" P 
—  0.00059"T2  +  0.001 8 13"r3. 

The  mean  obliquity  means  that  periodic  per¬ 
turbations  have  been  omitted.  The  formula 
is  valid  a  few  centuries  before  and  after  the 
year  2000. 

The  true  obliquity  of  the  ecliptic,  e,  is  ob¬ 
tained  by  adding  the  nutation  correction  to 
the  mean  obliquity: 

£  =  £()  +  As. 

The  effect  of  the  nutation  on  the  ecliptic 
longitude  (denoted  usually  by  Ai/s)  and  the 
obliquity  of  the  ecliptic  can  be  found  from 

Ci  =  125°  -  0.05295V, 

C2  =  200.9°  +  1.97129V, 

Axl/  =  -0.0048°  sin  Ci  -  0.0004°  sinC2, 
As  =  0.0026°  cos  Ci  +  0.0002°  cos  C2. 

Since  A\jr  and  As  are  very  small  angles, 
we  have,  for  example,  sinAi/f  ~  A^  and 
cos  At//-  ~  1,  when  the  angles  are  expressed 
in  radians.  Thus  we  get  the  nutation  matrix 

(1  —Axl/coss 
Ai/fcose  1 

Arsine  As 


—  A} Jr  sine 
—  As 

1 
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This  is  a  linearised  version  of  the  full  trans¬ 
formation.  The  angles  here  must  be  in  radi¬ 
ans.  The  place  in  the  coordinate  frame  of  the 
observing  time  is  now 

pi  =  np\- 

4.  The  annual  aberration  can  affect  the  place 
about  as  much  as  the  nutation.  Approximate 
corrections  are  obtained  from 

Aa  cos  8  =  —20.5"  sin  a  sin  A 

—  1 8.8"  cos  a  cos  A, 

A 8  =  20.5"  cos  a  sin  8  sin  A 

+  1 8. 8"  sin  a  sin  8  cos  A 

—  8.1"  cos  5  cos  A, 

where  A  is  the  ecliptic  longitude  of  the  Sun. 
Sufficiently  accurate  value  for  this  purpose 
is  given  by 

G  =  357.528°  +0.985600°;, 

A  =  280.460°  +  0.985647°t 

+  1.915°  sin  G  + 0.020°  sin2G. 

These  reductions  give  the  apparent  place  of 
the  date  with  an  accuracy  of  a  few  seconds 
of  arc.  The  effects  of  parallax  and  diurnal 
aberration  are  even  smaller. 

Example  The  coordinates  of  Regulus 
(< a  Leo)  for  the  epoch  J2000.0  are 

a  =  10  h  8  min  22.2  s  =  10.139500  h, 

<$  =  11°  58r  02"  =  11.967222°. 

Find  the  apparent  place  of  Regulus  on  March 
12,  1995. 

We  start  by  finding  the  unit  vector  corre¬ 
sponding  to  the  catalogued  place: 

/— 0.86449829\ 
p0  =  |  0.45787318  . 

V  0.20735204  ) 

The  Julian  date  is  J  =  2,449,789.0,  and 
thus  t  =  —1756  and  T  =  —0.04807666.  The 


angles  of  the  precession  matrix  are  £  = 
-0.03079849°,  z  =  -0.03079798°  and  0  = 
—0.02676709°.  The  precession  matrix  is  then 

P  = 

/  0.99999931  0.00107506  0.00046717  \ 

-0.00107506  0.99999942  -0.00000025  . 

0.00046717  -0.00000025  0.99999989  / 

The  precessed  unit  vector  is 

/— 0.86390858\ 
px  =  0.45880225  . 

\  0.20775577  / 

The  angles  needed  for  the  nutation  are 
AiA  =  0.00309516°,  As  =  -0.00186227°, 
s  =  23.43805403°,  which  give  the  nutation 
matrix 

N  = 

/  1  -0.00004956  — 0.00002149\ 

0.00004956  1  0.00003250  . 

y0. 00002149  -0.00003250  1  / 

The  place  in  the  frame  of  the  date  is 

/— 0.86393578\ 
p2=l  0.45876618  , 

V  0.20772230  ) 

whence 

a  =  10.135390  h, 

5  =  11.988906°. 

To  correct  for  the  aberration  we  first  find  the 
longitude  of  the  Sun:  G  =  —1373.2°  =  66.8°, 
A  =  —8.6°.  The  correction  terms  are  then 

Aa  =  18.25"  =  0.0050°, 

AS  =  -5.46"  =  -0.0015°. 

Adding  these  to  the  previously  obtained  coor¬ 
dinates  we  get  the  apparent  place  of  Regulus 
on  March  12,  1995: 

a  =  10.1357  h  =  10  h  8  min  8.5  s, 

5  =  11.9874°  =  11°  59r  15". 


44 


2  Spherical  Astronomy 


Comparison  with  the  places  given  in  the  cat¬ 
alogue  Apparent  Places  of  Fundamental  Stars 
shows  that  we  are  within  about  3"  of  the  correct 
place,  which  is  a  satisfactory  result. 

Box  2.2  (Julian  Date)  There  are  several  meth¬ 
ods  for  finding  the  Julian  date.  The  following 
one,  developed  by  Fliegel  and  Van  Flandern  in 
1968,  is  well  adapted  for  computer  programs. 
Let  y  be  the  year  (with  all  four  digits),  m  the 
month  and  d  the  day.  The  Julian  date  J  at  noon 
is  then 

J  =  367y  —  {7[y  +  (m  +  9)/12]}/4 
-(3{[y  +  (m-9)/7]/100+l})/4 
+  275m/9  +  d  +  1,721,029. 

The  division  here  means  an  integer  division,  the 
decimal  part  being  truncated:  e.g.  7/3  =  2  and 
-7/3  = -2. 

Example  Find  the  Julian  date  on  January  1, 
1990. 

Now  y  =  1990,  in  =  1  and  d  =  1. 

J  =  367  x  1990  -  7  x  [1990  +  (1  +  9)/12]/4 
-3  x  {[1990  + (1  -9)/7]/100  +  l}/4 
+  275  x  1/9  +  1  +  1,721,029 
=  730,330  -  3482  -  15  +  30+1 
+  1,721,029 
=  2,447,893. 

Astronomical  tables  usually  give  the  Julian 
date  at  0  UT.  In  this  case  that  would  be 
2,447,892.5. 

The  inverse  procedure  is  a  little  more  com¬ 
plicated.  In  the  following  J  is  the  Julian  date  at 
noon  (so  that  it  will  be  an  integer): 

a  =  J  +  68,569, 
b  =  (4$)/ 146, 097, 
c  =  a-  (146,097/?  +  3)/4, 
d  =  [4000(c+  1)]/ 1,461, 001, 


e  =  c  —  (1461d)/4  +  31, 

/  = (80e)/2447, 
day  =  e  -  (2447/)/80, 

8  =  ft  11. 

month  =  /  +  2  —  12  g, 
year  =  100(/?  —  49)  +  d  +  g. 

Example  In  the  previous  example  we  got 
J  =  2,447,893.  Let’s  check  this  by  calculating 
the  corresponding  calendar  date: 

a  =  2,447,893  +  68,569  =  2,516,462, 
b  =  (4  x  2,5 16,462)/ 146,097  =  68, 
c  =  2,516,462  -  (146,097  x  68  +  3)/4 
=  32,813, 

d  =  [4000(32,813  +  1)]  / 1 ,461 ,001  =  89, 
e  =  32,813  -  (1461  x  89)/4  +  31  =  337, 

/  =  (80  x  337)/2447  =  11, 
day  =  337  -  (2447  x  ll)/80  =  1, 

*  =  11/11  =  1, 

month  =11+2  —  12  x  1  =  1, 

year  =  100(68  -  49)  +  89  +  1  =  1990. 

Thus  we  arrived  back  to  the  original  date. 

Since  the  days  of  the  week  repeat  in  seven 
day  cycles,  the  remainder  of  the  division 
7/7  unambiguously  determines  the  day  of  the 
week.  If  J  is  the  Julian  date  at  noon,  the  re¬ 
mainder  of  J /I  tells  the  day  of  the  week  in  the 
following  way: 

0  =  Monday, 

5  =  Saturday, 

6  =  Sunday. 

Example  The  Julian  date  corresponding 
to  January  1,  1990  was  2,447,893.  Since 
2,447,893  =  7  x  349,699,  the  remainder  is 
zero,  and  the  day  was  Monday. 
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Example  2.1  (Trigonometric  Functions  in  a  Rect¬ 
angular  Spherical  Triangle)  Let  the  angle  A  be 
a  right  angle.  When  the  figure  is  a  plane  triangle, 
the  trigonometric  functions  of  the  angle  B  would 
be: 

sin  B  =  b/a,  cos  B  =  c/a,  tan  B  =  b/c. 

For  the  spherical  triangle  we  have  to  use  the 
equations  in  (2.7),  which  are  now  simply: 

sin  B  sin  a  =  sin  b, 
cos  B  sin  a  =  cos  b  sin  c, 
cos  a  =  cos  b  cos  c. 


The  first  equation  gives  the  sine  of  B : 
sin  B  =  sin  b/ sin  a. 

Dividing  the  second  equation  by  the  third  one,  we 
get  the  cosine  of  B : 

cos  B  =  tan  c/tan  a. 

And  the  tangent  is  obtained  by  dividing  the  first 
equation  by  the  second  one: 

tan  B  =  tan£>/sinc. 

The  third  equation  is  the  equivalent  of  the  Pytha¬ 
gorean  theorem  for  rectangular  triangles. 

Example  2.2  (Distance  Between  Two  Locations) 
The  latitude  of  Helsinki  is  about  0i  =  60°  and 
longitude  X\  =  25°.  The  corresponding  coordi¬ 
nates  of  La  Palma  are  02  =  28.7°  and  A  2  = 
—  17.9°.  What  is  the  distance  between  these  lo¬ 
cations  measured  along  the  surface  of  the  Earth. 

The  distance  can  be  found  by  applying  the  co¬ 
sine  formula  to  the  spherical  triangle  NHP: 


cos  a  =  cos(A-2  —  X\)  sin(90°  —  0i) 
x  sin(90°  -  02) 

+  cos(90°  —  0i )  cos(90°  —  02) 

=  COS(A-2  —  X 1 )  COS  01  COS  02 
+  sin  0i  sin  02) 

=  0.7325  x  0.5  x  0.8771 
+  0.8660  x  0.4802 
=  0.7372, 

from  which  a  =  42.5°. 

Another  method  is  to  use  vector  calculus  (Ap¬ 
pendix  A).  The  radius  vector  with  respect  to  the 
centre  of  the  Earth  is 

r  =  R  (cos  0  cos  X,  cos  0  sin  A,  sin0), 

where  R  is  the  radius  of  the  Earth.  In  the  follow¬ 
ing,  only  unit  vectors  are  needed,  and  thus  we  can 
omit  the  constant  R.  In  the  example  th  unit  vec¬ 
tors  are 

ri  =  (0.4532,  0.2113,  0.8660), 
r2  =  (0.8347,  -0.2696,  0.4802). 

The  scalar  product  of  these  vectors  gives  the  co¬ 
sine  of  the  angle  between  the  vectors: 

cos  a  =  r  1  •  r  2 

=  0.4532  x  0.8347  -  0.21 13  x  0.2696 
+  0.8660  x  0.4802 
=  0.7372, 

from  which  a  =  42.5°  =  0.7419  rad. 

Both  methods  give  the  same  angular  sepa¬ 
ration  between  the  two  locations.  The  distance 
along  the  great  circle  on  the  surface  of  the  Earth 
is  then  Ra  =  6400  x  0.7419  =  4748  km. 

Example  2.3  (The  Coordinates  of  New  York 
City)  The  geographic  coordinates  are  41°  north 
and  74°  west  of  Greenwich,  or  0  =  +41°,  A  = 
—74°.  In  time  units,  the  longitude  would  be 
74/15  h  =  4h  56  min  west  of  Greenwich.  The 
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geocentric  latitude  is  obtained  from 


tan  <p!  = 


tan  (j)  = 


6,356,752 

6,378,137 


2 

tan41° 


=  0.86347  =>  0'  =  40°  48'  34". 


The  geocentric  latitude  is  1 V  26"  less  than  the  ge¬ 
ographic  latitude. 


Example  2.4  The  angular  separation  of  two  ob¬ 
jects  in  the  sky  is  quite  different  from  their  coor¬ 
dinate  difference. 

Suppose  the  coordinates  of  a  star  A  are  a\  = 

10  h,  8 1  =70°  and  those  of  another  star  B,  a 2  = 

11  h,  82  =  80°. 

Using  the  Pythagorean  theorem  for  plane  tri¬ 
angles,  we  would  get 

d  =  s[(  15°)2  +  (10°)2=  18°. 


Example  2.5  Find  the  altitude  and  azimuth  of 
the  Moon  in  Helsinki  at  midnight  at  the  begin¬ 
ning  of  1996. 

The  right  ascension  is  a  =  2  h  55  min  7  s  = 
2.9186  h  and  declination  5  =  14°  42'  =  14.70°, 
the  sidereal  time  is  (9  =  6  h  19  min  26  s  = 
6.3239  h  and  latitude  0  =  60.16°. 

The  hour  angle  is  h  =  (9  —  a  =  3.4053  h  = 
51.08°.  Next  we  apply  the  equations  in  (2.16): 

sin  A  cos  a  =  sin  5 1 .08°  cos  14.70°  =  0.7526, 
cos  A  cos  a  =  cos  5 1.08°  cos  14.70°  sin  60. 16° 

—  sin  14.70°  cos  60. 16° 

=  0.4008, 

sin  <2  =  cos  5 1.08°  cos  14.70°  cos  60. 16° 

+  sin  14.70°  sin  60. 16° 


But  if  we  use  the  third  equation  in  (2.7),  we  get 


=  0.5225. 


cos  d  =  cos(aq  —  U2) 

X  sin(90°  —  8\)  sin(90°  —  82) 
+  cos(90°  -Si)cos(90°  -82) 
=  cos(oq  —  0L2 )  cos  8\  cos  82 
+  sin<5i  sin  <$2 
=  cos  15°  cos  70°  cos  80° 

+  sin  70°  sin  80° 

=  0.983, 


which  yields  d  =  10.6°.  The  figure  shows  why 
the  result  obtained  from  the  Pythagorean  theorem 
is  so  far  from  being  correct:  hour  circles  (circles 
with  a  =  constant)  approach  each  other  towards 
the  poles  and  their  angular  separation  becomes 
smaller,  though  the  coordinate  difference  remains 
the  same. 


Thus  the  altitude  is  a  =  31.5°.  To  find  the  az¬ 
imuth  we  have  to  compute  its  sine  and  cosine: 

sinA  =  0.8827,  cos  A  =  0.4701. 

Hence  the  azimuth  is  A  =  62.0°.  The  Moon  is 
in  the  southwest,  31.5  degrees  above  the  horizon. 
Actually,  this  would  be  the  direction  if  the  Moon 
were  infinitely  distant. 

Example  2.6  Find  the  topocentric  place  of  the 
Moon  in  the  case  of  the  previous  example. 

The  geocentric  distance  of  the  Moon  at  that 
time  is  R  =  62.58  equatorial  radii  of  the  Earth. 
For  simplicity,  we  can  assume  that  the  Earth  is 
spherical. 

We  set  up  a  rectangular  coordinate  frame  in 
such  a  way  that  the  z  axis  points  towards  the 
celestial  pole  and  the  observing  site  is  in  the 
vz  plane.  When  the  radius  of  the  Earth  is  used 
as  the  unit  of  distance,  the  radius  vector  of  the 
observing  site  is 

cos  <A  /0.4976\ 

0  r  0 

sin0/  \0.8674/ 
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The  radius  vector  of  the  Moon  is 

/  cos  8  cos  h  \  /  0.6077  \ 

r  =  R  I  —  cos5  sin/z  1  =  62.58  I  —0.7526  J  . 

V  sin 5  /  V  0.2538  / 

The  topocentric  place  of  the  Moon  is 

/  37.53  \ 

r  =  r  —  Vo  =  |  —47.10  1  . 

V  15.02  J 

We  divide  this  vector  by  its  length  62.07  to  get 
the  unit  vector  e  pointing  to  the  direction  of  the 
Moon.  This  can  be  expressed  in  terms  of  the 
topocentric  coordinates  8'  and  h'\ 

/  0.6047  \  /  cos 8' cosh'  \ 

e  =  I  —0.7588  J  =  |  —  cos^7 sink'  J  , 

\  0.2420  /  \  sin  8'  ) 

which  gives  8'  =  14.00°  and  h'  =  51.45°.  Next 
we  can  calculate  the  altitude  and  azimuth  as  in 
the  previous  example,  and  we  get  a  =  30.7°,  A  = 
61.9°. 

Another  way  to  find  the  altitude  is  to  take  the 
scalar  product  of  the  vectors  e  and  ro,  which 
gives  the  cosine  of  the  zenith  distance: 

cosz  =  e  •  ro  =  0.6047  x  0.4976 
+  0.2420  x  0.8674 
=  0.5108, 

whence  z  =  59.3°  and  a  =  90°  —  z  =  30.7°.  We 
see  that  this  is  0.8°  less  than  the  geocentric  alti¬ 
tude;  i.e.  the  difference  is  more  than  the  apparent 
diameter  of  the  Moon. 

Example  2.7  The  coordinates  of  Arcturus  are 
a  =  14  h  15.7  min,  8  =  19°  11'.  Find  the  side¬ 


real  time  at  the  moment  Arcturus  rises  or  sets  in 
Boston  (0  =  42°  19'). 

Neglecting  refraction,  we  get 

cos  h  =  —  tan  19°  lFtan42°  19; 

=  -0.348  x  0.910  =  -0.317. 

Hence,  h  =  ±108.47°  =  7  h  14  min.  The  more 
accurate  result  is 

cos  h  =  —  tan  19°  1 1'  tan  42°  19; 

sin35/ 

cos  19°  1  l/cos42°19/ 

=  -0.331, 

whence  h  =  ±109.35°  =  7  h  17  min.  The  plus 
and  minus  signs  correspond  to  setting  ant  ris¬ 
ing,  respectively.  When  Arcturus  rises,  the  side¬ 
real  time  is 

(~)  =  o'  ±  /?  =  1 4  h  16  min  —  7  h  17  min 
=  6  h  59  min 

and  when  it  sets,  the  sidereal  time  is 

@  =  14  h  16  min  ±  7  h  17  min 
=  21  h  33  min. 

Note  that  the  result  is  independent  of  the  date: 
a  star  rises  and  sets  at  the  same  sidereal  time  ev¬ 
ery  day. 

Example  2.8  The  proper  motion  of  Aldebaran  is 
li  =  0.20" /a  and  parallax  n  =  0.048".  The  spec¬ 
tral  line  of  iron  at  A  =  440.5  nm  is  displaced 
0.079  nm  towards  the  red.  What  are  the  radial  and 
tangential  velocities  and  the  total  velocity? 

The  radial  velocity  is  found  from 

AA  vY 
A  c 

0.079  o 

=>  vY  = - •  3  x  108  m/s 

440.5 

=  5.4  x  104  m/s  =  54  km/s. 
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The  tangential  velocity  is  now  given  by  (2.40), 
since  /x  and  n  are  in  correct  units: 


nt  =  4.74/xr  =  4.74/x/7r 


4.74  x  0.20 
0.048 


=  20  km/s. 


The  total  velocity  is 


Let  us  try  to  find  the  sidereal  time  in  Paris 
on  April  15  at  22:00,  Central  European  standard 
time  (=23:00  daylight  saving  time).  The  vernal 
equinox  occurs  on  the  average  on  March  21;  thus 
the  time  elapsed  since  the  equinox  is  10  +  15  = 
25  days.  Neglecting  the  equation  of  time,  the  lo¬ 
cal  time  T  is  52  minutes  less  than  the  zonal  time. 
Hence 


v  —  yj n2  +  n2  =  \/542  +  202  km/s  =  58  km/s. 

Example  2.9  Find  the  local  time  in  Paris  (longi¬ 
tude  A  =  2°)  at  12:00. 

Local  time  coincides  with  the  zonal  time  along 
the  meridian  15°  east  of  Greenwich.  Longitude 
difference  15°  —  2°  =  13°  equals  (13°/ 15°)  x 
60  min  =  52  minutes.  The  local  time  is  52  min¬ 
utes  less  than  the  official  time,  or  11:08.  This  is 
mean  solar  time.  To  find  the  true  solar  time,  we 
must  add  the  equation  of  time.  In  early  Febru¬ 
ary,  E.T.  =  — 14  min  and  the  true  solar  time  is 
11:08  —  14  min  =  10:54.  At  the  beginning  of 
November,  ET  =  +16  min  and  the  solar  time 
would  be  1 1 :24.  Since  — 14  min  and  +16  min  are 
the  extreme  values  of  E.T.,  the  true  solar  time  is 
in  the  range  10:54-1 1:24,  the  exact  time  depend¬ 
ing  on  the  day  of  the  year.  During  daylight  saving 
time,  we  must  still  subtract  one  hour  from  these 
times. 

Example  2.10  (Estimating  Sidereal  Time)  Since 
the  sidereal  time  is  the  hour  angle  of  the  vernal 
equinox  T,  it  is  0  h  when  T  culminates  or  transits 
the  south  meridian.  At  the  moment  of  the  vernal 
equinox,  the  Sun  is  in  the  direction  of  T  and  thus 
culminates  at  the  same  time  as  T.  So  the  sidereal 
time  at  12:00  local  solar  time  is  0:00,  and  at  the 
time  of  the  vernal  equinox,  we  have 

0  =  T  +  12  h, 

where  T  is  the  local  solar  time.  This  is  accu¬ 
rate  within  a  couple  of  minutes.  Since  the  sidereal 
time  runs  about  4  minutes  fast  a  day,  the  sidereal 
time,  n  days  after  the  vernal  equinox,  is 

(9^T  +  12h  +  ftx4  min. 

At  autumnal  equinox  T  culminates  at  0:00  local 
time,  and  sidereal  and  solar  times  are  equal. 


(9  =  r  +  12h  +  /ix4  min 

=  21  h  8  min  +  12  h  +  25  x  4  min 
=  34  h  48  min  =  10  h  48  min. 

The  time  of  the  vernal  equinox  can  vary  about  one 
day  in  either  direction  from  the  average.  There¬ 
fore  the  accuracy  of  the  result  is  roughly  5  min. 

Example  2.11  Find  the  rising  time  of  Arcturus 
in  Boston  on  January  10. 

In  Example  2.6  we  found  the  sidereal  time 
of  this  event,  0  =  6  h  59  min.  Since  we  do  not 
know  the  year,  we  use  the  rough  method  of  Ex¬ 
ample  2.9.  The  time  between  January  1  and  ver¬ 
nal  equinox  (March  21)  is  about  70  days.  Thus 
the  sidereal  time  on  January  1  is 

0  ~  T  +  12  h  —  70  x  4  min  =  T  +  7  h  20  min, 

from  which 

T  =  0  —  7  h  20  min  =  6  h  59  min  —  7  h  20  min 
=  30  h  59  min  —  7  h  20  min  =  23  h  39  min. 

The  longitude  of  Boston  is  7 1°  W,  and  the  Eastern 
standard  time  is  (4°/ 15°)  x  60  min  =16  minutes 
less,  or  23:23. 

Example  2.12  Find  the  sidereal  time  in  Helsinki 
on  April  15,  1982  at  20:00  UT. 

The  Julian  date  is  J  =  2,445,074.5  and 

_  2,445,074.5  -2,451,545.0 
““  36,525 

=  -0.1771526. 

Next,  we  use  (2.47)  to  find  the  sidereal  time  at 
OUT: 

0O  =  -1,506,521.0  s  =  -418  h  28  min  41  s 
=  13  h  31  min  19  s. 


2.17  Exercises 
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Since  the  sidereal  time  runs  3  min  57  s  fast  a  day 
as  compared  to  the  solar  time,  the  difference  in 
20  hours  will  be 

20 

—  x  3  min  57  s  =  3  min  17  s, 

24 

and  the  sidereal  time  at  20  UT  will  be  1 3  h  3 1  min 
19  s  +  20  h  3  min  17  s  =  33  h  34  min  36  s  =  9  h 
34  min  36  s. 

At  the  same  time  (at  22:00  Finnish  time, 
23:00  daylight  saving  time)  in  Helsinki  the  side¬ 
real  time  is  ahead  of  this  by  the  amount  cor¬ 
responding  to  the  longitude  of  Helsinki,  25°, 
i.e.  1  h  40  min  00  s.  Thus  the  sidereal  time  is 
11  h  14  min  36  s. 


2.17  Exercises 

Exercise  2.1  Find  the  distance  between  Helsinki 
and  Seattle  along  the  shortest  route.  Where  is  the 
northernmost  point  of  the  route,  and  what  is  its 
distance  from  the  North  Pole?  The  longitude  of 
Helsinki  is  25  °E  and  latitude  60°;  the  longitude 
of  Seattle  is  122°  W  and  latitude  48°.  Assume  that 
the  radius  of  the  Earth  is  6370  km. 

Exercise  2.2  A  star  crosses  the  south  meridian 
at  an  altitude  of  85°,  and  the  north  meridian  at 
45°.  Find  the  declination  of  the  star  and  the  lati¬ 
tude  of  the  observer. 

Exercise  2.3  Where  are  the  following  state¬ 
ments  true? 

(a)  Castor  (a  Gem,  declination  8  =  31°53/)  is 
circumpolar. 

(b)  Betelgeuze  ( a  Ori,  8  =  7°  247)  culminates  at 
zenith. 

(c)  a  Cen  (5  =  —60°  5 0r)  rises  to  an  altitude  of 
30°. 

Exercise  2.4  In  his  Old  Man  and  the  Sea  Hem¬ 
ingway  wrote: 

It  was  dark  now  as  it  becomes  dark  quickly  after 
the  Sun  sets  in  September.  He  lay  against  the  worn 


wood  of  the  bow  and  rested  all  that  he  could.  The 
first  stars  were  out.  He  did  not  know  the  name  of 
Rigel  but  he  saw  it  and  knew  soon  they  would  all 
be  out  and  he  would  have  all  his  distant  friends. 

How  was  Hemingway’s  astronomy? 

Exercise  2.5  The  right  ascension  of  the  Sun  on 
June  1,  1983,  was  4h  35  min  and  declination 
22°  002  Find  the  ecliptic  longitude  and  latitude 
of  the  Sun  and  the  Earth. 

Exercise  2.6  Show  that  on  the  Arctic  Circle  the 
Sun 

(a)  rises  at  the  same  sidereal  time  ®o  between 
December  22  and  June  22, 

(b)  sets  at  the  same  sidereal  time  ®o  between 
June  22  and  December  22. 

What  is  (90? 

Exercise  2.7  Derive  the  equations  (2.24),  which 
give  the  galactic  coordinates  as  functions  of  the 
ecliptic  coordinates. 

Exercise  2.8  The  coordinates  of  Sirius  for  the 
epoch  1900.0  were  a  =  6  h  40  min  45  s,  8  = 
—  16°  35',  and  the  components  of  its  proper  mo¬ 
tion  were  \ia  =  —0.037  s/a,  pis  =  —  1.12"a-1. 
Find  the  coordinates  of  Sirius  for  2000.0.  The 
precession  must  also  be  taken  into  account. 

Exercise  2.9  The  parallax  of  Sirius  is  0.375" 
and  radial  velocity  —8  km/s. 

(a)  What  are  the  tangential  and  total  velocities  of 
Sirius?  (See  also  the  previous  exercise.) 

(b)  When  will  Sirius  be  closest  to  the  Sun? 

(c)  What  will  its  proper  motion  and  parallax  be 
then? 

Exercise  2.10  The  average  period  of  the  vari¬ 
able  star  Mira  Ceti  is  331.96  days.  According  to 
a  catalogue  the  brightness  was  at  maximum  in 
September  22,  2000.  When  the  star  was  brightest 
in  2010? 
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Up  to  the  end  of  the  Middle  Ages,  the  most  im¬ 
portant  means  of  observation  in  astronomy  was 
the  human  eye.  It  was  aided  by  various  mechan¬ 
ical  devices  to  measure  the  positions  of  celes¬ 
tial  bodies  in  the  sky.  The  telescope  was  invented 
in  Holland  at  the  beginning  of  the  17th  century, 
and  in  1609  Galileo  Galilei  made  his  first  astro¬ 
nomical  observations  with  this  new  instrument. 
Astronomical  photography  was  introduced  at  the 
end  of  the  19th  century,  and  during  the  last  few 
decades  many  kinds  of  electronic  detectors  have 
been  adopted  for  the  study  of  electromagnetic 
radiation  from  space.  The  electromagnetic  spec¬ 
trum  from  the  shortest  gamma  rays  to  long  radio 
waves  can  now  be  used  for  astronomical  observa¬ 
tions. 


3.1  Observing  Through  the 
Atmosphere 

With  satellites  and  spacecraft,  observations  can 
be  made  outside  the  atmosphere.  Yet,  the  great 
majority  of  astronomical  observations  are  carried 
out  from  the  surface  of  the  Earth.  In  the  preceding 
chapter,  we  discussed  refraction,  which  changes 
the  apparent  altitudes  of  objects.  The  atmosphere 
affects  observations  in  many  other,  more  seri¬ 
ous  ways  as  well.  Real  problems  are  the  weather, 
restlessness  of  the  atmosphere,  and,  worst  of  all, 
opacity  of  the  atmosphere  in  many  wavelength  re¬ 
gions. 

The  air  is  never  quite  steady,  and  there  are  lay¬ 
ers  with  different  temperatures  and  densities;  this 
causes  convection  and  turbulence.  When  the  light 


from  a  star  passes  through  the  unsteady  air,  rapid 
changes  in  refraction  in  different  directions  re¬ 
sult.  Thus,  the  amount  of  light  reaching  a  detec¬ 
tor,  e.g.  the  human  eye,  constantly  varies;  the  star 
is  said  to  scintillate  (Fig.  3.1).  Planets  shine  more 
steadily,  since  they  are  not  point  sources  like  the 
stars,  and  variations  in  the  light  coming  from  dif¬ 
ferent  parts  partly  cancel  each  others. 

When  a  wavefront  passes  an  aperture  its  am¬ 
plitude  and  phase  vary  with  time.  The  amplitude 
variations  cause  scintillation  and  phase  changes 
blur  the  image.  A  telescope  collects  light  over 
a  larger  area,  which  evens  out  rapid  changes 
and  diminishes  scintillation.  Instead,  differences 
in  the  phase  do  not  cancel  out  the  same  way; 
they  smear  the  image  and  point  sources  are  seen 
in  telescopes  as  vibrating  speckles.  This  phe¬ 
nomenon  is  called  seeing.  When  the  exposure 
time  is  at  least  a  few  seconds  the  light  oscillates 
in  all  directions,  and  the  stellar  image  becomes  a 
round  disk  (Fig.  3.3).  The  diameter  of  the  seeing 
disk  can  vary  from  less  than  an  arc  second  to  even 
tens  of  arc  seconds.  If  the  size  of  the  disk  small, 
the  seeing  is  good.  It  is  only  in  good  conditions 
on  high  mountains  that  the  seeing  is  less  than  one 
arc  second. 

Seeing  limits  how  small  details  can  be  seen 
with  a  telescope.  Sometimes  the  atmosphere  can 
calm  down  for  a  short  period  of  time,  and  then  it 
is  possible  to  glimpse  visually  details  that  are  not 
seen  in  pictures  exposed  for  a  longer  time. 

Some  wavelength  regions  in  the  electromag¬ 
netic  spectrum  are  strongly  absorbed  by  the  atmo¬ 
sphere  (Fig.  3.2).  The  most  important  transparent 
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interval  is  the  optical  window  from  about  300  to 
800  nm.  This  interval  coincides  with  the  region  of 
sensitivity  of  the  human  eye  (about  400-700  nm). 

At  wavelengths  under  300  nm  absorption  by 
atmospheric  ozone  prevents  radiation  from  reach¬ 
ing  the  ground.  The  ozone  is  concentrated  in 
a  thin  layer  at  a  height  of  about  20-30  km,  and 
this  layer  protects  the  Earth  from  harmful  ultra¬ 
violet  radiation.  At  still  shorter  wavelengths,  the 


main  absorbers  are  O2,  N?  and  free  atoms.  Nearly 
all  of  the  radiation  under  300  nm  is  absorbed  by 
the  upper  parts  of  the  atmosphere. 

At  wavelengths  longer  than  visible  light,  in 
the  near-infrared  region,  the  atmosphere  is  fairly 
transparent  up  to  1.3  pm.  There  are  some  absorp¬ 
tion  belts  caused  by  water  and  molecular  oxy¬ 
gen,  but  the  atmosphere  gets  more  opaque  only 
at  wavelengths  of  longer  than  1.3  pm.  At  these 


Fig.  3.1  Scintillation  of 
Sirius  during  four  passes 
across  the  field  of  view. 
The  star  was  very  low  on 
the  horizon.  (Photo  by 
Pekka  Parviainen) 
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Fig.  3.2  The  transparency  of  the  atmosphere  at  different  sorbed  by  interstellar  gas,  as  shown  in  the  lowermost  very 
wavelengths.  100  %  transmission  means  that  all  radiation  schematic  figure.  The  interstellar  absorption  also  varies 
reaches  the  surface  of  the  Earth.  The  radiation  is  also  ab-  very  much  depending  on  the  direction  (Chap.  15) 
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Fig.  3.3  Stellar  images  vibrate  about  equally  often  in  all 
directions,  on  the  average.  If  the  picture  is  exposed  for 
more  than  about  one  second  the  image  of  the  star  spreads 
into  a  circular  seeing  disc.  The  better  the  observing  condi¬ 
tions  are  the  smaller  is  the  diameter  of  the  disc 

wavelengths,  radiation  reaches  the  lower  parts  of 
the  atmosphere  only  in  a  few  narrow  windows. 
All  wavelengths  between  20  pm  and  1  mm  are 
totally  absorbed. 

At  wavelengths  longer  than  1  mm,  there  is  the 
radio  window  extending  up  to  about  20  m.  At 
still  longer  wavelengths,  the  ionosphere  in  the  up¬ 
per  parts  of  the  atmosphere  reflects  all  radiation 
(Fig.  3.2).  The  exact  upper  limit  of  the  radio  win¬ 
dow  depends  on  the  strength  of  the  ionosphere, 
which  varies  during  the  day.  (The  structure  of  the 
atmosphere  is  described  in  Chap.  7.) 

At  optical  wavelengths  (300-800  nm),  light 
is  scattered  by  the  molecules  and  dust  in  the  at¬ 
mosphere,  and  the  radiation  is  attenuated.  Scat¬ 
tering  and  absorption  together  are  called  extinc¬ 
tion.  Extinction  must  be  taken  into  account  when 
one  measures  the  brightness  of  celestial  bodies 
(Chap.  4). 

In  the  19th  century  Lord  Rayleigh  succeeded 
in  explaining  why  the  sky  is  blue.  Scattering 
caused  by  the  molecules  in  the  atmosphere  is  in¬ 
versely  proportional  to  the  fourth  power  of  the 
wavelength.  Thus,  blue  light  is  scattered  more 
than  red  light.  The  blue  light  we  see  all  over  the 
sky  is  scattered  sunlight.  The  same  phenomenon 


Fig.  3.4  Night  views  from  the  top  of  Mount  Wilson.  The 
upper  photo  was  taken  in  1908,  the  lower  one  in  1988.  The 
lights  of  Los  Angeles,  Pasadena,  Hollywood  and  more 
than  40  other  towns  are  reflected  in  the  sky,  causing  con¬ 
siderable  disturbance  to  astronomical  observations.  (Pho¬ 
tos  by  Ferdinand  Ellerman  and  International  Dark-Sky  As¬ 
sociation) 

colours  the  setting  sun  red,  because  owing  to  the 
long,  oblique  path  through  the  atmosphere,  all  the 
blue  light  has  been  scattered  away. 

In  astronomy  one  often  has  to  observe  very 
faint  objects.  Thus,  it  is  important  that  the  back¬ 
ground  sky  be  as  dark  as  possible,  and  the  atmo¬ 
sphere  as  transparent  as  possible.  That  is  why  the 
large  observatories  have  been  built  on  mountain 
tops  far  from  the  cities.  The  air  above  an  observa¬ 
tory  site  must  be  very  dry,  the  number  of  cloudy 
nights  few,  and  the  seeing  good. 

Astronomers  have  looked  all  over  the  Earth  for 
optimal  conditions  and  have  found  some  excep¬ 
tional  sites.  In  the  1970’s,  several  new  major  ob¬ 
servatories  were  founded  at  these  sites.  Among 
the  best  sites  in  the  world  are:  the  extinct  vol¬ 
cano  Mauna  Kea  on  Hawaii,  rising  more  than 
4000  m  above  the  sea;  the  dry  mountains  in  north¬ 
ern  Chile;  the  Sonoran  desert  in  the  U.S.,  near 
the  border  of  Mexico;  and  the  mountains  on  La 
Palma,  in  the  Canary  Islands.  Many  older  ob¬ 
servatories  are  severely  plagued  by  the  lights  of 
nearby  cities  (Fig.  3.4). 
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In  radio  astronomy  atmospheric  conditions  are 
not  very  critical  except  when  observing  at  the 
shortest  wavelengths,  damped  by  atmospheric 
moisture.  Constructors  of  radio  telescopes  have 
much  greater  freedom  in  choosing  their  sites  than 
optical  astronomers.  Still,  radio  telescopes  are 
also  often  constructed  in  uninhabited  places  to 
isolate  them  from  disturbing  radio  and  television 
broadcasts. 


3.2  Optical  Telescopes 

The  telescope  fulfils  three  major  tasks  in  astro¬ 
nomical  observations: 

1.  It  collects  light  from  a  large  area,  making  it 
possible  to  study  very  faint  sources. 

2.  It  improves  resolution  and  increases  the  appar¬ 
ent  angular  diameter  of  the  object. 

3.  It  is  used  to  measure  the  positions  of  objects. 

The  light-collecting  surface  in  a  telescope 
is  either  a  lens  or  a  mirror.  Thus,  optical  tele¬ 
scopes  are  divided  into  two  types,  lens  telescopes 
or  refractors  and  mirror  telescopes  or  reflectors 
(Fig.  3.5). 

Geometrical  Optics  Refractors  have  two  len¬ 
ses,  the  objective  which  collects  the  incoming 
light  and  forms  an  image  in  the  focal  plane,  and 
the  eyepiece  which  is  a  small  magnifying  glass 
for  looking  at  the  image  (Fig.  3.5).  The  lenses 
are  at  the  opposite  ends  of  a  tube  which  can  be 
directed  towards  any  desired  point.  The  distance 


Fig.  3.5  A  lens  telescope  or  refractor  and  a  mirror  tele¬ 
scope  or  reflector 


between  the  eyepiece  and  the  focal  plane  can  be 
adjusted  to  get  the  image  into  focus.  The  image 
formed  by  the  objective  lens  can  also  be  regis¬ 
tered,  e.g.  on  a  photographic  film,  as  in  an  ordi¬ 
nary  camera. 

The  diameter  of  the  objective,  D,  is  called  the 
aperture  of  the  telescope.  The  ratio  of  the  aper¬ 
ture  D  to  the  focal  length  /,  F  =  D/f,  is  called 
the  aperture  ratio.  This  quantity  is  used  to  charac¬ 
terise  the  light-gathering  power  of  the  telescope. 
If  the  aperture  ratio  is  large,  near  unity,  one  has 
a  powerful,  “fast”  telescope;  this  means  that  one 
can  take  photographs  using  short  exposures,  since 
the  image  is  bright.  A  small  aperture  ratio  (the  fo¬ 
cal  length  much  greater  than  the  aperture)  means 
a  “slow”  telescope. 

In  astronomy,  as  in  photography,  the  aperture 
ratio  is  often  denoted  by  f/n  (e.g.  // 8),  where 
n  is  the  focal  length  divided  by  the  aperture.  For 
fast  telescopes  this  ratio  can  be  // 1 . . .  //  3,  but 
usually  it  is  smaller,  //8  . . .  //15. 

The  scale  of  the  image  formed  in  the  focal 
plane  of  a  refractor  can  be  geometrically  deter¬ 
mined  from  Fig.  3.6.  When  the  object  is  seen  at 
the  angle  u ,  it  forms  an  image  of  height  s, 

s  =  f  tan  w  ^  fu,  (3.1) 

since  u  is  a  very  small  angle.  If  the  telescope  has 
a  focal  length  of,  for  instance,  343  cm,  one  arc 
minute  corresponds  to 

s  =  343  cm  x  Y 
=  343  cm  x  (1/60)  x  (tt/180) 

=  1  mm. 


Fig.  3.6  The  scale  and  magnification  of  a  refractor.  The 
object  subtends  an  angle  u.  The  objective  forms  an  image 
of  the  object  in  the  focal  plane.  When  the  image  is  viewed 
through  the  eyepiece,  it  is  seen  at  an  angle  u! 


3.2  Optical  Telescopes 


55 


Fig.  3.7  Diffraction  and  resolving  power.  The  image  of 
a  single  star  (a)  consists  of  concentric  diffraction  rings, 
which  can  be  displayed  as  a  mountain  diagram  (b).  Wide 
pairs  of  stars  can  be  easily  resolved  (c).  For  resolving  close 

The  magnification  go  is  (from  Fig.  3.6) 

co  =  u'/u  «  ///',  (3.2) 

where  we  have  used  the  equation  s  =  fu.  Here, 
/  is  the  focal  length  of  the  objective  and  /'  that 
of  the  eyepiece.  For  example,  if  /  =  100  cm  and 
we  use  an  eyepiece  with  f  —  2  cm,  the  magni¬ 
fication  is  50-fold.  The  magnification  is  not  an 
essential  feature  of  a  telescope,  since  it  can  be 
changed  simply  by  changing  the  eyepiece. 

A  more  important  characteristic,  which  de¬ 
pends  on  the  aperture  of  the  telescope,  is  the  re¬ 
solving  power ,  which  determines,  for  example, 
the  minimum  angular  separation  of  the  compo¬ 
nents  of  a  binary  star  that  can  be  seen  as  two  sep¬ 
arate  stars.  The  theoretical  limit  for  the  resolution 
is  set  by  the  diffraction  of  light:  The  telescope 
does  not  form  a  point  image  of  a  star,  but  rather 
a  small  disk,  since  light  “bends  around  the  cor¬ 
ner”  like  all  radiation  (Fig.  3.7). 

The  theoretical  resolution  of  a  telescope  is  of¬ 
ten  given  in  the  form  introduced  by  Rayleigh  (see 
Box  3.1) 

sinO^O  =  1.22X/D,  [6>]=rad.  (3.3) 


binaries,  different  criteria  can  be  used.  One  is  the  Rayleigh 
limit  1 .22  X/D  (d).  In  practice,  the  resolution  can  be  writ¬ 
ten  X/D,  which  is  near  the  Dawes  limit  (e).  (Photo  (a)  Sky 
and  Telescope) 

As  a  practical  rule,  we  can  say  that  two  objects 
are  seen  as  separate  if  the  angular  distance  be¬ 
tween  them  is 

6>X/D,  [G]=  rad.  (3.4) 

This  formula  can  be  applied  to  optical  as  well  as 
radio  telescopes.  For  example,  if  one  makes  ob¬ 
servations  at  a  typical  yellow  wavelength  (A  = 
550  nm),  the  resolving  power  of  a  reflector  with 
an  aperture  of  1  m  is  about  0.1".  However,  seeing 
spreads  out  the  image  to  a  diameter  of  typically 
one  arc  second.  Thus,  the  theoretical  diffraction 
limit  cannot  usually  be  reached  on  the  surface  of 
the  Earth. 

In  photography  the  resolution  is  further  re¬ 
stricted  by  the  properties  of  the  photographic 
plate  or  the  pixel  size  of  the  CCD  camera,  de¬ 
creasing  the  resolution  as  compared  with  vi¬ 
sual  observations.  The  grain  size  of  photographic 
emulsions  is  about  0.01-0.03  mm,  which  is  also 
the  minimum  size  of  the  image.  For  a  focal  length 
of  1  m,  the  scale  is  1  mm  =  206",  and  thus 
0.01  mm  corresponds  to  about  2  arc  seconds.  This 
is  similar  to  the  theoretical  resolution  of  a  tele¬ 
scope  with  an  aperture  of  7  cm  in  visual  obser- 
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vations.  The  pixel  sizes  of  CCD  cameras  have 
decreased,  and  currently  the  resolution  can  be  as 
good  as  or  even  better  than  the  resolution  of  pho¬ 
tographic  films. 

In  practice,  the  resolution  of  visual  observa¬ 
tions  is  determined  by  the  ability  of  the  eye  to  see 
details.  In  night  vision  (when  the  eye  is  perfectly 
adapted  to  darkness)  the  resolving  capability  of 
the  human  eye  is  about  2! . 

The  maximum  magnification  comSLX  is  the 
largest  magnification  that  is  worth  using  in  tele¬ 
scopic  observations.  Its  value  is  obtained  from 
the  ratio  of  the  resolving  capability  of  the  eye, 
^~2/  =  5.8  x  10~4  rad,  to  the  resolving  power 
of  the  telescope,  0 , 


^max  —  e/0  «  C If  /k  — 


5.8  X  10“4D 
5.5  x  10-7  m 


^  D/l  mm. 


(3.5) 


If  we  use,  for  example,  an  objective  with  a  di¬ 
ameter  of  100  mm,  the  maximum  magnification 
is  about  100.  If  larger  magnifications  are  used 
the  object  is  seen  bigger  but  no  more  details 
are  resolved.  Especially  small  and  cheap  amateur 
telescopes  are  advertised  having  huge  magnifi¬ 
cations,  which  are  useless;  rather  they  make  ob¬ 
serving  very  difficult.  However,  experienced  ob¬ 
servers  may  sometimes  use  very  big  magnifica¬ 
tions  in  some  special  occasions. 

The  minimum  magnification  com jn  is  the  small¬ 
est  magnification  that  is  useful  in  visual  observa¬ 
tions.  Its  value  is  obtained  from  the  condition  that 
the  diameter  of  the  exit  pupil  L  of  the  telescope 
must  be  smaller  than  or  equal  to  the  pupil  of  the 
eye. 

The  exit  pupil  is  the  image  of  the  objective 
lens,  formed  by  the  eyepiece,  through  which  the 
light  from  the  objective  goes  behind  the  eyepiece. 
From  Fig.  3.8  we  obtain 

f  D 

L  =  —D  =  — .  (3.6) 

/  (O 

Thus  the  condition  L  <d  means  that 

co  >  D/d.  (3.7) 


/  /' 

- - — » 


Fig.  3.8  The  exit  pupil  L  is  the  image  of  the  objective 
lens  formed  by  the  eyepiece 


Fig.  3.9  Chromatic  aberration.  Light  rays  of  different 
colours  are  refracted  to  different  focal  points  (left).  The 
aberration  can  be  corrected  with  an  achromatic  lens  con¬ 
sisting  of  two  parts  (right) 


In  the  night,  the  diameter  of  the  pupil  of  the  hu¬ 
man  eye  is  about  6  mm,  and  thus  the  minimum 
magnification  of  a  100  mm  telescope  is  about  17. 

Refractors  In  the  first  refractors,  which  had 
a  simple  objective  lens,  the  observations  were 
hampered  by  the  chromatic  aberration.  Since 
glass  refracts  different  colours  by  different 
amounts,  all  colours  do  not  meet  at  the  same  fo¬ 
cal  point  (Fig.  3.9),  but  the  focal  length  increases 
with  increasing  wavelength.  To  remove  this  aber¬ 
ration,  achromatic  lenses  consisting  of  two  parts 
were  developed  in  the  18th  century.  The  colour 
dependence  of  the  focal  length  is  much  smaller 
than  in  single  lenses,  and  at  some  wavelength,  Ao, 
the  focal  length  has  an  extremum  (usually  a  min¬ 
imum).  Near  this  point  the  change  of  focal  length 
with  wavelength  is  very  small  (Fig.  3.10).  If  the 
telescope  is  intended  for  visual  observations,  we 
choose  Ao  =  550  nm,  corresponding  to  the  maxi¬ 
mum  sensitivity  of  the  eye.  Objectives  for  photo¬ 
graphic  refractors  were  usually  constructed  with 
Ao  ~  425  nm,  since  normal  photographic  plates 
were  most  sensitive  to  the  blue  part  of  the  spec¬ 
trum. 
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Fig.  3.10  The  wavelength  dependence  of  the  focal  length 
of  a  typical  achromatic  objective  for  visual  observations. 
The  focal  length  has  a  minimum  near  X  =  550  nm,  where 
the  eye  is  most  sensitive.  In  bluer  light  (X  =  450  nm)  or 
in  redder  light  (X  =  800  nm),  the  focal  length  increases  by 
a  factor  of  about  1.002 

By  combining  three  or  even  more  lenses  of  dif¬ 
ferent  glasses  in  the  objective,  the  chromatic  aber¬ 
ration  can  be  corrected  still  better  (as  in  apochro- 
matic  objectives).  Also,  special  glasses  have  been 
developed  where  the  wavelength  dependences  of 
the  refractive  index  cancel  out  so  well  that  two 
lenses  already  give  a  very  good  correction  of  the 
chromatic  aberration.  They  have,  however,  hardly 
been  used  in  astronomy  so  far. 

The  largest  refractors  in  the  world  have  an 
aperture  of  about  one  metre  (102  cm  in  the  Yerkes 
Observatory  telescope  (Fig.  3.11),  finished  in 
1897,  and  91  cm  in  the  Lick  Observatory  tele¬ 
scope  (1888)).  The  aperture  ratio  is  typically 
//10...//20. 

The  use  of  refractors  is  limited  by  their  small 
field  of  view  and  awkwardly  long  structure.  Re¬ 
fractors  are  still  used,  e.g.  in  solar  telescopes  and 
various  meridian  telescopes  for  measuring  the  po¬ 
sitions  of  stars,  and  also  for  visual  observations  of 
binary  stars. 

A  wider  field  of  view  is  obtained  by  using 
more  complex  lens  systems,  and  telescopes  of 
this  kind  are  called  astrographs.  Astrographs 
have  an  objective  made  up  of  typically  3-5  lenses 
and  an  aperture  of  less  than  60  cm.  The  aperture 
ratio  is  f/5  . . .  f/1  and  the  field  of  view  about  5°. 
Astrographs  are  used  to  photograph  large  areas  of 
the  sky,  e.g.  for  proper  motion  studies  and  for  sta¬ 
tistical  brightness  studies  of  the  stars. 


Reflectors  By  far  the  most  common  telescope 
type  in  astrophysical  research  nowadays  is  the 
mirror  telescope  or  reflector.  As  a  light-collecting 
surface,  it  employs  a  mirror  coated  with  a  thin 
layer  of  aluminium.  The  form  of  the  mirror  is  usu¬ 
ally  parabolic.  A  parabolic  mirror  reflects  all  light 
rays  entering  the  telescope  parallel  to  the  main 
axis  into  the  same  focal  point.  The  image  formed 
at  this  point  can  be  observed  through  an  eyepiece 
or  registered  with  a  detector.  One  of  the  advan¬ 
tages  of  reflectors  is  the  absence  of  chromatic 
aberration,  since  all  wavelengths  are  reflected  to 
the  same  point. 

In  the  very  largest  telescopes,  the  instruments 
can  be  installed  at  the  primary  focus  (Fig.  3.12) 
without  eclipsing  too  much  of  the  incoming  light. 
Sometimes  even  the  observer  could  sit  in  a  cage 
at  the  primary  focus.  In  smaller  telescopes,  this 
is  not  possible,  and  the  image  must  be  inspected 
from  outside  the  telescope.  In  modern  telescopes 
instruments  are  remotely  controlled,  and  the  ob¬ 
server  must  stay  away  from  the  telescope  to  re¬ 
duce  thermal  turbulence. 

The  idea  of  a  reflector  was  described  in  1663 
by  James  Gregory.  The  first  practical  reflector, 
however,  was  built  by  Isaac  Newton.  He  guided 
the  light  perpendicularly  out  from  the  telescope 
with  a  small  flat  mirror.  Therefore  the  focus  of 
the  image  in  such  a  system  is  called  the  New¬ 
ton  focus.  A  typical  aperture  ratio  of  a  Newtonian 
telescope  is  f/3  . . .  // 10.  Another  possibility  is 
to  bore  a  hole  at  the  centre  of  the  primary  mirror 
and  reflect  the  rays  through  it  with  a  small  hy¬ 
perbolic  secondary  mirror  in  the  front  end  of  the 
telescope.  In  such  a  design,  the  rays  meet  in  the 
Cassegrain  focus.  Cassegrain  systems  have  aper¬ 
ture  ratios  of  //8 . . .  //15. 

The  effective  focal  length  (/e)  of  a  Cassegrain 
telescope  is  determined  by  the  position  and  con¬ 
vexity  of  the  secondary  mirror.  Using  the  nota¬ 
tions  of  Fig.  3.13,  we  get 

/e=-/p.  (3.8) 

a 

If  we  choose  a  <X^b,  we  have  /e  /p.  In  this 
way  one  can  construct  short  telescopes  with  long 
focal  lengths.  Cassegrain  systems  are  especially 
well  suited  for  spectrographic,  photometric  and 
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other  instruments,  which  can  be  mounted  in  the 
secondary  focus,  easily  accessible  to  the  observer. 

More  complicated  arrangements  use  several 
mirrors  to  guide  the  light  through  the  declination 
axis  of  the  telescope  to  a  fixed  coude  focus  (from 


the  French  word  couder ,  to  bend),  which  can  even 
be  situated  in  a  separate  room  near  the  telescope 
(Fig.  3.14).  The  focal  length  is  thus  very  long 
and  the  aperture  ratio  // 30 . . .  // 40.  The  coude 
focus  is  used  mainly  for  accurate  spectroscopy, 


Fig.  3.1 1  The  largest 
refractor  in  the  world  is  at 
the  Yerkes  Observatory, 
University  of  Chicago.  It 
has  an  objective  lens  with 
a  diameter  of  102  cm. 
(Photo  by  Yerkes 
Observatory) 


Fig.  3.12  Different  locations  of  the  focus  in  reflectors:  observations  near  the  celestial  pole.  More  complex  coude 
primary  focus,  Newton  focus,  Cassegrain  focus  and  coude  systems  usually  have  three  flat  mirrors  after  the  primary 
focus.  The  coude  system  in  this  figure  cannot  be  used  for  and  secondary  mirrors 
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since  the  large  spectrographs  can  be  stationary 
and  their  temperature  can  be  held  accurately  con¬ 
stant.  Nowadays  this  can  also  be  achieved  by  us¬ 
ing  optical  fibers.  A  drawback  is  that  much  light 
is  lost  in  the  reflections  in  the  several  mirrors 
of  the  coude  system.  An  aluminised  mirror  re¬ 
flects  about  80  %  of  the  light  falling  on  it,  and 
thus  in  a  coude  system  of,  e.g.  five  mirrors  (in¬ 
cluding  the  primary  and  secondary  mirrors),  only 


Fig.  3.1 3  The  principle  of  a  Cassegrain  reflector.  A  con¬ 
cave  (paraboloid)  primary  mirror  M\  reflects  the  light  rays 
parallel  to  the  main  axis  towards  the  primary  focus  Si. 
A  convex  secondary  mirror  M2  (hyperboloid)  reflects  the 
rays  back  through  a  small  hole  at  the  centre  of  the  main 
mirror  to  the  secondary  focus  S2  outside  the  telescope 


0.85  ~  30  %  of  the  light  reaches  the  detector.  In 
new  big  telescopes  the  large  instruments  are  usu¬ 
ally  mounted  at  the  Nasmyth  focus  at  the  end  of 
the  horizontal  axis  of  the  telescope. 

The  reflector  has  its  own  aberration,  coma.  It 
affects  images  displaced  from  the  optical  axis. 
Light  rays  do  not  converge  at  one  point,  but  form 
a  figure  like  a  comet.  Due  to  the  coma,  the  clas¬ 
sical  reflector  with  a  paraboloid  mirror  has  a  very 
small  correct  field  of  view.  The  coma  limits  the 
diameter  of  the  useful  field  to  2-20  minutes  of 
arc,  depending  on  the  aperture  ratio  of  the  tele¬ 
scope.  The  5  m  Palomar  telescope,  for  instance, 
has  a  useful  field  of  view  of  about  4r,  correspond¬ 
ing  to  about  one-eighth  of  the  diameter  of  the 
Moon.  In  practice,  the  small  field  of  view  can  be 
enlarged  by  various  correcting  lenses. 

If  the  primary  mirror  were  spherical,  there 
would  be  no  coma.  However,  this  kind  of  mir¬ 
ror  has  its  own  error,  spherical  aberration :  light 
rays  from  the  centre  and  edges  converge  at  dif¬ 
ferent  points.  To  remove  the  spherical  aberration, 
the  Estonian  astronomer  Bernhard  Schmidt  de¬ 
veloped  a  thin  correcting  lens  that  is  placed  in 
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Fig.  3.1 4  The  coude  system  of  the  Kitt  Peak  2.1  m  reflector.  (Drawing  National  Optical  Astronomy  Observatories,  Kitt 
Peak  National  Observatory) 
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the  way  of  the  incoming  light.  Schmidt  cameras 
(Figs.  3.15  and  3.16)  have  a  very  wide  (about  7°), 
nearly  faultless  field  of  view,  and  the  correcting 
lens  is  so  thin  that  it  absorbs  very  little  light.  The 
images  of  the  stars  are  very  sharp.  In  Schmidt 
telescopes  the  diaphragm  with  the  correcting  lens 
is  positioned  at  the  centre  of  the  radius  of  curva¬ 
ture  of  the  mirror  (this  radius  equals  twice  the  fo¬ 
cal  length).  To  collect  all  the  light  from  the  edges 
of  the  field  of  view,  the  diameter  of  the  mirror 
must  be  larger  than  that  of  the  correcting  glass. 
The  Palomar  Schmidt  camera,  for  example,  has 
an  aperture  of  122  cm  (correcting  lens)/ 183  cm 
(mirror)  and  a  focal  length  of  300  cm.  The  largest 
Schmidt  telescope  in  the  world  is  in  Tauten- 
burg,  Germany,  and  its  corresponding  values  are 
134/203/400  cm. 

A  disadvantage  of  the  Schmidt  telescope  is 
the  curved  focal  plane,  consisting  of  a  part  of 
a  sphere.  When  the  telescope  is  used  for  pho¬ 
tography,  the  plate  must  be  bent  along  the  curved 
focal  plane.  Another  possibility  of  correcting  the 
curvature  of  the  field  of  view  is  to  use  an  extra 
correcting  lens  near  the  focal  plane.  Such  a  so¬ 
lution  was  developed  by  the  Finnish  astronomer 
Yrjo  Vaisala  in  the  1930’s,  independently  of 
Schmidt.  Such  a  system  is  often  called  a  Schmidt- 
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Corrector  lens  at  the 
centre  of  curvature 
of  the  main  mirror 

Curved 
focal  surface 

Fig.  3.15  The  principle  of  the  Schmidt  camera.  A  cor¬ 
recting  glass  at  the  centre  of  curvature  of  a  concave  spher¬ 
ical  mirror  deviates  parallel  rays  of  light  and  compensates 
for  the  spherical  aberration  of  the  spherical  mirror.  (In  the 
figure,  the  form  of  the  correcting  glass  and  the  change  of 
direction  of  the  light  rays  have  been  greatly  exaggerated.) 
Since  the  correcting  glass  lies  at  the  centre  of  curvature, 
the  image  is  practically  independent  of  the  incoming  an¬ 
gle  of  the  light  rays.  Thus  there  is  no  coma  or  astigmatism, 
and  the  images  of  stars  are  points  on  a  spherical  surface 
at  a  distance  of  R /2,  where  R  is  the  radius  of  curvature 
of  the  spherical  mirror.  In  photography,  the  plate  must  be 
bent  into  the  form  of  the  focal  surface,  or  the  field  rectified 
with  a  corrector  lens 


Vaisala  telescope.  The  chips  of  CCD  cameras  are 
usually  much  smaller  than  photographic  plates, 
and  then  the  curved  focal  surface  is  not  a  prob¬ 
lem. 

Schmidt  cameras  have  proved  to  be  very  ef¬ 
fective  in  mapping  the  sky.  They  have  been  used 
to  photograph  the  Palomar  Sky  Atlas  mentioned 
in  the  previous  chapter  and  its  continuation,  the 
E SO/SRC  Southern  Sky  Atlas. 

The  Schmidt  camera  is  an  example  of  a  cata- 
dioptric  telescope ,  which  has  both  lenses  and 
mirrors.  Schmidt-Cassegrain  telescopes  used  by 
many  amateurs  are  modifications  of  the  Schmidt 
camera.  They  have  a  secondary  mirror  mounted 
at  the  centre  of  the  correcting  lens;  the  mirror  re¬ 
flects  the  image  through  a  hole  in  the  primary  mir¬ 
ror.  Thus  the  effective  focal  length  can  be  rather 
long,  although  the  telescope  itself  is  very  short. 
Another  common  catadioptric  telescope  is  the 
Maksutov  telescope.  Both  surfaces  of  the  correct¬ 
ing  lens  as  well  as  the  primary  mirror  of  a  Mak¬ 
sutov  telescope  are  concentric  spheres. 

Another  way  of  removing  the  coma  of  the 
classical  reflectors  is  to  use  more  complicated 
mirror  surfaces.  The  Ritchey-Chretien  system 
has  hyperboloidal  primary  and  secondary  mir¬ 
rors,  providing  a  fairly  wide  useful  field  of  view. 
Ritchey-Chretien  optics  are  used  in  many  large 
telescopes. 

Mountings  of  Telescopes  A  telescope  has  to  be 
mounted  on  a  steady  support  to  prevent  its  shak¬ 
ing,  and  it  must  be  smoothly  rotated  during  obser¬ 
vations.  There  are  two  principal  types  of  mount¬ 
ing,  equatorial  and  azimuthal  (Fig.  3.17). 

In  the  equatorial  mounting,  one  of  the  axes  is 
directed  towards  the  celestial  pole.  It  is  called  the 
polar  axis  or  hour  axis.  The  other  one,  the  decli¬ 
nation  axis ,  is  perpendicular  to  it.  Since  the  hour 
axis  is  parallel  to  the  axis  of  the  Earth,  the  appar¬ 
ent  rotation  of  the  sky  can  be  compensated  for  by 
turning  the  telescope  around  this  axis  at  a  con¬ 
stant  rate. 

The  declination  axis  is  the  main  technical 
problem  of  the  equatorial  mounting.  When  the 
telescope  is  pointing  to  the  south  its  weight 
causes  a  force  perpendicular  to  the  axis.  When 
the  telescope  is  tracking  an  object  and  turns  west- 
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Fig.  3.16  The  large 
Schmidt  telescope  of  the 
European  Southern 
Observatory.  The  diameter 
of  the  mirror  is  1.62  m  and 
of  the  free  aperture  1  m. 
(Photo  ESO) 


ward,  the  bearings  must  take  an  increasing  load 
parallel  with  the  declination  axis. 

In  the  azimuthal  mounting,  one  of  the  axes  is 
vertical,  the  other  one  horizontal.  This  mounting 
is  easier  to  construct  than  the  equatorial  mount¬ 
ing  and  is  more  stable  for  very  large  telescopes. 
In  order  to  follow  the  rotation  of  the  sky,  the  tele¬ 
scope  must  be  turned  around  both  of  the  axes 
with  changing  velocities.  The  field  of  view  will 
also  rotate;  this  rotation  must  be  compensated  for 
when  the  telescope  is  used  for  photography. 

If  an  object  goes  close  to  the  zenith,  its  az¬ 
imuth  will  change  180°  in  a  very  short  time. 
Therefore,  around  the  zenith  there  is  a  small  re¬ 
gion  where  observations  with  an  azimuthal  tele¬ 
scope  are  not  possible. 


The  largest  telescopes  in  the  world  were  equa- 
torially  mounted  until  the  development  of  com¬ 
puters  made  possible  the  more  complicated  guid¬ 
ance  needed  for  azimuthal  mountings.  Most  of 
the  recently  built  large  telescopes  are  already  az- 
imuthally  mounted.  Azimuthally  mounted  tele¬ 
scopes  have  two  additional  obvious  places  for 
foci,  the  Nasmyth  foci  at  both  ends  of  the  hori¬ 
zontal  axis. 

The  Dobson  mounting ,  used  in  many  amateur 
telescopes,  is  azimuthal.  The  magnification  of  the 
Newtonian  telescope  is  usually  small,  and  the 
telescope  rests  on  pieces  of  teflon,  which  make 
it  very  easy  to  move.  Thus  the  object  can  easily 
be  tracked  manually. 
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Fig.  3.1 7  The  equatorial  mounting  (left)  and  the  azimuthal  mounting  (right) 


Another  type  of  mounting  is  the  coelostat , 
where  rotating  mirrors  guide  the  light  into  a  sta¬ 
tionary  telescope.  This  system  is  used  especially 
in  solar  telescopes. 

To  measure  absolute  positions  of  stars  and  ac¬ 
curate  time,  telescopes  aligned  with  the  north- 
south  direction  are  used.  They  can  be  rotated 
around  one  axis  only,  the  east-west  horizontal 
axis.  Meridian  circles  or  transit  instruments  with 
this  kind  of  mounting  were  widely  constructed 
for  different  observatories  during  the  19th  cen¬ 
tury.  A  few  are  still  used  for  astrometry,  but  they 
are  now  highly  automatic.  The  most  important  of 
the  still  operational  instruments  is  that  of  the  U.S. 
Naval  Observatory  in  Flagstaff.  The  meridian  cir¬ 
cle  on  La  Palma  funded  by  the  Carlsberg  founda¬ 
tion  was  closed  in  2013. 

New  Techniques  Detectors  are  already  ap¬ 
proaching  the  theoretical  limit  of  efficiency, 
where  all  incident  photons  are  registered.  Ulti¬ 
mately,  to  detect  even  fainter  objects  the  only 
solution  is  to  increase  the  light  gathering  area 
(Fig.  3.18),  but  also  the  mirrors  are  getting  close 
to  the  practical  maximum  size.  Thus,  new  techni¬ 
cal  solutions  are  needed. 


One  new  feature  is  active  optics ,  used  in  most 
new  big  telescopes.  The  mirror  is  very  thin,  but  its 
shape  is  kept  exactly  correct  by  a  computer  con¬ 
trolled  support  mechanism.  The  weight  and  pro¬ 
duction  cost  of  such  a  mirror  are  much  smaller 
compared  with  a  conventional  thick  mirror.  Be¬ 
cause  of  the  smaller  weight  also  the  supporting 
structure  can  be  made  lighter. 

Developing  the  control  mechanism  further 
leads  to  adaptive  optics.  A  reference  star  (or  an 
artificial  beam)  is  monitored  constantly  in  order 
to  obtain  the  shape  of  the  seeing  disk.  The  attitude 
or  the  shape  of  a  small  auxiliary  mirror  is  adjusted 
up  to  hundreds  of  times  a  second  to  keep  the  im¬ 
age  as  concentrated  as  possible.  Adaptive  optics 
has  been  taken  into  use  in  the  largest  telescopes 
of  the  world  from  about  the  year  2000  on. 

The  mirrors  of  large  telescopes  need  not  be 
monolithic,  but  can  be  made  of  smaller  pieces 
that  are,  e.g.  hexagonal.  These  mosaic  mirrors 
are  very  light  and  can  be  used  to  build  up  mir¬ 
rors  with  diameters  of  several  tens  of  metres 
(Fig.  3.20).  Using  active  optics,  the  hexagons  can 
be  accurately  focused.  The  California  Associa¬ 
tion  for  Research  in  Astronomy  has  constructed 
the  William  M.  Keck  telescope  with  a  10  m  mo- 
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Fig.  3.18  The  largest 
telescopes  in  the  world  in 
1947-2000.  (a)  For  nearly 
30  years,  the  5.1  m  Hale 
telescope  on  Mount 
Palomar,  California,  USA, 
was  the  largest  telescope  in 
the  world,  (b)  The  BTA, 
Big  Azimuthal  Telescope, 
is  situated  in  the  Caucasus 
in  the  southern  Soviet 
Union.  Its  mirror  has 
a  diameter  of  6  m.  It  was 
set  in  operation  at  the  end 
of  1975.  (c)  The  William 
M.  Keck  Telescope  on  the 
summit  of  Mauna  Kea, 
Hawaii,  was  completed  in 
1992.  The  10  m  mirror 
consists  of  36  hexagonal 
segments.  Another  similar 
telescope  next  to  it  was 
completed  in  1996.  (Photos 
Palomar  Observatory, 
Spetsialnaya 
Astrohzitsheskaya 
Observatorya,  and  Roger 
Ressmeyer-Starlight  for 
the  California  Association 
for  Research  in 
Astronomy) 


saic  mirror.  It  is  located  on  Mauna  Kea,  and  the 
last  segment  was  installed  in  1992.  Also  the  E- 
ELT  telescope  (Fig.  3.21)  planned  by  ESO  will 
have  a  mosaic  mirror. 

The  reflecting  surface  does  not  have  to  be  con¬ 
tinuous,  but  can  even  consist  of  several  separate 
telescopes.  Next  to  the  Keck  another  similar  tele¬ 
scope,  the  Keck  II,  was  completed  in  1996.  The 
pair  can  be  used  as  an  interferometer  that  has  the 
same  resolving  power  as  a  single  75  metre  mir¬ 
ror. 

The  European  Southern  Observatory  has  con¬ 
structed  its  own  multi-mirror  telescope.  ESO’s 
Very  Large  Telescope  (VLT)  has  four  closely  lo¬ 
cated  telescopes  (Fig.  3.19).  The  diameter  of  each 
mirror  is  eight  metres,  and  the  total  area  corre¬ 


sponds  to  one  telescope  with  a  16  m  mirror.  The 
resolution  is  even  better,  since  the  “aperture”,  i.e. 
the  maximum  distance  between  the  mirrors,  is 
several  tens  of  meters. 

An  important  astronomical  instruments  of  the 
20th  century  is  the  Hubble  Space  Telescope , 
launched  in  1990  (Fig.  3.22).  It  has  a  mirror  with 
a  diameter  of  2.4  m.  The  resolution  of  the  tele¬ 
scope  (after  the  faulty  optics  was  corrected)  is 
near  the  theoretical  diffraction  limit,  since  there 
is  no  disturbing  atmosphere.  A  second  genera¬ 
tion  Space  Telescope,  now  called  the  James  Webb 
Space  Telescope ,  will  have  a  mosaic  mirror  of 
6.5  m.  The  launch  has  been  postponed  mainly  for 
financial  reasons.  The  current  estimates  for  the 
launch  are  around  2018. 
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Fig.  3.18  (Continued) 


Space  telescopes  avoid  the  atmospheric  per¬ 
turbations.  However,  comparable  results  are 
nowadays  obtained  using  the  adaptive  optics.  Due 
to  budgetary  reasons,  the  majority  of  astronomi¬ 
cal  observations  will  still  be  carried  out  on  the 
Earth,  and  great  attention  will  be  given  to  improv¬ 
ing  ground-based  observatories  and  detectors. 

In  the  future,  satellites  will  continue  to  be 
mainly  used  for  those  wavelength  regions  where 
the  radiation  is  absorbed  by  the  atmosphere. 
(Fig.  3.2  and  Sect.  3.5). 


3.3  Detectors  and  Instruments 

Only  a  limited  amount  of  information  can  be  ob¬ 
tained  by  looking  through  a  telescope  with  the 


unaided  eye.  Yet  until  the  end  of  the  19th  cen¬ 
tury  this  was  the  only  way  to  make  observations. 
The  invention  of  photography  in  the  middle  of 
the  19th  century  brought  a  revolution  in  astron¬ 
omy.  The  next  important  step  forward  in  optical 
astronomy  was  the  development  of  photoelectric 
photometry  in  the  1940’s  and  1950’s.  A  new  revo¬ 
lution,  comparable  to  that  caused  by  the  invention 
of  photography,  took  place  in  the  middle  of  the 
1970’s  with  the  introduction  of  different  semicon¬ 
ductor  detectors.  The  sensitivity  of  detectors  has 
grown  so  much  that  today,  a  60  cm  telescope  can 
be  used  for  observations  similar  to  those  made 
with  the  Palomar  5  m  telescope  when  it  was  set 
in  operation  in  the  1940’s. 
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Fig.  3.18  (Continued) 


The  Photographic  Plate  Photography  has  long 
been  one  of  the  most  common  methods  of  obser¬ 
vation  in  astronomy.  In  astronomical  photogra¬ 
phy  glass  plates  were  used,  rather  than  film,  since 
they  keep  their  shape  better,  but  nowadays  they 
are  no  more  manufactured,  and  CCD-cameras 
have  largely  replaced  photography.  The  sensitive 
layer  on  the  surface  of  the  film  or  plate  is  made  up 
of  a  silver  halide,  usually  silver  bromide,  AgBr. 
A  photon  absorbed  by  the  halide  excites  an  elec¬ 
tron  that  can  move  from  one  atom  to  another. 
A  silver  ion,  Ag+,  can  catch  the  electron,  becom¬ 
ing  a  neutral  atom.  When  the  necessary  amount 
of  silver  atoms  have  been  accumulated  at  one 
place,  they  form  a  latent  image.  The  latent  image 
can  be  made  into  a  permanent  negative  by  treat¬ 


ing  the  plate  after  exposure  with  various  chemi¬ 
cals,  which  transform  the  silver  bromide  crystals 
enclosing  the  latent  image  into  silver  (“develop¬ 
ment”),  and  remove  the  unexposed  crystals  (“fix- 
ing”). 

The  photographic  plate  has  many  advantages 
over  the  human  eye.  The  plate  can  register  up  to 
millions  of  stars  (picture  elements)  at  one  time, 
while  the  eye  can  observe  at  most  one  or  two 
objects  at  a  time.  The  image  on  a  plate  is  prac¬ 
tically  permanent — the  picture  can  be  studied  at 
any  time.  In  addition,  the  photographic  plate  is 
cheap  and  easy  to  use,  as  compared  to  many  other 
detectors.  The  most  important  feature  of  a  plate 
is  its  capability  to  collect  light  over  an  extended 
time:  the  longer  exposures  are  used,  the  more  sil- 
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Fig.  3.19  Some  new  large  telescopes,  (a)  The  European 
Southern  Observatory  (ESO)  was  founded  by  Belgium, 
France,  the  Netherlands,  Sweden  and  West  Germany  in 
1962.  Other  European  countries  have  joined  them  later. 
The  VLT  (Very  Large  Telescope)  on  Cerro  Paranal  in 
Northern  Chile,  was  inaugurated  in  1998-2000.  (b)  Cur¬ 
rently  the  largest  telescope  is  the  GTC  (Gran  Telesco- 

ver  atoms  are  formed  on  the  plate  (the  plate  dark¬ 
ens).  By  increasing  the  exposure  times,  fainter 
objects  can  be  photographed.  The  eye  has  no  such 


pio  Canarias  or  GranTeCan)  on  La  Palma.  The  observa¬ 
tions  with  the  10.4  m  mirror  were  made  in  2009.  (c)  The 
largest  binocular  is  the  LBT  (Large  Binocular  Telescope) 
on  Mount  Graham  in  Arizona,  completed  in  2008.  It  has 
two  8.4  metre  mirrors.  (Photos  ESO,  IAC,  Max  Planck 
Institute) 


capacity:  if  a  faint  object  does  not  show  through 
a  telescope,  it  cannot  been  seen,  no  matter  how 
long  one  stares. 


3.3  Detectors  and  Instruments 


67 


Fig.  3.19  (Continued) 


Fig.  3.20  The  mirror  of 
a  telescope  can  be  made  up 
of  several  smaller 
segments,  which  are  much 
easier  to  manufacture,  as  in 
the  Hobby-Eberle 
Telescope  on  Mount 
Fowlkes,  Texas.  The 
effective  diameter  of  the 
mirror  is  9.1  m.  A  similar 
telescope  is  being  built  in 
South  Africa.  (Photo 
MacDonald  Observatory) 


One  disadvantage  of  the  photographic  plate  is 
its  low  sensitivity.  Only  one  photon  in  a  thou¬ 
sand  causes  a  reaction  leading  to  the  formation 
of  a  silver  grain.  Thus  the  quantum  efficiency  of 
the  plate  is  only  0.1  %.  Several  chemical  treat¬ 
ments  can  be  used  to  sensitise  the  plate  before 
exposure.  This  brings  the  quantum  efficiency  up 
to  a  few  percent.  Another  disadvantage  is  the  fact 
that  a  silver  bromide  crystal  that  has  been  ex¬ 
posed  once  does  not  register  anything  more,  i.e. 


a  saturation  point  is  reached.  On  the  other  hand, 
a  certain  number  of  photons  are  needed  to  pro¬ 
duce  an  image.  Doubling  the  number  of  pho¬ 
tons  does  not  necessarily  double  the  density  (the 
‘blackness’  of  the  image):  the  density  of  the  plate 
depends  nonlinearly  on  the  amount  of  incoming 
light  (Fig.  3.23).  The  sensitivity  of  the  plate  is 
also  strongly  dependent  on  the  wavelength  of  the 
light.  For  the  reasons  mentioned  above  the  ac¬ 
curacy  with  which  brightness  can  be  measured 
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Fig.  3.21  In  order  to  observe  objects  essentially  fainter 
than  nowadays  the  size  of  the  objective  must  be  increased 
considerably.  The  European  Southern  Observatory  ESO 
started  to  make  plans  for  a  100  metre  OWL  telescope 
(Overwhelmingly  Large  telescope),  but  when  the  price 
turned  out  to  be  “astronomical”  the  size  has  been  reduced 

Fig.  3.22  The  Hubble 
Space  Telescope  after  the 
latest  service  flight  in  2009. 

The  telescope  got  new 
solar  panels  and  several 
other  upgrades.  (Photo 
NASA) 


several  times.  When  the  E-ELT  telescope  (European  Ex¬ 
tremely  Large  Telescope)  will  be  completed  the  main  mir¬ 
ror  will  have  a  diameter  of  39  metres.  The  mirror  will  con¬ 
sist  of  798  hexagonal  pieces,  each  having  a  diameter  of 
1.45  metres.  (ESO) 


on  a  photographic  plate  is  usually  worse  than 
about  5  %.  Thus  the  photographic  plate  makes 
a  poor  photometer,  but  it  can  be  excellently  used, 
e.g.  for  measuring  the  positions  of  stars  (posi¬ 
tional  astronomy)  and  for  mapping  the  sky.  Old 
photography  archives  are  still  valuable  for  study¬ 
ing  proper  motions  and  brightness  variations  that 
are  not  too  minuscule. 


Photocathodes,  Photomultipliers  A  photo¬ 
cathode  is  a  more  effective  detector  than  the  pho¬ 
tographic  plate.  It  is  based  on  the  photoelectric 
effect.  A  light  quantum,  or  photon,  hits  the  pho¬ 
tocathode  and  loosens  an  electron.  The  electron 
moves  to  the  positive  electrode,  or  anode,  and 
gives  rise  to  an  electric  current  that  can  be  mea¬ 
sured.  The  quantum  efficiency  of  a  photocathode 
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Fig.  3  .23  The  density  of  a  photographic  emulsion  de¬ 
pends  on  the  incident  light.  If  there  is  very  little  light  it  has 
no  effect.  Even  then  the  him  is  not  completely  transparent 
since  it  has  a  light  background  fog.  When  the  amount  of 
light  increases  the  density  will  begin  to  grow  first  nonlin- 
early.  When  the  exposure  is  optimal,  which  depends  of  the 
type  of  the  him,  the  dependence  of  the  density  on  exposure 
is  nearly  linear.  When  there  is  too  much  light,  all  grains  of 
the  him  become  dark  and  the  image  is  saturated 

is  about  10-20  times  better  than  that  of  a  pho¬ 
tographic  plate;  optimally,  an  efficiency  of  30  % 
can  be  reached.  A  photocathode  is  also  a  linear 
detector:  if  the  number  of  electrons  is  doubled, 
the  outcoming  current  is  also  doubled. 

The  photomultiplier  is  one  of  the  most  impor¬ 
tant  applications  of  the  photocathode.  In  this  de¬ 
vice,  the  electrons  leaving  the  photocathode  hit 
a  dynode.  For  each  electron  hitting  the  dynode, 
several  others  are  released.  When  there  are  sev¬ 
eral  dynodes  in  a  row,  the  original  weak  current 
can  be  intensified  a  millionfold.  The  photomulti¬ 
plier  measures  all  the  light  entering  it,  but  does 
not  form  an  image.  Photomultipliers  are  mostly 
used  in  photometry,  and  an  accuracy  of  0.1-1  % 
can  be  attained. 

Photometers,  Polarimeters  A  detector  mea¬ 
suring  brightness,  a  photometer ,  is  usually  lo¬ 
cated  behind  the  telescope  in  the  Cassegrain  fo¬ 
cus.  In  the  focal  plane  there  is  a  small  hole,  the  di¬ 
aphragm ,  which  lets  through  light  from  the  object 
under  observation.  In  this  way,  light  from  other 
stars  in  the  field  of  view  can  be  prevented  from 
entering  the  photometer.  A  field  lens  behind  the 
diaphragm  refracts  the  light  rays  onto  a  photo¬ 
cathode.  The  outcoming  current  is  intensified  fur- 


Dia-  Colli-  Semitransparent 

phragm  mator  mirrors 


Fig.  3.24  The  principle  of  a  photoelectric  multicolour 
photometer.  Light  collected  by  the  telescope  arrives  from 
the  left.  The  light  enters  the  photometer  through  a  small 
hole  in  the  focal  plane,  the  diaphragm.  A  lens  collimates 
the  light  into  a  parallel  beam.  Semitransparent  mirrors  di¬ 
vide  the  beam  to  several  photomultipliers.  A  held  lens 
guides  the  light  through  a  filter  onto  the  photocathode  of 
the  photomultiplier.  The  quanta  of  light,  photons,  release 
electrons  from  the  cathodes.  The  electrons  are  accelerated 
towards  the  dynodes  with  a  voltage  of  about  1500  V.  The 
electrons  hitting  the  dynodes  release  still  more  electrons, 
and  the  current  is  greatly  enhanced.  Every  electron  emit¬ 
ted  from  the  cathode  gives  rise  to  a  pulse  of  up  to  108  elec¬ 
trons  at  the  anode;  the  pulse  is  amplified  and  registered  by 
a  pulse  counter.  In  this  way,  the  photons  from  the  star  are 
counted 


ther  in  a  preamplifier.  The  photomultiplier  needs 
a  voltage  of  1000-1500  volts. 

Observations  are  often  made  in  a  certain  wave¬ 
length  interval,  instead  of  measuring  all  the  ra¬ 
diation  entering  the  detector.  In  this  case  a  filter 
is  used  to  prevent  other  wavelengths  from  reach¬ 
ing  the  photomultiplier.  A  photometer  can  also 
consist  of  several  photomultipliers  (Fig.  3.23), 
which  measure  simultaneously  different  wave¬ 
length  bands.  In  such  an  instrument  beam  split¬ 
ters  or  semitransparent  mirrors  split  the  light 
beam  through  fixed  filters  to  the  photomultipliers. 

In  a  device  called  the  photopolarimeter ,  a  po¬ 
larising  filter  is  used,  either  alone  or  in  combina¬ 
tion  with  other  filters.  The  degree  and  direction  of 
polarisation  can  be  found  by  measuring  the  inten¬ 
sity  of  the  radiation  with  different  orientations  of 
the  polarisers. 

In  practice,  the  diaphragm  of  a  photometer 
will  always  also  let  through  part  of  the  back¬ 
ground  sky  around  the  observed  object.  The  mea¬ 
sured  brightness  is  in  reality  the  combined  bright¬ 
ness  of  the  object  and  the  sky.  In  order  to  find  the 
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brightness  of  the  object,  the  background  bright¬ 
ness  must  be  measured  separately  and  subtracted 
from  the  combined  brightness.  The  accuracy  of 
the  measurements  is  decreased  if  long  observa¬ 
tion  times  are  used  and  the  background  bright¬ 
ness  undergoes  fast  changes.  The  problem  can  be 
solved  by  observing  the  brightness  of  the  back¬ 
ground  sky  and  the  object  simultaneously. 

Photometric  observations  are  often  relative.  If 
one  is  observing,  e.g.  a  variable  star,  a  reference 
star  close  to  the  actual  target  is  observed  at  regu¬ 
lar  intervals.  Using  the  observations  of  this  ref¬ 
erence  star  it  is  possible  to  derive  a  model  for 
the  slow  changes  in  the  atmospheric  extinction 
(see  Chap.  4)  and  remove  their  effect.  The  instru¬ 
ment  can  be  calibrated  by  observing  some  stan¬ 
dard  stars ,  whose  brightness  is  known  very  accu¬ 
rately. 

Image  Intensifiers  Different  image  intensifiers 
based  on  the  photocathode  have  been  used  since 
the  1960’s.  In  the  intensifier  the  information 
about  the  starting  point  of  the  electron  on  the  pho¬ 
tocathode  is  preserved  and  the  intensified  image 
is  formed  on  a  fluorescent  screen.  The  image  can 
then  be  registered,  e.g.  with  a  CCD  camera.  One 
of  the  advantages  of  the  image  intensifier  is  that 
even  faint  objects  can  be  imaged  using  relatively 
short  exposures,  and  observations  can  be  made  at 
wavelengths  where  the  detector  is  insensitive. 

Another  common  type  of  detector  is  based  on 
the  TV  camera  ( Vidicon  camera).  The  electrons 
released  from  the  photocathode  are  accelerated 
with  a  voltage  of  a  few  kilovolts  before  they  hit 
the  electrode  where  they  form  an  image  in  the 
form  of  an  electric  charge  distribution.  After  ex¬ 
posure,  the  charge  at  different  points  of  the  elec¬ 
trode  is  read  by  scanning  its  surface  with  an  elec¬ 
tron  beam  row  by  row.  This  produces  a  video  sig¬ 
nal,  which  can  be  transformed  into  a  visible  im¬ 
age  on  a  TV  tube.  The  information  can  also  be 
saved  in  digital  form.  In  the  most  advanced  sys¬ 
tems,  the  scintillations  caused  by  single  electrons 
on  the  fluorescent  screen  of  the  image  intensi¬ 
fier  can  be  registered  and  stored  in  the  memory 
of  a  computer.  For  each  point  in  the  image  there 
is  a  memory  location,  called  a  picture  element  or 
pixel. 


Since  the  middle  of  the  1970’s,  detectors  using 
semiconductor  techniques  began  to  be  used  in  in¬ 
creasing  numbers.  With  semiconductor  detectors 
a  quantum  efficiency  of  about  70-80  %  can  be  at¬ 
tained;  thus,  sensitivity  cannot  be  improved  much 
more.  The  wavelength  regions  suitable  for  these 
new  detectors  are  much  wider  than  in  the  case  of 
the  photographic  plate.  The  detectors  are  also  lin¬ 
ear:  if  the  number  of  photons  is  doubled,  also  the 
signal  will  be  doubled.  Computers  are  used  for 
collecting,  saving  and  analysing  the  output  data 
available  in  digital  form. 

CCD  Camera  Currently  the  most  important 
detector  is  the  CCD  camera  (Charge  Coupled  De¬ 
vice).  The  detector  consists  of  a  surface  made 
up  of  light  sensitive  silicon  diodes,  arranged  in 
a  rectangular  array  of  image  elements  or  pix¬ 
els.  The  largest  cameras  can  have  as  many  as 
4096  x  4096  pixels,  although  most  are  consid¬ 
erably  smaller.  Recently,  CCD’s  have  become 
widely  used  in  ordinary  digital  cameras. 

A  photon  hitting  the  detector  can  release 
an  electron,  which  will  remain  trapped  inside 
a  pixel.  After  the  exposure  varying  potential 
differences  are  used  to  move  the  accumulated 
charges  row  by  row  to  a  readout  buffer.  In  the 
buffer  the  charges  are  moved  pixel  by  pixel  to 
an  analogy/digital  converter,  which  transmits  the 
digital  value  to  a  computer.  Reading  an  image 
also  clears  the  detector  (Fig.  3.24).  If  the  expo¬ 
sures  are  very  short  the  readout  times  may  take 
a  substantial  part  of  the  observing  time. 

The  CCD  camera  is  nearly  linear:  the  number 
of  electrons  is  directly  proportional  to  the  num¬ 
ber  of  photons.  Calibration  of  the  data  is  there¬ 
fore  much  easier  than  with  photographic  plates 
(Fig.  3.25). 

The  quantum  efficiency,  i.e.  the  number  of 
electrons  per  incident  photon,  is  high,  and  the 
CCD  camera  is  much  more  sensitive  than  a  pho¬ 
tographic  plate.  The  sensitivity  is  highest  in  the 
red  wavelength  range,  about  600-800  nm,  where 
the  quantum  efficiency  can  be  80-90  %  or  even 
higher. 

The  range  of  the  camera  extends  far  to  the  in¬ 
frared.  In  the  ultraviolet  the  sensitivity  drops  due 
to  the  absorption  of  the  silicon  very  rapidly  be¬ 
low  about  500  nm.  Two  methods  have  been  used 
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Fig.  3.25  The  principle  of  reading  a  CCD  camera, 
(a)  During  an  exposure  electrons  are  trapped  in  potential 
wells  corresponding  to  pixels  of  the  camera.  The  number 
at  each  pixel  shows  the  number  of  electrons,  (b)  After 
the  exposure  each  horizontal  line  is  moved  one  pixel  to 
the  right;  the  rightmost  row  moves  to  the  readout  buffer, 
(c)  The  contents  of  the  buffer  is  moved  down  by  one  pixel. 

to  avoid  this  problem.  One  is  to  use  a  coating  that 
absorbs  the  ultraviolet  photons  and  emits  light  of 
longer  wavelength.  Another  possibility  is  to  turn 
the  chip  upside  down  and  make  it  very  thin  to  re¬ 
duce  the  absorption. 

The  thermal  noise  of  the  camera  generates 
dark  current  even  if  the  camera  is  in  total  dark¬ 
ness.  To  reduce  the  noise  the  camera  must  be 
cooled.  Astronomical  CCD  cameras  are  usually 
cooled  with  liquid  nitrogen,  which  efficiently  re¬ 
moves  most  of  the  dark  current.  However,  the 
sensitivity  is  also  reduced  when  the  camera  is 
cooled;  so  too  cold  is  not  good  either.  The  tem¬ 
perature  must  be  kept  constant  in  order  to  obtain 
consistent  data.  For  amateurs  there  are  already 
moderately  priced  CCD  cameras,  which  are  elec¬ 
trically  cooled.  Many  of  them  are  good  enough 
also  for  scientific  work,  if  very  high  sensitivity  is 
not  required. 

The  dark  current  can  easily  be  measured  by 
taking  exposures  with  the  shutter  closed.  Sub¬ 
tracting  this  from  the  observed  image  gives  the 
real  number  of  electrons  due  to  incident  light 
(Fig.  3.26). 


The  lowermost  charge  moves  to  the  A/D  converter,  which 
sends  the  number  of  electrons  to  the  computer,  (d)  Af¬ 
ter  moving  the  buffer  down  several  times  one  vertical  row 
has  been  read,  (e)  The  image  is  again  shifted  right  by  one 
pixel.  This  procedure  is  repeated  till  the  whole  image  is 
read 


Fig.  3.26  The  signal  measured  by  a  CCD-camera  de¬ 
pends  almost  linearly  on  the  number  of  photons.  During 
the  exposure  also  the  dark  current  will  increase  and  has  to 
be  subtracted  from  the  observed  values.  Bias  is  the  dark 
current  registered  even  if  the  exposure  time  were  zero 

The  sensitivity  of  individual  pixels  may  be 
slightly  different.  This  can  be  corrected  for  by 
taking  an  image  of  an  evenly  illuminated  field, 
like  a  twilight  sky,  where  stars  are  not  visible. 
This  image  is  called  a  flat-field.  When  observa¬ 
tions  are  divided  by  the  flat-field,  the  error  caused 
by  different  pixels  is  removed  (Fig.  3.27). 
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Fig.  3.27  Typical  images  related  to  CCD  observations.  Top  left  dark  current,  top  right  flat  field,  bottom  left  original 
raw  image  and  bottom  right  reduced  image 


The  CCD  camera  is  very  stable.  Therefore  it  is 
not  necessary  to  repeat  the  dark  current  and  flat- 
field  observations  very  frequently.  Typically  these 
calibration  exposures  are  taken  during  evening 
and  morning  twilights,  just  before  and  after  ac¬ 
tual  observations. 

Cosmic  rays  are  charged  particles  that  can  pro¬ 
duce  extraneous  bright  dots  in  CCD  images.  They 
are  usually  limited  to  one  or  two  pixels,  and  are 
easily  identified.  Typically  a  short  exposure  of 
a  few  minutes  contains  a  few  traces  of  cosmic 


rays.  Instead  of  a  single  long  exposure  it  is  usu¬ 
ally  better  to  take  several  short  ones,  clean  the  im¬ 
ages  from  cosmic  rays,  and  finally  add  the  images 
on  a  computer. 

A  more  serious  problem  is  the  readout  noise 
of  the  electronics.  In  the  first  cameras  it  could 
be  hundreds  of  electrons  per  pixel.  In  modern 
cameras  it  is  a  few  electrons.  This  gives  a  limit 
to  the  faintest  detectable  signal:  if  the  signal  is 
weaker  than  the  readout  noise,  it  is  indistinguish¬ 
able  from  the  noise.  Nowadays  there  are  so  called 
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L3 -cameras  (low  light  level  CCD),  where  elec¬ 
trons  are  multiplied  during  reading  and  the  read¬ 
out  noise  is  the  only  source  of  noise. 

Although  the  CCD  camera  is  a  very  sensi¬ 
tive  detector,  even  bright  light  cannot  damage  it. 
A  photomultiplier,  on  the  other  hand,  can  be  eas¬ 
ily  destroyed  by  letting  in  too  much  light.  How¬ 
ever,  one  pixel  can  only  store  a  certain  number  of 
electrons,  after  which  it  becomes  saturated.  Ex¬ 
cessive  saturation  can  make  the  charge  to  over¬ 
flow  also  to  the  neighbouring  pixels.  If  the  camera 
becomes  badly  saturated  it  may  have  to  be  read 
several  times  to  completely  remove  the  charges. 

The  largest  CCD  cameras  are  quite  expensive, 
and  even  they  are  still  rather  small  compared 
with  the  largest  photographic  plates  and  films. 
Because  they  are  very  sensitive  and  processing 
of  the  data  is  easy,  they  have,  however,  replaced 
photographic  materials  almost  completely. 

Spectrographs  The  simplest  spectrograph  is 
a  prism  that  is  placed  in  front  of  a  telescope. 
This  kind  of  device  is  called  the  objective  prism 
spectrograph.  The  prism  spreads  out  the  different 
wavelengths  of  light  into  a  spectrum  which  can 
be  registered.  During  the  exposure,  the  telescope 
is  usually  slightly  moved  perpendicularly  to  the 
spectrum,  in  order  to  increase  the  width  of  the 
spectrum.  With  an  objective  prism  spectrograph, 
large  numbers  of  spectra  can  be  photographed, 
e.g.  for  spectral  classification  (Chap.  9). 

For  more  accurate  information  the  slit  spec¬ 
trograph  must  be  used  (Fig.  3.28).  It  has  a  nar¬ 
row  slit  in  the  focal  plane  of  the  telescope.  The 
light  is  guided  through  the  slit  to  a  collimator 
that  reflects  or  refracts  ah  the  light  rays  into 
a  parallel  beam.  After  this,  the  light  is  dispersed 
into  a  spectrum  by  a  prism  and  focused  with 
a  camera  onto  a  detector,  which  nowadays  is  usu¬ 
ally  a  CCD  camera.  A  comparison  spectrum  is 
exposed  next  to  the  stellar  spectrum  to  deter¬ 
mine  the  precise  wavelengths.  In  modern  spec¬ 
trographs  using  CCD  cameras,  the  comparison 
spectrum  is  usually  exposed  as  a  separate  im¬ 
age.  A  big  slit  spectrograph  is  often  placed  at  the 
coude  or  Nasmyth  focus  of  the  telescope. 

The  most  important  property  of  a  spectrograph 
is  the  scale  or  dispersion  of  the  spectrum.  The  dis- 
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Fig.  3.28  The  principle  of  the  slit  spectrograph.  Light 
rays  entering  through  a  slit  are  collimated  (made  paral¬ 
lel  to  each  other),  dispersed  into  a  spectrum  by  a  prism 
and  projected  onto  a  photographic  plate  or  a  CCD.  There 
can  also  be  several  prisms.  However,  grids  are  used  more 
commonly  since  they  give  a  better  resolution 

persion  tells  how  long  a  wavelength  range  cor¬ 
responds  to  one  distance  unit  of  the  detector. 
For  the  objective  prism  the  dispersion  is  typi¬ 
cally  a  few  tens  of  nanometres  per  millimetre.  Slit 
spectrographs  may  have  a  dispersion  as  high  as 
1-0.01  nm/mm,  which  makes  it  possible  to  study 
even  the  shapes  of  individual  spectral  lines.  Often 
the  dispersion  is  given  as  a  dimensionless  quan¬ 
tity.  For  instance,  the  dispersion  1  nm/mm  means 
that  the  wavelength  scale  has  increased  million- 
fold;  thus  it  can  be  expressed  as  106. 

Instead  of  the  prism  a  diffraction  grating  can 
be  used  to  form  the  spectrum.  A  grating  has  nar¬ 
row  grooves,  side  by  side,  typically  several  hun¬ 
dred  per  millimetre.  When  light  is  reflected  by 
the  walls  of  the  grooves,  the  adjoining  rays  in¬ 
terfere  with  each  other  and  give  rise  to  spectra  of 
different  orders.  There  are  two  kinds  of  gratings: 
reflection  and  transmission  gratings.  In  a  reflec¬ 
tion  grating  no  light  is  absorbed  by  the  glass  as  in 
the  prism  or  transmission  grating.  A  grating  usu¬ 
ally  has  higher  dispersion,  or  ability  to  spread  the 
spectrum,  than  a  prism.  The  dispersion  can  be  in¬ 
creased  by  increasing  the  density  of  the  grooves 
of  the  grating.  In  slit  spectrographs  the  reflection 
grating  is  most  commonly  used.  In  some  instru¬ 
ments  a  grism  is  used;  it  is  a  prism  with  a  trans¬ 
mission  grating  on  one  of  its  surfaces. 

Interferometers  The  resolution  of  a  big  tele¬ 
scope  is  in  practice  limited  by  seeing,  and  thus 
increasing  the  aperture  does  not  necessarily  im¬ 
prove  the  resolution.  To  get  nearer  to  the  theoret¬ 
ical  resolution  limit  set  by  diffraction  (Eq.  (3.3), 
Fig.  3.6),  different  interferometers  can  be  used. 
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There  are  two  types  of  optical  interferome¬ 
ters.  One  kind  uses  an  existing  large  telescope; 
the  other  a  system  of  two  or  more  separate  tele¬ 
scopes.  In  both  cases  the  light  rays  are  allowed 
to  interfere.  By  analysing  the  outcoming  interfer¬ 
ence  pattern,  the  structures  of  close  binaries  can 
be  studied,  apparent  angular  diameters  of  the  stars 
can  be  measured,  etc. 

One  of  the  earliest  interferometers  was  the 
Michelson  interferometer  that  was  built  shortly 
before  1920  for  the  largest  telescope  of  that  time. 
In  front  of  the  telescope,  at  the  ends  of  a  six  me¬ 
tre  long  beam,  there  were  flat  mirrors  reflecting 
the  light  into  the  telescope.  The  form  of  the  inter¬ 
ference  pattern  changed  when  the  separation  of 
the  mirrors  was  changed.  In  practice,  the  inter¬ 
ference  pattern  was  disturbed  by  seeing,  and  only 
a  few  positive  results  were  obtained  with  this  in¬ 
strument. 

The  diameters  of  over  30  of  the  brightest  stars 
have  been  measured  using  intensity  interferome¬ 
ters.  Such  a  device  consists  of  two  separate  tele¬ 
scopes  that  can  be  moved  in  relation  to  each  other. 
This  method  is  suitable  for  the  brightest  objects 
only. 

In  1970  the  Frenchman  Antoine  Lab  eyrie  in¬ 
troduced  the  principle  of  speckle  interferometry. 
In  traditional  imaging  the  pictures  from  long  ex¬ 
posures  consist  of  a  large  number  of  instanta¬ 
neous  images,  “speckles”,  that  together  form  the 
seeing  disk.  In  speckle  interferometry  very  short 
exposures  and  large  magnifications  are  used  and 
hundreds  of  pictures  are  taken.  When  these  pic¬ 
tures  are  combined  and  analysed  (usually  in  dig¬ 
ital  form),  the  actual  resolution  of  the  telescope 
can  nearly  be  reached. 

The  accuracy  of  interferometric  techniques 
was  improved  at  the  beginning  of  00’s.  The  first 
experiments  to  use  the  two  10  m  Keck  telescopes 
as  one  interferometer,  were  made  in  2001.  Simi¬ 
larly,  the  ESO  VLT  can  be  used  as  an  interferom¬ 
eter. 


3.4  Radio  Telescopes 

Radio  astronomy,  started  in  the  1930’s,  extended 
the  observable  range  of  the  electromagnetic  spec¬ 
trum  by  several  orders  of  magnitude.  Radio  fre¬ 


quencies  range  from  a  few  megahertz  (100  m) 
up  to  about  300  GHz  (1  mm).  The  low-frequency 
limit  of  the  radio  band  is  determined  by  the  opac¬ 
ity  of  the  ionosphere,  while  the  high-frequency 
limit  is  due  to  the  strong  absorption  from  oxy¬ 
gen  and  water  bands  in  the  lower  atmosphere. 
Neither  of  these  limits  is  very  strict,  and  un¬ 
der  favourable  conditions  radio  astronomers  can 
work  into  the  submillimetre  region  or  through 
ionospheric  holes  during  sunspot  minima. 

At  the  beginning  of  the  20th  century  attempts 
were  made  to  observe  radio  emission  from  the 
Sun.  These  experiments,  however,  failed  because 
of  the  low  sensitivity  of  the  antenna-receiver  sys¬ 
tems,  and  because  of  the  opaqueness  of  the  iono¬ 
sphere  at  the  low  frequencies  at  which  most  of 
the  experiments  were  carried  out.  The  first  ob¬ 
servations  of  cosmic  radio  emission  were  later 
made  by  the  American  engineer  Karl  G.  Jansky 
in  1932,  while  studying  thunderstorm  radio  dis¬ 
turbances  at  a  frequency  of  20.5  MHz  (14.6  m). 
He  discovered  radio  emission  of  unknown  ori¬ 
gin,  which  varied  within  a  24  hour  period.  Some¬ 
what  later  he  identified  the  source  of  this  radi¬ 
ation  to  be  in  the  direction  of  the  centre  of  our 
Galaxy. 

The  real  birth  of  radio  astronomy  may  perhaps 
be  dated  to  the  late  1930’s,  when  Grote  Reber 
started  systematic  observations  with  his  home¬ 
made  9.5  m  paraboloid  antenna.  Thereafter  ra¬ 
dio  astronomy  developed  quite  rapidly  and  has 
greatly  improved  our  knowledge  of  the  Universe. 

Observations  are  made  both  in  the  contin¬ 
uum  (broad  band)  and  in  spectral  lines  (radio 
spectroscopy).  Much  of  our  knowledge  about  the 
structure  of  our  Milky  Way  comes  from  radio  ob¬ 
servations  of  the  21  cm  line  of  neutral  hydrogen 
and,  more  recently,  from  the  2.6  mm  line  of  the 
carbon  monoxide  molecule.  Radio  astronomy  has 
resulted  in  many  important  discoveries;  e.g.  both 
pulsars  and  quasars  were  first  found  by  radio  as¬ 
tronomical  observations.  The  importance  of  the 
field  can  also  be  seen  from  the  fact  that  the  Nobel 
prize  in  physics  has  recently  been  awarded  three 
times  to  radio  astronomers. 

A  radio  telescope  collects  radiation  in  an  aper¬ 
ture  or  antenna,  from  which  it  is  transformed  to 
an  electric  signal  by  a  receiver,  called  a  radiome- 
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ter.  This  signal  is  then  amplified,  detected  and 
integrated,  and  the  output  is  registered  on  some 
recording  device,  nowadays  usually  by  a  com¬ 
puter.  Because  the  received  signal  is  very  weak, 
one  has  to  use  sensitive  receivers.  These  are  often 
cooled  to  minimise  the  noise,  which  could  other¬ 
wise  mask  the  signal  from  the  source.  Because  ra¬ 
dio  waves  are  electromagnetic  radiation,  they  are 
reflected  and  refracted  like  ordinary  light  waves. 
In  radio  astronomy,  however,  only  reflecting  tele¬ 
scopes  are  used. 

At  low  frequencies  the  antennas  are  usually 
dipoles  (similar  to  those  used  for  radio  or  TV), 
but  in  order  to  increase  the  collecting  area  and 
improve  the  resolution,  one  uses  dipole  arrays, 
where  all  dipole  elements  are  connected  to  each 
other. 

The  most  common  antenna  type,  however,  is 
a  parabolic  reflector,  which  works  exactly  as  an 
optical  mirror  telescope.  At  long  wavelengths  the 
reflecting  surface  does  not  need  to  be  solid,  be¬ 
cause  the  long  wavelength  photons  cannot  see  the 
holes  in  the  reflector,  and  the  antenna  is  there¬ 
fore  usually  made  in  the  form  of  a  metal  mesh. 
At  high  frequencies  the  surface  has  to  be  smooth, 
and  in  the  millimetre- submillimetre  range,  radio 
astronomers  even  use  large  optical  telescopes, 
which  they  equip  with  their  own  radiometers. 
To  ensure  a  coherent  amplification  of  the  signal, 
the  surface  irregularities  should  be  less  than  one- 
tenth  of  the  wavelength  used. 

The  main  difference  between  a  radio  telescope 
and  an  optical  telescope  is  in  the  recording  of 
the  signal.  Radio  telescopes  are  not  imaging  tele¬ 
scopes  (except  for  synthesis  telescopes,  which 
will  be  described  later);  instead,  a  feed  horn, 
which  is  located  at  the  antenna  focus,  transfers 
the  signal  to  a  receiver.  The  wavelength  and  phase 
information  is,  however,  preserved. 

The  resolving  power  of  a  radio  telescope,  0, 
can  be  deduced  from  the  same  formula  (3.4)  as 
for  optical  telescopes,  i.e.  k/D ,  where  A  is  the 
wavelength  used  and  D  is  the  diameter  of  the 
aperture.  Since  the  wavelength  ratio  between  ra¬ 
dio  and  visible  light  is  of  the  order  of  10,000,  ra¬ 
dio  antennas  with  diameters  of  several  kilometres 
are  needed  in  order  to  achieve  the  same  resolution 
as  for  optical  telescopes.  In  the  early  days  of  radio 


astronomy  poor  resolution  was  the  biggest  draw¬ 
back  for  the  development  and  recognition  of  radio 
astronomy.  For  example,  the  antenna  used  by  Jan¬ 
sky  had  a  fan  beam  with  a  resolution  of  about  30° 
in  the  narrower  direction.  Therefore  radio  obser¬ 
vations  could  not  be  compared  with  optical  ob¬ 
servations.  Neither  was  it  possible  to  identify  the 
radio  sources  with  optical  counterparts. 

The  world’s  biggest  radio  telescope  is  the 
Arecibo  antenna  in  Puerto  Rico,  whose  main  re¬ 
flector  has  a  diameter  of  305  m.  It  is  a  fixed  metal 
mesh  built  over  a  natural  round  valley  (Fig.  3.29). 
In  the  late  1970’s  the  antenna  surface  and  re¬ 
ceivers  were  upgraded,  enabling  the  antenna  to  be 
used  down  to  wavelengths  of  5  cm.  The  mirror  of 
the  Arecibo  telescope  is  not  parabolic  but  spher¬ 
ical,  and  the  antenna  is  equipped  with  a  movable 
feed  system,  which  makes  observations  possible 
within  a  20°  radius  around  the  zenith. 

A  similar  and  only  partly  steerable  radio  tele¬ 
scope  is  being  built  in  China.  It  will  be  the  largest 
in  the  world  with  an  antenna  diameter  of  500  m. 

The  biggest  completely  steerable  radio  tele¬ 
scope  is  the  Green  Bank  telescope  in  Virginia, 
U.S.A.,  dedicated  at  the  end  of  2000.  It  is  slightly 
asymmetric  with  a  diameter  of  100  x  110  m 
(Fig.  3.30).  Before  the  Green  Bank  telescope,  for 
over  two  decades  the  largest  telescope  was  the 
Effelsberg  telescope  in  Germany.  This  antenna 
has  a  parabolic  main  reflector  with  a  diameter  of 
100  m.  The  inner  80  m  of  the  dish  is  made  of 
solid  aluminium  panels,  while  the  outmost  por¬ 
tion  of  the  disk  is  a  metal  mesh  structure.  By  us¬ 
ing  only  the  inner  portion  of  the  telescope,  it  has 
been  possible  to  observe  down  to  wavelengths  of 
4  mm.  The  oldest  and  perhaps  best-known  big  ra¬ 
dio  telescope  is  the  76  m  antenna  at  Jodrell  Bank 
in  Britain,  which  was  completed  in  the  end  of  the 
1950’s. 

The  biggest  telescopes  are  usually  incapable 
of  operating  below  wavelengths  of  1  cm,  because 
the  surface  cannot  be  made  accurate  enough. 
However,  the  millimetre  range  has  become  more 
and  more  important.  In  this  wavelength  range 
there  are  many  transitions  of  interstellar  mole¬ 
cules,  and  one  can  achieve  quite  high  angular 
resolution  even  with  a  single  dish  telescope.  At 
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Fig.  3.29  The  largest  radio  telescope  in  the  world  is  the  Arecibo  dish  in  Puerto  Rico.  It  has  been  constructed  over 
a  natural  bowl  and  is  300  m  in  diameter.  (Photo  Arecibo  Observatory) 


Fig.  3.30  The  largest  fully  steerable  radio  telescope  is  in  Green  Bank,  Virginia.  Its  diameter  is  100  x  110  m.  (Photo 
NRAO) 


present,  the  typical  size  of  a  mirror  of  a  mil-  big  millimetre  telescopes  are  in  operation  (Ta- 
limetre  telescope  is  about  15  m.  The  develop-  ble  C.24).  Among  them  are  the  40  m  Nobey ama 
ment  of  this  field  is  rapid,  and  at  present  several  telescope  in  Japan,  which  can  be  used  down  to 
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Fig.  3.31  The  Atacama  Large  Millimetre  Array  (ALMA)  built  in  cooperation  by  Europe,  U.S.A.  and  Japan.  The 
telescope  has  64  antennas.  (Photo  ESO/NOAJ) 


3  mm,  the  30  m  IRAM  telescope  at  Pico  Veleta 
in  Spain,  which  is  usable  down  to  1  mm,  and  the 
15  m.  The  largest  project  in  the  first  decade  of 
the  21st  century  is  ALMA  (Atacama  Large  Mil¬ 
limetre  Array),  which  consists  of  64  telescopes 
with  a  diameter  of  12  m  (Fig.  3.31).  It  was  built 
as  an  international  project  by  the  United  States, 
Europe  and  Japan  in  Chile  on  the  Chajnantor 
plateau. 

As  already  mentioned,  the  resolving  power  of 
a  radio  telescope  is  far  poorer  than  that  of  an  op¬ 
tical  telescope.  The  biggest  radio  telescopes  can 
at  present  reach  a  resolution  of  5  arc  seconds, 
and  that  only  at  the  very  highest  frequencies.  To 
improve  the  resolution  by  increasing  the  size  is 
difficult,  because  the  present  telescopes  are  al¬ 
ready  close  to  the  practical  upper  limit.  However, 
by  combining  radio  telescopes  and  interferome¬ 
ters,  it  is  possible  to  achieve  even  better  resolu¬ 
tion  than  with  optical  telescopes. 

As  early  as  1891  Michelson  used  an  interfer¬ 
ometer  for  astronomical  purposes.  While  the  use 
of  interferometers  has  proved  to  be  quite  difficult 
in  the  optical  wavelength  regime,  interferometers 


Fig.  3  .32  The  principle  of  an  interferometer.  If  the  ra¬ 
diation  reaches  the  radio  telescopes  in  the  same  phase, 
the  waves  amplify  each  other  and  a  maximum  is  obtained 
in  the  combined  radiation  (cases  1  and  3).  If  the  incom¬ 
ing  waves  are  in  opposite  phase,  they  cancel  each  other 
(case  2) 

are  extremely  useful  in  the  radio  region.  To  form 
an  interferometer,  one  needs  at  least  two  anten¬ 
nas  coupled  together.  The  spacing  between  the 
antennas,  D ,  is  called  the  baseline.  Let  us  first 
assume  that  the  baseline  is  perpendicular  to  the 
line  of  sight  (Fig.  3.32).  Then  the  radiation  ar¬ 
rives  at  both  antennas  with  the  same  phase,  and 
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the  summed  signal  shows  a  maximum.  However, 
due  to  the  rotation  of  the  Earth,  the  direction  of 
the  baseline  changes,  producing  a  phase  differ¬ 
ence  between  the  two  signals.  The  result  is  a  sinu¬ 
soidal  interference  pattern,  in  which  minima  oc¬ 
cur  when  the  phase  difference  is  180  degrees.  The 
distance  between  the  peaks  is  given  by 

0D  =  A, 

where  0  is  the  angle  the  baseline  has  turned  and 
A  is  the  wavelength  of  the  received  signal.  The 
resolution  of  the  interferometer  is  thus  equal  to 
that  of  an  antenna  with  a  linear  size  equal  to  D. 

If  the  source  is  not  a  point  source,  the  radi¬ 
ation  emitted  from  different  parts  of  the  source 
will  have  phase  differences  when  it  enters  the  an¬ 
tennas.  In  this  case  the  minima  of  the  interference 
pattern  will  not  be  zero,  but  will  have  some  posi¬ 
tive  value  Emin-  If  we  denote  the  maximum  value 
of  the  interference  pattern  by  Pmax,  the  ratio 

P max  P min 

P max  T  P min 

gives  a  measure  of  the  source  size  (fringe  visibil¬ 
ity). 

More  detailed  information  about  the  source 
structure  can  be  obtained  by  changing  the  base¬ 
line  by  changing  the  relative  positions  of  the 
antennas.  In  principle  it  is  possible  to  observe 
the  same  amount  of  details  as  with  a  single  an¬ 
tenna  with  the  same  size  as  the  area  covered 
by  the  movable  antennas  in  their  different  posi¬ 
tions.  This  way  interferometry  is  transformed  into 
a  technique  called  aperture  synthesis. 

The  theory  and  techniques  of  aperture  synthe¬ 
sis  were  developed  particularly  by  the  British  as¬ 
tronomer  Sir  Martin  Ryle.  In  Fig.  3.33  the  prin¬ 
ciple  of  aperture  synthesis  is  illustrated.  If  the 
telescopes  are  located  on  an  east-west  track,  the 
spacing  between  them,  projected  onto  the  sky, 
will  in  12  hours  describe  a  circle  or  an  ellipse,  de¬ 
pending  on  the  position  of  the  source  as  the  Earth 
rotates  around  its  axis.  If  one  varies  the  distance 
between  the  telescopes,  one  will  get  a  series  of 
circles  or  ellipses  on  the  sky.  As  we  can  see  from 
Fig.  3.29,  one  does  not  have  to  cover  all  the  spac- 
ings  between  the  telescopes,  because  any  antenna 


Fig.  3  .33  To  illustrate  the  principle  of  aperture  synthe¬ 
sis,  let  us  consider  an  east- west  oriented  interferometer 
pointed  towards  the  celestial  north.  Each  antenna  is  iden¬ 
tical,  has  a  diameter  D  and  operates  at  a  wavelength  X. 
The  minimum  spacing  between  each  antenna  element  is  a, 
and  the  maximum  spacing  is  6a.  In  (a)  there  are  only 
two  antennas,  A  and  B,  displaced  by  the  maximum  spac¬ 
ing  6a.  When  the  earth  rotates,  antennas  A  and  B  will, 
in  the  course  of  12  hours,  track  a  circle  on  the  plane  of 
the  sky  with  a  diameter  X/(6a),  the  maximum  resolution 
that  can  be  achieved  with  this  interferometer.  In  (b)  the 
antenna  C  is  added  to  the  interferometer,  thus  providing 
two  more  baselines,  which  track  the  circles  AC  and  BC 
with  radii  of  X/(2a)  and  X/(4 a),  respectively.  In  (c)  there 
is  still  another  antenna  D  added  to  the  interferometer.  In 
this  case  two  of  the  baselines  are  equal,  AD  and  CD,  and 
therefore  only  two  new  circles  are  covered  on  the  plane 
of  the  sky.  By  adding  more  interferometer  elements,  one 
can  fill  in  the  missing  parts  within  the  primary  beam,  i.e. 
the  beam  of  one  single  dish,  and  thus  obtain  a  full  cov¬ 
erage  of  the  beam.  The  resolution  of  the  system  depends 
on  the  maximum  separation  of  the  antennas.  It  is  evident 
from  (c),  that  not  all  of  the  antenna  positions  are  needed  to 
provide  all  the  different  spacings;  some  antenna  spacings 
will  in  such  a  case  be  equal  and  therefore  provide  no  addi¬ 
tional  information.  It  is  essential  to  position  the  antennas 
in  such  a  way  that  as  many  different  baselines  as  possi¬ 
ble  are  formed.  The  observed  values  and  the  image  of  the 
target  are  related  by  a  Fourier  transform 


combination  which  has  the  same  relative  distance 
will  describe  the  same  path  on  the  sky.  In  this  way 
one  can  synthesise  an  antenna,  a  filled  aperture, 
with  a  size  equal  to  the  maximum  spacing  be¬ 
tween  the  telescopes.  Interferometers  working  ac¬ 
cording  to  this  principle  are  called  aperture  syn- 
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Fig.  3.34  The  VLA  at  Socorro,  New  Mexico,  is  a  synthesis  telescope  consisting  of  27  movable  antennas 


thesis  telescopes.  If  one  covers  all  the  spacings  up 
to  the  maximum  baseline,  the  result  will  be  an  ac¬ 
curate  map  of  the  source  over  the  primary  beam 
of  an  individual  antenna  element.  Aperture  syn¬ 
thesis  telescopes  therefore  produce  an  image  of 
the  sky,  i.e.  a  “radio  photograph ”. 

A  typical  aperture  synthesis  telescope  consists 
of  one  fixed  telescope  and  a  number  of  mov¬ 
able  telescopes,  usually  located  on  an  east-west 
track,  although  T  or  Y  configurations  are  also 
quite  common.  The  number  of  telescopes  used 
determines  how  fast  one  can  synthesise  a  larger 
disk,  because  the  number  of  possible  antenna 
combinations  increases  as  n(n  —  1),  where  n  is 
the  number  of  telescopes.  It  is  also  possible  to 
synthesise  a  large  telescope  with  only  one  fixed 
and  one  movable  telescope  by  changing  the  spac¬ 
ing  between  the  telescopes  every  12  hours,  but 
then  a  full  aperture  synthesis  can  require  several 
months  of  observing  time.  In  order  for  this  tech¬ 
nique  to  work,  the  source  must  be  constant,  i.e. 
the  signal  cannot  be  time  variable  during  the  ob¬ 
serving  session. 

The  most  efficient  aperture  synthesis  tele¬ 
scopes  at  present  are  the  VLA  (Very  Large  Array) 


in  New  Mexico,  USA  (Fig.  3.34)  and  ALMA  in 
Chile. 

VLA  consists  of  27  paraboloid  antennas,  each 
with  a  diameter  of  25  m,  which  are  located  on 
a  Y-shaped  track.  The  Y-formation  was  chosen 
because  it  provides  a  full  aperture  synthesis  in 
8  hours.  Each  antenna  can  be  moved  by  a  spe¬ 
cially  built  carrier,  and  the  locations  of  the  tele¬ 
scopes  are  chosen  to  give  optimal  spacings  for 
each  configuration.  In  the  largest  configuration 
each  arm  is  about  21  km  long,  thereby  resulting 
in  an  antenna  with  an  effective  diameter  of  35  km. 
If  the  VLA  is  used  in  its  largest  configuration 
and  at  its  highest  frequency,  23  GHz  (1.3  cm), 
the  resolution  achieved  is  0.1  arc  second,  clearly 
superior  to  any  optical  telescope.  Similar  resolu¬ 
tion  can  also  be  obtained  with  the  British  MER¬ 
LIN  telescope,  where  already  existing  telescopes 
have  been  coupled  together  by  radio  links.  Other 
well-known  synthesis  telescopes  are  the  Cam¬ 
bridge  5  km  array  in  Britain  and  the  Westerbork 
array  in  the  Netherlands,  both  located  on  east- 
west  tracks. 

Even  higher  resolution  can  be  obtained  with 
an  extension  of  the  aperture  synthesis  technique, 
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called  VLB  I  (Very  Long  Baseline  Interferome¬ 
try).  With  the  VLB  I  technique  the  spacing  be¬ 
tween  the  antennas  is  restricted  only  by  the  size 
of  the  Earth.  Some  of  the  antennas  can  even  be 
on  satellites,  in  which  case  the  baseline  can  be 
arbitrarily  long.  The  antennas  are  all  pointed  to¬ 
wards  the  same  source.  The  signal  is  recorded  to¬ 
gether  with  accurate  timing  signals  from  atomic 
clocks.  The  data  files  are  correlated  against  each 
other,  resulting  in  maps  similar  to  those  obtained 
with  a  normal  aperture  synthesis  telescope.  With 
VLBI  techniques  it  is  possible  to  achieve  resolu¬ 
tions  of  0.0000 1"  if  one  of  the  telescopes  is  or¬ 
biting  the  Earth.  Because  interferometry  is  very 
sensitive  to  the  distance  between  the  telescopes, 
the  VLBI  technique  also  provides  one  of  the  most 
accurate  methods  to  measure  distances.  Currently 
one  can  measure  distances  with  an  accuracy  of 
a  few  millimetres  on  intercontinental  baselines. 
This  is  utilised  in  geodetic  VLBI  experiments, 
which  study  continental  drift  and  polar  motion  as 
a  function  of  time. 

In  radio  astronomy  the  maximum  size  of  sin¬ 
gle  antennas  has  also  been  reached.  The  trend 
is  to  build  synthesis  antennas.  In  the  1990’s  The 
United  States  built  a  chain  of  antennas  extending 
across  the  whole  continent,  and  the  Australians 
have  constructed  a  similar,  but  north-south  an¬ 
tenna  chain  across  their  country. 

More  and  more  observations  are  being  made 
in  the  submillimetre  region.  The  disturbing  effect 
of  atmospheric  water  vapour  becomes  more  se¬ 
rious  at  shorter  wavelengths;  thus,  submillimetre 
telescopes  must  be  located  on  mountain  tops,  like 
optical  telescopes.  All  parts  of  the  mirror  are  ac¬ 
tively  controlled  in  order  to  accurately  maintain 
the  proper  form  like  in  the  new  optical  telescopes. 
Several  new  submillimetre  telescopes  are  under 
construction. 


3.5  Other  Wavelength  Regions 

The  Earth  receives  electromagnetic  radiation 
from  the  sky  at  all  wavelengths  However,  as 
mentioned  in  Sect.  3.1,  not  all  radiation  reaches 
the  ground.  The  wavelength  regions  absorbed  by 
the  atmosphere  have  only  been  studied  more  ex¬ 
tensively  since  the  1970’s,  using  Earth- orbiting 


satellites  (Fig.  3.37).  Besides  the  optical  and  ra¬ 
dio  regions,  there  are  only  some  narrow  wave¬ 
length  ranges  in  the  infrared  that  can  be  observed 
from  high  mountain  tops. 

The  first  observations  in  each  new  wavelength 
region  were  usually  carried  out  from  balloons,  but 
not  until  rockets  came  into  use  could  observa¬ 
tions  be  made  from  outside  the  atmosphere.  The 
first  actual  observations  of  an  X-ray  source,  for 
instance,  were  made  on  a  rocket  flight  in  June 
1962,  when  the  detector  rose  above  the  atmo¬ 
sphere  for  about  6  minutes.  Finally,  Earth  orbiting 
satellites  have  made  it  possible  to  map  the  whole 
sky  in  the  wavelength  regions  invisible  from  the 
ground. 

Gamma  Radiation  Gamma  ray  astronomy 
studies  radiation  quanta  with  energies  of 
105— 1014  eV.  The  boundary  between  gamma  and 
X-ray  astronomy,  105  eV,  corresponds  to  a  wave¬ 
length  of  10“ 11  m.  The  boundary  is  not  fixed;  the 
regions  of  hard  (=  high-energy)  X-rays  and  soft 
gamma  rays  partly  overlap. 

While  ultraviolet,  visible  and  infrared  radia¬ 
tion  are  all  produced  by  changes  in  the  energy 
states  of  the  electron  envelopes  of  atoms,  gamma 
and  hard  X-rays  are  produced  by  transitions  in 
atomic  nuclei  or  in  mutual  interactions  of  ele¬ 
mentary  particles.  Thus  observations  of  the  short¬ 
est  wavelengths  give  information  on  processes 
different  from  those  giving  rise  to  longer  wave¬ 
lengths. 

The  first  observations  of  gamma  sources  were 
obtained  at  the  end  of  the  1960’s,  when  a  device 
in  the  OSO  3  satellite  (Orbiting  Solar  Observa¬ 
tory)  detected  gamma  rays  from  the  Milky  Way. 
Later  on,  many  satellites  have  been  especially 
designed  for  gamma  astronomy.  The  most  re¬ 
cent  satellites  include  the  Compton  Gamma  Ray 
Observatory,  operating  in  1991-2000,  the  Euro¬ 
pean  Integral,  launched  in  2002,  and  the  Ameri¬ 
can  Fermi,  originally  called  GLAST. 

The  quanta  of  gamma  radiation  have  energies 
a  million  times  greater  than  those  of  visible  light, 
and  therefore  they  cannot  be  observed  with  sim¬ 
ilar  detectors.  Observations  are  made  with  vari¬ 
ous  scintillation  detectors ,  usually  composed  of 
several  layers  of  detector  plates,  where  gamma 
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Fig.  3.35  The  MAGIC  telescope  on  La  Palma  in  the  Ca¬ 
nary  islands  observes  the  Cherenkov  radiation  produced 
by  gamma  rays.  (Photo  Robert  Wagner,  Max  Planck  Insti- 
tut  fiir  Physik) 

radiation  is  transformed  by  the  photoelectric  ef¬ 
fect  into  visible  light,  detectable  by  photomulti¬ 
pliers. 

The  energy  of  a  gamma  quantum  can  be  deter¬ 
mined  from  the  depth  to  which  it  penetrates  the 
detector.  Analysing  the  trails  left  by  the  quanta 
gives  information  on  their  approximate  direction. 
The  field  of  view  is  limited  by  the  grating.  The 
directional  accuracy  is  low,  and  in  gamma  astron¬ 
omy  the  resolution  is  still  far  below  that  of  other 
wavelength  regions. 

Gamma  radiation  cannot  pass  through  the  at¬ 
mosphere,  yet  it  has  effects  that  can  be  observed 
with  ground  based  instruments.  When  a  high- 
energy  photon  enters  the  atmosphere  it  can  cre¬ 
ate  particle-antiparticle  pairs.  These  particles  can 
get  such  a  high  energy  that  they  move  faster  than 
the  local  speed  of  light  in  the  atmosphere.  Such 
particles  radiate  Cherenkov  radiation  that  can  be 
detected  as  visible  light  (Fig.  3.35). 

X-rays  The  observational  domain  of  X-ray  as¬ 
tronomy  includes  the  energies  between  102  and 
105  eV,  or  the  wavelengths  10-0.01  nm.  The  re¬ 
gions  10-0.1  nm  and  0.1-0.01  nm  are  called  soft 


and  hard  X-rays ,  respectively.  X-rays  were  dis¬ 
covered  in  the  late  19th  century.  Systematic  stud¬ 
ies  of  the  sky  at  X-ray  wavelengths  only  became 
possible  in  the  1970’s  with  the  advent  of  satellite 
technology. 

The  first  all- sky  mapping  was  made  in  the 
early  1970’s  by  SAS  1  (Small  Astronomical 
Satellite),  also  called  Uhuru.  At  the  end  of  the 
1970’s,  two  High-Energy  Astronomy  Observa¬ 
tories,  HE  AO  1  and  2  (the  latter  called  Einstein), 
mapped  the  sky  with  much  higher  sensitivity  than 
Uhuru. 

The  Einstein  Observatory  was  able  to  detect 
sources  about  a  thousand  times  fainter  than  ear¬ 
lier  X-ray  telescopes.  In  optical  astronomy,  this 
would  correspond  to  a  jump  from  a  15  cm  reflec¬ 
tor  to  a  5  m  telescope.  Thus  X-ray  astronomy  has 
developed  in  20  years  as  much  as  optical  astron¬ 
omy  in  300  years. 

The  latest  X-ray  satellites  have  been  the  Amer¬ 
ican  Chandra  and  the  European  XMM-Newton, 
both  launched  in  1999. 

Besides  satellites  mapping  the  whole  sky, 
there  have  been  several  satellites  observing  the  X- 
ray  radiation  of  the  Sun.  The  most  recent  ones  are 
the  Japanese  Yohkoh  and  Hinode  and  the  Ameri¬ 
can  RHESSI. 

The  first  X-ray  telescopes  used  detectors  sim¬ 
ilar  to  those  in  gamma  astronomy.  Their  di¬ 
rectional  accuracy  was  never  better  than  a  few 
arc  minutes.  The  more  precise  X-ray  telescopes 
utilise  the  principle  of  grazing  reflection 
(Fig.  3.36).  An  X-ray  hitting  a  surface  perpendic¬ 
ularly  is  not  reflected,  but  absorbed.  If,  however, 
X-rays  meet  the  mirror  nearly  parallel  to  its  sur¬ 
face,  just  grazing  it,  a  high  quality  surface  can 
reflect  the  ray. 

The  mirror  of  an  X-ray  reflector  is  on  the  in¬ 
ner  surface  of  a  slowly  narrowing  cone.  The  outer 
part  of  the  surface  is  a  paraboloid  and  the  inner 
part  a  hyperboloid.  The  rays  are  reflected  by  both 
surfaces  and  meet  at  a  focal  plane.  In  practice, 
several  tubes  are  installed  one  within  another.  For 
instance,  the  four  cones  of  the  Einstein  Obser¬ 
vatory  had  as  much  polished  optical  surface  as 
a  normal  telescope  with  a  diameter  of  2.5  m.  The 
resolution  in  X-ray  telescopes  is  of  the  order  of 
a  few  arc  seconds  and  the  field  of  view  about 
1  deg. 
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Fig.  3.36  X-rays  are  not  reflected  by  an  ordinary  mirror, 
and  the  principle  of  grazing  reflection  must  be  used  for 
collecting  them.  Radiation  meets  the  paraboloid  mirror  at 
a  very  small  angle,  is  reflected  onto  a  hyperboloid  mirror 

The  detectors  in  X-ray  astronomy  are  usually 
Geiger-Miiller  counters ,  proportional  counters 
or  scintillation  detectors.  Geiger-Miiller  and  pro¬ 
portional  counters  are  boxes  filled  with  gas.  The 
walls  form  a  cathode,  and  an  anode  wire  runs 
through  the  middle  of  the  box;  in  more  accu¬ 
rate  counters,  there  are  several  anode  wires.  An 
X-ray  quantum  entering  the  box  ionises  the  gas, 
and  the  potential  difference  between  the  anode 
and  cathode  gives  rise  to  a  current  of  electrons 
and  positive  ions.  In  recent  imaging  instruments 
(like  the  ACIS  in  Chandra)  also  CCD  cameras  are 
used. 

Ultraviolet  Radiation  Between  X-rays  and  the 
optical  region  lies  the  domain  of  ultraviolet  radi¬ 
ation,  with  wavelengths  between  10  and  400  nm. 
Most  ultraviolet  observations  have  been  carried 
out  in  the  soft  UV  region,  at  wavelengths  near 
those  of  optical  light,  since  most  of  the  UV  radi¬ 
ation  is  absorbed  by  the  atmosphere.  The  wave¬ 
lengths  below  300  nm  are  completely  blocked 
out.  The  short  wavelength  region  from  10  to 
91.2  nm  is  called  the  extreme  ultraviolet  (EUV, 
XUV). 

Extreme  ultraviolet  was  one  of  the  last  regions 
of  the  electromagnetic  radiation  to  be  observed 
systematically.  The  reason  for  this  is  that  the  ab¬ 
sorption  of  interstellar  hydrogen  makes  the  sky 
practically  opaque  at  these  wavelengths.  The  vis¬ 
ibility  in  most  directions  is  limited  to  some  hun¬ 
dred  light  years  in  the  vicinity  of  the  Sun.  In  some 
directions,  however,  the  density  of  the  interstellar 


and  further  to  a  focal  point.  In  practice,  several  mirrors  are 
placed  one  inside  another,  collecting  radiation  in  a  com¬ 
mon  focus 


gas  is  so  low  that  even  extragalactic  objects  can 
be  seen. 

The  first  dedicated  EUV  satellite  was  the  Ex¬ 
treme  Ultraviolet  Explorer  (EUVE),  operating  in 
1992-2000.  It  observed  about  a  thousand  EUV 
sources.  In  EUV  grazing  reflection  telescopes 
similar  to  those  used  in  X-ray  astronomy  are  em¬ 
ployed. 

In  nearly  all  branches  of  astronomy  important 
information  is  obtained  by  observations  of  ultra¬ 
violet  radiation.  Many  emission  lines  from  stel¬ 
lar  chromospheres  or  coronas,  the  Lyman  lines  of 
atomic  hydrogen,  and  most  of  the  radiation  from 
hot  stars  are  found  in  the  UV  domain.  In  the  near¬ 
ultraviolet ,  telescopes  can  be  made  similar  to  op¬ 
tical  telescopes  and,  equipped  with  a  photome¬ 
ter  or  spectrometer,  installed  in  a  satellite  orbiting 
the  Earth. 

The  most  effective  satellites  in  the  UV  have 
been  the  European  TD-1,  the  American  Orbit¬ 
ing  Astronomical  Observatories  OAO  2  and  3 
(Copernicus),  and  the  International  Ultraviolet 
Explorer  IUE.  The  instruments  of  the  TD-1  satel¬ 
lite  included  both  a  photometer  and  a  spectrome¬ 
ter.  The  satellite  measured  the  magnitudes  of  over 
30,000  stars  in  four  different  spectral  regions  be¬ 
tween  135  and  274  nm,  and  registered  UV  spec¬ 
tra  from  over  1000  stars.  The  OAO  satellites  were 
also  used  to  measure  magnitudes  and  spectra,  and 
OAO  3  worked  for  over  eight  years. 

The  IUE  satellite,  launched  in  1978,  was  one 
of  the  most  successful  astronomical  satellites. 
IUE  had  a  45  cm  Ritchey-Chretien  telescope  with 
an  aperture  ratio  of  // 15  and  a  field  of  view  of 
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Fig.  3.37  (a)  The  European  X-ray  satellite  XMM- 
Newton  was  launched  in  1999.  (Drawing  D.  Ducros, 
XMM  Team,  ESA.)  (b)  FUSE  satellite  has  photographed 

16  arc  minutes.  The  satellite  had  two  spectro¬ 
graphs  to  measure  spectra  of  higher  or  lower  res¬ 
olution  in  wavelength  intervals  of  115-200  nm 


far  ultraviolet  objects  from  Earth  orbit  since  1999.  (Graph¬ 
ics  NASA/JHU  Applied  Physics  Laboratory) 


or  190-320  nm.  For  registration  of  the  spec¬ 
tra,  a  Vidicon  camera  was  used.  IUE  was  differ¬ 
ent  from  earlier  satellites,  since  it  could  be  used 
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almost  like  ground  based  telescopes.  The  ob¬ 
server  could  follow  the  observations  and  change 
them  in  real  time.  IUE  worked  on  the  orbit  for 
20  years. 

The  Hubble  space  telescope  can  also  make 
observations  in  the  ultraviolet.  Some  ultraviolet 
telescopes  have  also  been  on  shuttle  flight  in  the 
1990’s. 

Infrared  Radiation  Radiation  with  longer 
wavelengths  than  visible  light  is  called  infrared 
radiation.  This  region  extends  from  about  1  mi¬ 
crometre  to  1  millimetre,  where  the  radio  region 
begins.  Sometimes  the  near-infrared ,  at  wave¬ 
lengths  below  5  pm,  and  the  submillimetre  do¬ 
main,  at  wavelengths  between  0. 1  and  1  mm,  are 
considered  separate  wavelength  regions. 

In  infrared  observations  radiation  is  collected 
by  a  telescope,  as  in  the  optical  region.  The  in¬ 
coming  radiation  consists  of  radiation  from  the 
object,  from  the  background  and  from  the  tele¬ 
scope  itself.  Both  the  source  and  the  background 
must  be  continually  measured,  the  difference  giv- 


Fig.  3.38  Refractors  are  not  suitable  for  infrared  tele¬ 
scopes,  because  infrared  radiation  cannot  penetrate  glass. 
The  Cassegrain  reflectors  intended  especially  for  infrared 
observations  have  secondary  mirrors  nodding  rapidly  back 
and  forth  between  the  object  and  the  background  near  the 
object.  By  subtracting  the  brightness  of  the  background 
from  the  brightness  of  the  object,  the  background  can  be 
eliminated 


ing  the  radiation  from  the  object.  The  background 
measurements  can  be  made  with  a  Cassegrain 
secondary  mirror  oscillating  between  the  source 
and  the  background  at  a  rate  of,  say,  100  oscil¬ 
lations  per  second,  and  thus  the  changing  back¬ 
ground  can  be  eliminated  (Fig.  3.38).  To  register 
the  measurements,  semiconductor  detectors  are 
used.  The  detector  must  always  be  cooled  to  min¬ 
imise  its  own  thermal  radiation.  Sometimes  the 
whole  telescope  is  cooled. 

Infrared  observatories  have  been  built  on  high 
mountain  tops,  where  most  of  the  atmospheric 
water  vapour  remains  below.  Some  favourable 
sites  are,  e.g.  Mauna  Kea  on  Hawaii,  Mount 
Lemon  in  Arizona  and  Pico  del  Teide  on  Tenerife. 
For  observations  in  the  far-infrared  these  moun¬ 
tains  are  not  high  enough;  these  observations  are 
carried  out,  e.g.  on  aeroplanes.  One  of  the  best- 
equipped  planes  is  the  Kuiper  Airborne  Observa¬ 
tory  KAO,  named  after  the  well-known  planetary 
scientist  Gerard  Kuiper. 

Balloons  and  satellites  are  also  used  for  in¬ 
frared  observations.  The  first  powerful  infrared 
observatory  was  the  InfraRed  Astronomy  Satel¬ 
lite  IRAS,  built  by  the  United  States  and  Holland 
and  launched  in  1983.  It  made  observations  for 
eight  months  and  mapped  the  sky  at  four  wave¬ 
lengths  (12,  25,  60  and  100  pm),  and  found  over 
200,000  new  infrared  sources. 

The  European  Infrared  Space  Observatory 
ISO,  made  more  detailed  observations  of  thou¬ 
sands  of  infrared  objects  in  1996-1998.  Since 
2003  Spitzer  (originally  SIRTF,  Space  InfraRed 
Telescope  Facility),  has  been  mapping  the  in¬ 
frared  sky.  In  2009  ESA  launched  its  own  Her- 
schel  satellite  (Fig.  3.39),  which  operated  till 
2013,  when  the  liquid  helium  used  for  cooling 
was  exhausted. 

Infrared  satellites  have  also  been  used  to  map 
the  afterglow  of  the  big  bang  of  the  universe. 
A  very  successful  satellite  was  the  1989  launched 
COBE  (Cosmic  Background  Explorer),  which 
mapped  the  background  radiation  in  submillime¬ 
tre  and  infrared  wavelengths.  The  Microwave 
Anisotropy  Probe  (MAP)  has  continued  the  work 
of  COBE,  starting  in  2001.  In  2009  the  even 
more  powerful  European  Planck  satellite  was 
launched. 
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Fig.  3  .39  The  European 
Herschel  launched  in  2009 
was  the  most  efficient 
recent  infrared  satellite. 

The  diameter  of  its  main 
mirror,  3.5  metres,  was 
bigger  than  in  any  earlier 
astronomical  satellite.  The 
final  shape  to  the  highly 
curved  mirror  was  given  by 
the  Opteon  company  at  the 
Tuorla  observatory  in 
Finland.  The  observations 
ended  in  2103  when  the 
liquid  helium  used  to  cool 
down  the  camera  ran  out. 
(Photos  ESA,  Rami 
Rekola) 


3.6  Other  Forms  of  Energy 

Besides  electromagnetic  radiation,  energy  arrives 
from  space  in  other  forms:  particles  ( cosmic  rays , 
neutrinos )  and  gravitational  radiation. 

Cosmic  Rays  Cosmic  rays,  consisting  of  elec¬ 
trons  and  totally  ionised  nuclei  of  atoms,  are  re¬ 
ceived  in  equal  amounts  from  all  directions.  Their 
incoming  directions  do  not  reveal  their  origin, 


since  cosmic  rays  are  electrically  charged;  thus 
their  paths  are  continually  changed  when  they 
move  through  the  magnetic  fields  of  the  Milky 
Way.  The  high  energies  of  cosmic  rays  mean  that 
they  have  to  be  produced  by  high-energy  phe¬ 
nomena  like  supernova  explosions.  The  major¬ 
ity  of  cosmic  rays  are  protons  (nearly  90  %) 
and  helium  nuclei  (10  %),  but  some  are  heav¬ 
ier  nuclei;  their  energies  lie  between  108  and 
1020  eV. 
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Fig.  3.39  (Continued) 


The  most  energetic  cosmic  rays  give  rise  to 
secondary  radiation ,  mainly  muons,  when  they 
hit  molecules  of  the  atmosphere.  This  secondary 
radiation  can  be  observed  from  the  ground.  Just 
like  particles  produces  by  gamma  rays  particles 
of  the  secondary  radiation  can  move  faster  than 

V 

light  in  the  atmosphere  emitting  Cerenkov  radi¬ 
ation.  Thus  the  cosmic  rays  can  also  be  detected 
by  their  Cerenkov  radiation. 

The  Pierre  Auger  telescope  in  Argentina  ob- 
serves  both  secondary  particles  and  Cerenkov  ra¬ 
diation  to  detect  cosmic  rays  with  energies  at  least 
of  the  order  of  10 18  eV.  Since  only  about  one 
such  particle  hits  an  area  of  one  square  kilometre 
in  a  century,  the  detectors  have  been  positioned  in 
an  area  of  3000  square  kilometres. 

Primary  cosmic  rays  can  only  be  directly  ob¬ 
served  outside  the  atmosphere.  The  detectors 
used  to  observe  cosmic  rays  are  similar  to  those 
used  in  particle  physics.  Since  Earth-based  accel¬ 
erators  reach  energies  of  only  about  10 12  eV,  cos¬ 
mic  rays  offer  an  excellent  “natural”  laboratory 
for  particle  physics.  Many  satellites  and  space¬ 
craft  have  detectors  for  cosmic  rays. 

Neutrinos  Neutrinos  are  elementary  particles 
with  no  electric  charge.  Earlier  it  was  thought  that 


they  are  massless,  but  currently  they  seem  to  have 
a  tiny  rest  mass.  They  interact  with  other  mat¬ 
ter  only  through  the  weak  nuclear  force,  which 
is  very  much  weaker  than  the  electromagnetic  or 
strong  nuclear  force. 

Most  neutrinos  are  produced  in  nuclear  reac¬ 
tions  within  stars  and  in  supernova  explosions. 
Since  they  react  very  weakly  with  other  matter, 
they  escape  directly  from  the  stellar  interior. 

Due  to  their  weak  interaction  neutrinos  are 
very  difficult  to  observe;  trillions  of  them  pass 
through  this  page  every  second  without  any  no¬ 
ticeable  consequences. 

The  first  method  of  detection  was  the  radio¬ 
chemical  method.  As  a  reactive  agent,  e.g.  tetra- 
chloroethene  (C2CI4)  can  be  used.  When  a  neu¬ 
trino  hits  a  chlorine  atom,  the  chlorine  is  trans¬ 
formed  into  argon,  and  an  electron  is  freed: 

37Cl  +  v^37Ar  +  e“. 

The  argon  atom  is  radioactive  and  can  be  ob¬ 
served.  Instead  of  chlorine,  lithium  and  gallium 
might  be  used  to  detect  neutrinos.  The  first  gal¬ 
lium  detectors  have  been  running  in  Italy  and 
Russia  from  the  end  of  the  1980’s. 

Another  observation  method  is  based  on  the 
Cerenkov  radiation  produced  by  neutrinos  in  ex- 
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Fig.  3.40  The  Sudbury  neutrino  observatory  (SNO)  is  at 
the  depth  of  two  kilometres  in  a  still  operational  mine.  The 
spherical  vessel  contains  liquid  where  neutrinos  produce 
Cherenkov  radiation.  9600  photomultipliers  are  used  to 
register  the  flashes  of  the  radiation.  Earlier  the  liquid  was 
heavy  water  but  that  was  later  replaced  with  alkyl  benzene. 
The  observations  have  already  solved  the  solar  neutrino 
problem  (Sect.  13.1).  The  new  SNO+  will  be  able  to  ob¬ 
serve  neutrinos  of  even  lower  energies.  (Photos  SNOlab, 
H.  Karttunen) 


tremely  pure  water.  The  flashes  of  light  are  regis¬ 
tered  with  photomultipliers,  and  thus  it  is  possi¬ 
ble  to  find  out  the  direction  of  the  radiation.  This 
method  is  used  e.g.  in  the  Japanese  Kamiokande 
detector  and  in  the  Sudbury  neutrino  observatory 
in  Canada  (Fig.  3.40). 

The  largest  neutrino  detector  is  the  IceCube  in 
the  Antarctic,  finished  in  2010.  It  consists  of  one 
cubic  kilometre  of  ice. 

Neutrino  detectors  must  be  located  deep  under 
the  ground  to  protect  them  from  the  secondary 
radiation  caused  by  cosmic  rays.  Also  radioactive 
decay  inside  the  Earth  cases  background  noise. 

The  detectors  have  observed  neutrinos  from 
the  Sun,  and  the  Supernova  1987 A  in  the  Large 
Magellanic  Cloud  was  also  observed  in  1987.  In 
the  future,  also  the  dark  matter  and  other  neutrino 
producing  phenomena  will  be  studied. 

Gravitational  Radiation  Gravitational  astron¬ 
omy  is  as  young  as  neutrino  astronomy.  The 
first  attempts  to  measure  gravitational  waves  were 
made  in  the  1960’s.  Gravitational  radiation  is 
emitted  by  accelerating  masses,  just  as  electro¬ 
magnetic  radiation  is  emitted  by  electric  charges 
in  accelerated  motion.  Detection  of  gravitational 
waves  is  very  difficult,  and  they  have  been  ob¬ 
served  only  very  recently. 

The  first  type  of  gravitational  wave  antenna 
was  the  Weber  cylinder.  It  is  an  aluminium  cylin¬ 
der  which  starts  vibrating  at  its  proper  frequency 
when  hit  by  a  gravitational  pulse.  The  distance 
between  the  ends  of  the  cylinder  changes  by 
about  10“ 17  m,  and  the  changes  in  the  length  are 
studied  by  strain  sensors  welded  to  the  side  of  the 
cylinder. 

Another  type  of  modern  gravity  radiation  de¬ 
tectors  measures  “spatial  strain”  induced  by  grav¬ 
ity  waves  and  consists  of  two  sets  of  mirrors  in 
directions  perpendicular  to  each  other  (Michel- 
son  interferometer),  or  one  set  of  parallel  mir¬ 
rors  (Fabry-Perot  interferometer).  The  relative 
distances  between  the  mirrors  are  monitored  by 
laser  interferometers.  If  a  gravity  pulse  passes  the 
detector,  the  distances  change  and  the  changes 
can  be  measured.  The  longest  baseline  between 
the  mirrors  is  in  the  American  LIGO  (Laser  Inter- 
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Fig.  3.41  The  LIGO  Livingston  Observatory  seen  from 
the  air.  (Photo  LIGO/Caltech) 


ferometer  Gravitational-wave  Observatory)  sys¬ 
tem  (Fig.  3.41).  LIGO  made  the  first  scientific 
observations  in  2002.  In  early  2016  the  first  pos¬ 
itive  detection  of  gravitational  waves  was  an¬ 
nounced.  The  observation  was  made  in  Septem¬ 
ber  14,  2015. 

Although  the  gravitational  detectors  are  iso¬ 
lated  from  their  surroundings  as  carefully  as  pos¬ 
sible,  many  perturbations  can  make  them  oscil¬ 
late.  A  positive  detection  requires  that  the  same 
phenomenon  is  observed  by  different  instruments 
located  far  away  from  each  others. 


Box  3.1  (Diffraction  by  a  Circular  Aperture) 
Consider  a  circular  hole  of  radius  R  in  the 
vy  plane.  Coherent  light  enters  the  hole  from 
the  direction  of  the  negative  z  axis  (see  fig¬ 
ure).  We  consider  light  rays  leaving  the  hole 
parallel  to  the  vz  plane  forming  an  angle  0 
with  the  z  axis.  The  light  waves  interfere  on 
a  screen  far  away.  The  phase  difference  be¬ 
tween  a  wave  through  a  point  (x ,  y )  and  a  wave 
going  through  the  centre  of  the  hole  can  be 
calculated  from  the  different  path  lengths  s  = 
v  sin  0: 

s  2tt  sin  6 
8  =  —2i r  = - v  =  kx. 


3.7  Examples 
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We  substitute  x  =  Rt  and  define  p  =  kR  = 
( 2nr  sin0)/A,  thus  getting 


a  oc 


t 2  cos  pt  dt. 


The  zero  points  of  the  intensity  observed  on 
the  screen  are  obtained  from  the  zero  points  of 
the  amplitude, 


j(p)  = 


t 2  cos  pt  dt  =  0. 


Inspecting  the  function  J(p ),  we  see  that  the 
first  zero  is  at  p  =  3.8317,  or 


2tt  R  sin  0 
A 


3.8317. 


The  radius  of  the  diffraction  disk  in  angular 
units  can  be  estimated  from  the  condition 


sin  0  = 


3.8317A 

2nR 


A 

1.22—, 

D 


where  D  =  2R  is  the  diameter  of  the  hole. 

In  mirror  telescopes  diffraction  is  caused 
also  by  the  support  structure  of  the  secondary 
mirror.  If  the  aperture  is  more  complex  and 
only  elementary  mathematics  is  used  calcula¬ 
tions  may  become  rather  cumbersome.  How¬ 
ever,  it  can  be  shown  that  the  diffraction  pattern 
can  be  obtained  as  the  Fourier  transform  of  the 
aperture. 


3.7  Examples 

Example  3.1  The  distance  between  the  compo¬ 
nents  of  the  binary  star  f  Herculis  is  1 .38".  What 
should  the  diameter  of  a  telescope  be  to  resolve 
the  binary?  If  the  focal  length  of  the  objective  is 
80  cm,  what  should  the  focal  length  of  the  eye¬ 
piece  be  to  resolve  the  components,  when  the  res¬ 
olution  of  the  eye  is  2r? 

In  the  optical  region,  we  can  use  the  wave¬ 
length  value  of  A  ^  550  nm.  The  diameter  of  the 
objective  is  obtained  from  the  equation  for  the 


resolution  (3.4), 


A 

D  &  —  = 


550  x  1CT9 


6  (1.38/3600)  x  (jt/180) 

=  0.08  m  =  8  cm. 


m 


The  required  magnification  is 


co  = 


1.38 


// 


=  87. 


The  magnification  is  given  by 


co  = 


r’ 


and,  thus,  the  focal  length  of  the  eyepiece  should 
be 


,  f  80  cm 


f  co  87 


=  0.9  cm. 


Example  3.2  A  telescope  has  an  objective  with 
a  diameter  of  90  mm  and  focal  length  of  1200  mm. 


(a)  What  is  the  focal  length  of  an  eyepiece,  the 
exit  pupil  of  which  is  6  mm  (about  the  size  of 
the  pupil  of  the  eye)? 

(b)  What  is  the  magnification  of  such  an  eye¬ 
piece? 

(c)  What  is  the  angular  diameter  of  the  Moon 
seen  through  this  telescope  and  eyepiece? 

(a)  From  Fig.  3.7  we  get 


whence 


f'  =  f 


L 

D 


=  1200  mm 


6  mm 
90  mm 


=  80  mm. 


(b)  The  magnification  is  co  =  / /  f  =  1200  mm/ 
80  mm  =  15. 

(c)  Assuming  the  angular  diameter  of  the  Moon 
is  a  =  3F  =  0.52°,  its  diameter  through  the 
telescope  is  coa  =  7.8°. 
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3.8  Exercises 

Exercise  3.1  The  Moon  was  photographed  with 
a  telescope,  the  objective  of  which  had  a  diameter 
of  20  cm  and  focal  length  of  150  cm.  The  expo¬ 
sure  time  was  0.1  s. 

(a)  What  should  the  exposure  time  be,  if  the  di¬ 
ameter  of  the  objective  were  15  cm  and  focal 
length  200  cm? 

(b)  What  is  the  size  of  the  image  of  the  Moon  in 
both  cases? 


(c)  Both  telescopes  are  used  to  look  at  the  Moon 
with  an  eyepiece  the  focal  length  of  which  is 
25  mm.  What  are  the  magnifications? 

Exercise  3.2  The  radio  telescopes  at  Amherst, 
Massachusetts,  and  Onsala,  Sweden,  are  used  as 
an  interferometer,  the  baseline  being  2900  km. 

(a)  What  is  the  resolution  at  22  GHz  in  the  direc¬ 
tion  of  the  baseline? 

(b)  What  should  be  the  size  of  an  optical  tele¬ 
scope  with  the  same  resolution? 


Photometric  Concepts  and  Magnitudes 


Most  astronomical  observations  utilise  electro¬ 
magnetic  radiation  in  one  way  or  another.  We  can 
obtain  information  on  the  physical  nature  of  a  ra¬ 
diation  source  by  studying  the  energy  distribution 
of  its  radiation.  We  shall  now  introduce  some  ba¬ 
sic  concepts  that  characterise  electromagnetic  ra¬ 
diation. 


4.1  Intensity,  Flux  Density  and 
Luminosity 

Let  us  assume  we  have  some  radiation  passing 
through  a  surface  element  d A  (Fig.  4.1).  Some  of 
the  radiation  will  leave  d A  within  a  solid  angle 
d co;  the  angle  between  d co  and  the  normal  to  the 
surface  is  denoted  by  6.  The  amount  of  energy 
with  frequency  in  the  range  [v,  v  +  dv]  entering 
this  solid  angle  in  time  d t  is 

dEv  =  Iv  cosO  dAdv  dcodt.  (4.1) 

Here,  the  coefficient  Iv  is  the  specific  intensity 
of  the  radiation  at  the  frequency  v  in  the  di¬ 
rection  of  the  solid  angle  d co.  Its  dimension  is 
W  m-2  Hz-1  sterad-1 . 

The  projection  of  the  surface  element  d A  as 
seen  from  the  direction  0  is  d An  =  dAcos0, 
which  explains  the  factor  cos0.  If  the  intensity 
does  not  depend  on  direction,  the  energy  d Ev  is 
directly  proportional  to  the  surface  element  per¬ 
pendicular  to  the  direction  of  the  radiation. 

The  intensity  including  all  possible  frequen¬ 
cies  is  called  the  total  intensity  /,  and  is  obtained 


Fig.  4.1  The  intensity  Iv  of  radiation  is  related  to  the  en¬ 
ergy  passing  through  a  surface  element  dA  into  a  solid 
angle  dco,  in  a  direction  0 


by  integrating  Iv  over  all  frequencies: 


More  important  quantities  from  the  observa¬ 
tional  point  of  view  are  the  energy  flux  (Lv,  L) 
or,  briefly,  the  flux  and  the  flux  density  (Fv,  F). 
The  flux  density  gives  the  power  of  radiation  per 
unit  area;  hence  its  dimension  is  Wm_2Hz_1 
or  W  m-2,  depending  on  whether  we  are  talking 
about  the  flux  density  at  a  certain  frequency  or 
about  the  total  flux  density. 

Observed  flux  densities  are  usually  rather 
small,  and  Wm-2  would  be  an  inconveniently 
large  unit.  Therefore,  especially  in  radio  astron- 
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omy,  flux  densities  are  often  expressed  in  Jan¬ 
sky  s\  one  Jansky  (Jy)  equals  10-26  W  m-2  Hz-1 . 

When  we  are  observing  a  radiation  source,  we 
in  fact  measure  the  energy  collected  by  the  de¬ 
tector  during  some  period  of  time,  which  equals 
the  flux  density  integrated  over  the  radiation¬ 
collecting  area  of  the  instrument  and  the  time  in¬ 
terval. 

The  flux  density  Fv  at  a  frequency  v  can  be 
expressed  in  terms  of  the  intensity  as 

Fv  = - - -  f  d Ev 

dA  dv  dt  Js 

=  /  Iv  cos  6  dco,  (4.2) 

Js 

where  the  integration  is  extended  over  all  possible 
directions.  Analogously,  the  total  flux  density  is 


I  cos  6  dco. 


For  example,  if  the  radiation  is  isotropic ,  i.e.  if 
I  is  independent  of  the  direction,  we  get 

F=  /  cos  0  dco  —  /  /  cosOdco.  (4.3) 

Js  Js 

The  solid  angle  element  dco  is  equal  to  a  surface 
element  on  a  unit  sphere.  In  spherical  coordinates 
it  is  (Fig.  4.2;  also  cf.  Appendix  A. 5): 


dco  =  sin  0  dO  d<fi . 


Substitution  into  (4.3)  gives 

/>7r  r>2n 

F  =  I  /  cos  0  sin  0  dO  d<p  =  0, 

Jo  =  o  J(f)  =  o 

so  there  is  no  net  flux  of  radiation.  This  means 
that  there  are  equal  amounts  of  radiation  entering 
and  leaving  the  surface.  If  we  want  to  know  the 
amount  of  radiation  passing  through  the  surface, 
we  can  find,  for  example,  the  radiation  leaving  the 
surface.  For  isotropic  radiation  this  is 

/»  7t/2  /*  2tt 

F\  =  I  I  /  cos  0  sin  0  dO  dcp  =  Tt  F  (4.4) 

Jo  =  0  J(f)  =  0 

In  the  astronomical  literature,  terms  such  as 
intensity  and  brightness  are  used  rather  vaguely. 


Fig.  4.2  An  infinitesimal  solid  angle  dco  is  equal  to 
the  corresponding  surface  element  on  a  unit  sphere: 
dco  =  sin  0  d 0  dcp 

Flux  density  is  hardly  ever  called  flux  density 
but  intensity  or  (with  luck)  flux.  Therefore  the 
reader  should  always  carefully  check  the  mean¬ 
ing  of  these  terms. 

Flux  means  the  power  going  through  some 
surface,  expressed  in  watts.  The  flux  emitted  by 
a  star  into  a  solid  angle  co  is  L  =  cor2  F,  where 
F  is  the  flux  density  observed  at  a  distance  r.  To¬ 
tal  flux  is  the  flux  passing  through  a  closed  sur¬ 
face  encompassing  the  source.  Astronomers  usu¬ 
ally  call  the  total  flux  of  a  star  the  luminosity  L . 
We  can  also  talk  about  the  luminosity  Lv  at  a  fre¬ 
quency  v  ([Lv]  =  WHz-1).  (This  must  not  be 
confused  with  the  luminous  flux  used  in  physics; 
the  latter  takes  into  account  the  sensitivity  of  the 
eye.) 

If  the  source  (like  a  typical  star)  radiates 
isotropically,  its  radiation  at  a  distance  r  is  dis¬ 
tributed  evenly  on  a  spherical  surface  whose  area 
is  4nr2  (Fig.  4.3).  If  the  flux  density  of  the  radi¬ 
ation  passing  through  this  surface  is  F ,  the  total 
flux  is 

L  =  4nr2F.  (4.5) 

If  we  are  outside  the  source,  where  radiation  is 
not  created  or  destroyed,  the  luminosity  does  not 
depend  on  distance.  The  flux  density,  on  the  other 
hand,  falls  off  proportional  to  1/r2. 


4.2  Apparent  Magnitudes 
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For  extended  objects  (as  opposed  to  objects 
such  as  stars  visible  only  as  points)  we  can  de¬ 
fine  the  surface  brightness  as  the  flux  density  per 
unit  solid  angle  (Fig.  4.4).  Now  the  observer  is  at 
the  apex  of  the  solid  angle.  The  surface  bright¬ 
ness  is  independent  of  distance,  which  can  be  un¬ 
derstood  in  the  following  way.  The  flux  density 
arriving  from  an  area  A  is  inversely  proportional 
to  the  distance  squared.  But  also  the  solid  an¬ 
gle  subtended  by  the  area  A  is  proportional  to 
1/r2  ( co  =  A/r2).  Thus  the  surface  brightness 
B  =  F/cn  remains  constant. 


Fig.  4.3  An  energy  flux  which  at  a  distance  r  from  a  point 
source  is  distributed  over  an  area  A  is  spread  over  an 
area  4 A  at  a  distance  2 r .  Thus  the  flux  density  decreases 
inversely  proportional  to  the  distance  squared 


The  energy  density  u  of  radiation  is  the  amount 
of  energy  per  unit  volume  (J  m~3): 

u  =  -  f  I  d co.  (4.6) 

c  Js 

This  can  be  seen  as  follows.  Suppose  we  have  ra¬ 
diation  with  intensity  I  arriving  from  a  solid  an¬ 
gle  d co  perpendicular  to  the  surface  dA  (Fig.  4.5). 
In  the  time  dt,  the  radiation  travels  a  distance  c  d t 
and  fills  a  volume  d  V  =  cdtdA.  Thus  the  energy 
in  the  volume  dV  is  (now  cos  6  =  1) 

1 

d E  =  I  dAdcodt  =  -I  dcodV. 

c 

Hence  the  energy  density  du  of  the  radiation  ar¬ 
riving  from  the  solid  angle  d co  is 

d  E  1 

du  =  —  =  -I  dco, 
dV  c 

and  the  total  energy  density  is  obtained  by  inte¬ 
grating  this  over  all  directions.  For  isotropic  radi¬ 
ation  we  get 

An 

u  =  — I.  (4.7) 

c 


4.2  Apparent  Magnitudes 

As  early  as  the  second  century  B.C.,  Hipparchos 
divided  the  visible  stars  into  six  classes  according 
to  their  apparent  brightness.  The  first  class  con¬ 
tained  the  brightest  stars  and  the  sixth  the  faintest 
ones  still  visible  to  the  naked  eye. 


Fig.  4.4  An  observer  sees 
radiation  coming  from 
a  constant  solid  angle  co. 
The  area  giving  off 
radiation  into  this  solid 
angle  increases  when  the 
source  moves  further  away 
(A  oc  r2).  Therefore  the 
surface  brightness  or  the 
observed  flux  density  per 
unit  solid  angle  remains 
constant 
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d  t 
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A 
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Fig.  4.5  In  time  d t,  the  radiation  fills  a  volume 
dV  =  cdtdA,  where  d A  is  the  surface  element  perpen¬ 
dicular  to  the  propagation  direction  of  the  radiation 

The  response  of  the  human  eye  to  the  bright¬ 
ness  of  light  is  not  linear.  If  the  flux  densities 
of  three  stars  are  in  the  proportion  1:10:100,  the 
brightness  difference  of  the  first  and  second  star 
seems  to  be  equal  to  the  difference  of  the  second 
and  third  star.  Equal  brightness  ratios  correspond 
to  equal  apparent  brightness  differences:  the  hu¬ 
man  perception  of  brightness  is  logarithmic. 

The  rather  vague  classification  of  Hipparchos 
was  replaced  in  1856  by  Norman  R.  Pogson.  The 
new,  more  accurate  classification  followed  the  old 
one  as  closely  as  possible,  resulting  in  another  of 
those  illogical  definitions  typical  of  astronomy. 
Since  a  star  of  the  first  class  is  about  one  hun¬ 
dred  times  brighter  than  a  star  of  the  sixth  class, 
Pogson  defined  the  ratio  of  the  brightnesses  of 
classes  n  and  n  +  1  as  \/ 100  =  2.512. 

The  brightness  class  or  magnitude  can  be  de¬ 
fined  accurately  in  terms  of  the  observed  flux  den¬ 
sity  F  ([E]  =  W  m-2).  We  decide  that  the  magni¬ 
tude  0  corresponds  to  some  preselected  flux  den¬ 
sity  Fq.  All  other  magnitudes  are  then  defined  by 
the  equation 


m  =  — 2.51g  (4.8) 

Fo 

Note  that  the  coefficient  is  exactly  2.5,  not  2.512! 
Magnitudes  are  dimensionless  quantities,  but  to 
remind  us  that  a  certain  value  is  a  magnitude,  we 
can  write  it,  for  example,  as  5  mag  or  5m. 

It  is  easy  to  see  that  (4.8)  is  equivalent  to  Pog- 
son’s  definition.  If  the  magnitudes  of  two  stars 
are  m  and  m  +  1  and  their  flux  densities  Fm 
and  Fm+\ ,  respectively,  we  have 


m  —  (m  +  1)  =  —2.5  lg  — —  +  2.5  lg  m+1 


=  -2.51g 


F 


Fo 


m 


Fm+1 


whence 

-Ai_  =  4/T00. 

Fm+ 1 

In  the  same  way  we  can  show  that  the  magnitudes 
m\  and  m2  of  two  stars  and  the  corresponding 
flux  densities  F\  and  F2  are  related  by 

F\ 

mi— m2  =  —2.5  lg — .  (4.9) 

F2 

Magnitudes  extend  both  ways  from  the  orig¬ 
inal  six  values.  The  magnitude  of  the  brightest 
star,  Sirius,  is  in  fact  negative  —1.5.  The  magni¬ 
tude  of  the  Sun  is  —26.8  and  that  of  a  full  moon 
—  12.5.  The  magnitude  of  the  faintest  objects  ob¬ 
served  depends  on  the  size  of  the  telescope,  the 
sensitivity  of  the  detector  and  the  exposure  time. 
The  limit  keeps  being  pushed  towards  fainter  ob¬ 
jects;  currently  the  magnitudes  of  the  faintest  ob¬ 
served  objects  are  over  30. 


4.3  Magnitude  Systems 

The  apparent  magnitude  m,  which  we  have  just 
defined,  depends  on  the  instrument  we  use  to 
measure  it.  The  sensitivity  of  the  detector  is  dif¬ 
ferent  at  different  wavelengths.  Also,  different 
instruments  detect  different  wavelength  ranges. 
Thus  the  flux  measured  by  the  instrument  equals 
not  the  total  flux,  but  only  a  fraction  of  it.  De¬ 
pending  on  the  method  of  observation,  we  can  de¬ 
fine  various  magnitude  systems.  Different  mag¬ 
nitudes  have  different  zero  points,  i.e.  they  have 
different  flux  densities  Fq  corresponding  to  the 
magnitude  0.  The  zero  points  are  usually  defined 
by  a  few  selected  standard  stars. 

In  daylight  the  human  eye  is  most  sensitive  to 
radiation  with  a  wavelength  of  about  550  nm,  the 
sensitivity  decreasing  towards  red  (longer  wave¬ 
lengths)  and  violet  (shorter  wavelengths).  The 
magnitude  corresponding  to  the  sensitivity  of  the 
eye  is  called  the  visual  magnitude  mv. 

Photographic  plates  are  usually  most  sensitive 
at  blue  and  violet  wavelengths,  but  they  are  also 
able  to  register  radiation  not  visible  to  the  hu¬ 
man  eye.  Thus  the  photographic  magnitude  mpg 
usually  differs  from  the  visual  magnitude.  The 
sensitivity  of  the  eye  can  be  simulated  by  using 
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a  yellow  filter  and  plates  sensitised  to  yellow  and 
green  light.  Magnitudes  thus  observed  are  called 
photovisual  magnitudes  mpv. 

If,  in  ideal  case,  we  were  able  to  measure  the 
radiation  at  all  wavelengths,  we  would  get  the 
bolometric  magnitude  m\>0\.  In  practice  this  is 
very  difficult,  since  part  of  the  radiation  is  ab¬ 
sorbed  by  the  atmosphere;  also,  different  wave¬ 
lengths  require  different  detectors.  (In  fact  there 
is  a  gadget  called  the  bolometer,  which,  however, 
is  not  a  real  bolometer  but  an  infrared  detector.) 
The  bolometric  magnitude  can  be  derived  from 
the  visual  magnitude  if  we  know  the  bolometric 
correction  BC: 

mbol  =  mY  —  BC.  (4.10) 

By  definition,  the  bolometric  correction  is  zero 
for  radiation  of  solar  type  stars  (or,  more  pre¬ 
cisely,  stars  of  the  spectral  class  F5).  Although  the 
visual  and  bolometric  magnitudes  can  be  equal, 
the  flux  density  corresponding  to  the  bolometric 
magnitude  must  always  be  higher.  The  reason  of 
this  apparent  contradiction  is  in  the  different  val¬ 
ues  of  Fq. 

The  more  the  radiation  distribution  differs 
from  that  of  the  Sun,  the  higher  the  bolometric 
correction  is.  The  correction  is  positive  for  stars 
both  cooler  or  hotter  than  the  Sun.  Sometimes 
the  correction  is  defined  as  mt,0l  =  rav  +  BC  in 
which  case  BC  <  0  always.  The  chance  for  er¬ 
rors  is,  however,  very  small,  since  we  must  have 
^bol  25-  mv. 

The  most  accurate  magnitude  measurements 
are  made  using  photoelectric  photometers  or 
CCD  cameras.  Usually  filters  are  used  to  allow 
only  a  certain  wavelength  band  to  enter  the  de¬ 
tector.  One  of  the  multicolour  magnitude  sys¬ 
tems  used  widely  in  photoelectric  photometry  is 
the  UBV  system  developed  in  the  early  1950’s 
by  Harold  L.  Johnson  and  William  W.  Morgan. 
Magnitudes  are  measured  through  three  filters, 
U  =  ultraviolet,  B  =  blue  and  V  =  visual.  Fig¬ 
ure  4.6  and  Table  4.1  give  the  wavelength  bands 
of  these  filters.  The  magnitudes  observed  through 
these  filters  are  called  U ,  B  and  V  magnitudes, 
respectively. 

The  UBV  system  was  later  augmented  by 
adding  more  bands.  One  commonly  used  system 


Fig.  4.6  Relative  transmission  profiles  of  filters  used  in 
the  UBVRI  magnitude  system.  The  maxima  of  the  bands 
are  normalised  to  unity.  The  R  and  I  bands  are  based  on 
the  system  of  Johnson,  Cousins  and  Glass,  which  includes 
also  infrared  bands  J,  H,  K,  L  and  M.  Previously  used  R 
and  I  bands  differ  considerably  from  these.  The  curves  of 
the  ugriz  magnitudes  (« dashed  lines)  give  quantum  efficien¬ 
cies.  They  include  the  atmospheric  extinction  for  airmass 
1.3  (Sect.  4.5) 


Table  4.1  Wavelength  bands  of  the  UBVRI  and  uvby  fil¬ 
ters  and  their  effective  (~  average)  wavelengths 


Magnitude 

Band 

width  [nm] 

Effective 
wavelength  [nm] 

u 

ultraviolet 

66 

367 

B 

blue 

94 

436 

V 

visual 

88 

545 

R 

red 

138 

638 

I 

infrared 

149 

797 

u 

ultraviolet 

30 

349 

V 

violet 

19 

411 

b 

blue 

18 

467 

y 

yellow 

23 

547 

is  the  five  colour  UBVRI  system,  which  includes 
R  =  red  and  I  =  infrared  filters. 

There  are  also  other  broad  band  systems,  but 
they  are  not  as  well  standardised  as  the  UBV, 
which  has  been  defined  moderately  well  using 
a  great  number  of  standard  stars  all  over  the  sky. 
The  magnitude  of  an  object  is  obtained  by  com¬ 
paring  it  to  the  magnitudes  of  standard  stars. 

In  Stromgren ’s  four-colour  or  uvby  system ,  the 
bands  passed  by  the  filters  are  much  narrower 
than  in  the  UBV  system.  The  uvby  system  is  also 
well  standardised,  but  it  is  not  quite  as  common 
as  the  UBV.  Other  narrow  band  systems  exist  as 
well.  By  adding  more  filters,  more  information  on 
the  radiation  distribution  can  be  obtained. 
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In  any  multicolour  system,  we  can  define 
colour  indices ;  a  colour  index  is  the  difference 
of  two  magnitudes.  By  subtracting  the  B  magni¬ 
tude  from  U  we  get  the  colour  index  U  —  B,  and 
so  on.  If  the  UB  V  system  is  used,  it  is  common  to 
give  only  the  V  magnitude  and  the  colour  indices 
U  -  B  and  B  -  V. 

The  constants  Fo  in  (4.8)  for  U,  B  and  V  mag¬ 
nitudes  have  been  selected  in  such  a  way  that  the 
colour  indices  B  —  V  and  U  —  B  are  zero  for 
stars  of  spectral  type  AO  (for  spectral  types,  see 
Chap.  8).  The  surface  temperature  of  such  a  star  is 
about  10,000  K.  For  example,  Vega  (a  Lyr,  spec¬ 
tral  class  AOV)  has  V  =  0.04,  B  —  V  —  U  —  B  — 
0.00.  The  Sun  has  V  =  —26.8,  B  —  V  =  0.64  and 
U  -  B  =  0.12. 

Before  the  UBV  system  was  developed,  a 
colour  index  C.I.,  defined  as  C.I.  =  mpg  —  mv 
was  used.  The  definition  shows  that  C.I.  corre¬ 
sponds  to  the  colour  index  B  —  V .  In  fact,  C.I.  = 
(B  -  V)  -0.11. 

Nowadays  it  is  becoming  more  customary  to 
use  the  AB  system  (AB solute),  in  which  Fq  is 
the  same,  3631  Jy,  for  all  wavelength  bands.  For 
instance  the  ugriz  magnitudes  used  by  the  Sloan 
Digital  Sky  Survey  (SDSS)  are  based  on  this  sys¬ 
tem.  There  are  several  different  transformation 
equations  between  the  UBV  and  ugriz  systems 
for  different  kinds  of  objects.  For  ordinary  stars, 
we  can  use  the  following: 

V  =  g-  0.2906  (u-g)  +  0.0885, 


Fig.  4.7  The  apparent  magnitude  at  a  distance  r  depends 
on  the  flux  density  F(r).  The  absolute  magnitude  is  de¬ 
fined  as  the  apparent  magnitude  at  a  distance  of  10  parsecs 
from  the  star  depending  only  on  the  flux  density  F(10)  10 
parsecs  away  from  the  star 

the  apparent  magnitude  at  a  distance  of  10  par¬ 
secs  from  the  star  (Fig.  4.7).  Officially  this  defi¬ 
nition  was  accepted  in  the  general  meeting  of  the 
IAUin  1922. 

We  shall  now  derive  an  equation  which  relates 
the  apparent  magnitude  m,  the  absolute  magni¬ 
tude  M  and  the  distance  r.  Because  the  flux  em¬ 
anating  from  a  star  into  a  solid  angle  co  has,  at 
a  distance  r,  spread  over  an  area  cor2,  the  flux 
density  is  inversely  proportional  to  the  distance 
squared.  Therefore  the  ratio  of  the  flux  density  at 
a  distance  r,  F(r),  to  the  flux  density  at  a  distance 
of  10  parsecs,  F(10),  is 


V  =  g-  0.5784  (g  -  r)  -  0.0038, 
R  =  r  -  0.1837  (g  -  r)  -  0.0971, 
R  =  r-  0.2936  (r  -  i)  -  0. 1439, 
I  =  r  —  1.2444  (r  —  i)  —  0.3820, 
I  =  i-  0.3780  (/  -z)~  0.3974. 


(4.11) 


F(r)  _  / 10  pc\2 

nm  ~  )  ' 

Thus  the  difference  of  magnitudes  at  r  and  10  pc, 
or  the  distance  modulus  m  —  M,  is 


m  —  M  =  —2.5  lg 


F(r) 
F  (10) 


-2.5  lg 


10  pc 
r 


2 


4.4  Absolute  Magnitudes 

Thus  far  we  have  discussed  only  apparent  mag¬ 
nitudes.  They  do  not  tell  us  anything  about  the 
true  brightness  of  stars,  since  the  distances  differ. 
A  quantity  measuring  the  intrinsic  brightness  of 
a  star  is  the  absolute  magnitude.  It  is  defined  as 


or 

m  —  M  =  5\g  — ^ — .  (4.12) 

10  pc 

For  historical  reasons,  this  equation  is  almost  al¬ 
ways  written  as 

m  —  M  =  5  lgr  —  5, 


(4.13) 
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Fig.  4.8  The  interstellar 
medium  absorbs  and 
scatters  radiation;  this 
usually  reduces  the  energy 
flux  L  in  the  solid  angle  co 
(d L  <  0) 


which  is  valid  only  if  the  distance  is  expressed  in 
parsecs.  (The  logarithm  of  a  dimensional  quan¬ 
tity  is,  in  fact,  physically  absurd.)  Sometimes  the 
distance  is  given  in  kiloparsecs  or  megaparsecs, 
which  require  different  constant  terms  in  (4.13). 
To  avoid  confusion,  we  highly  recommend  the 
form  (4.12). 

Absolute  magnitudes  are  usually  denoted  by 
capital  letters.  Note,  however,  that  the  U ,  B  and 
V  magnitudes  are  apparent  magnitudes.  The  cor¬ 
responding  absolute  magnitudes  are  Mu,  Mb  and 
My. 

The  absolute  bolometric  magnitude  can  be  ex¬ 
pressed  in  terms  of  the  luminosity.  Let  the  total 
flux  density  at  a  distance  r  =  10  pc  be  F  and  let 
Fq  be  the  equivalent  quantity  for  the  Sun.  Since 
the  luminosity  is  L  =  Anr2F ,  we  get 


(4.12)  no  longer  holds,  because  part  of  the  ra¬ 
diation  is  absorbed  by  the  medium  (and  usually 
re-emitted  at  a  different  wavelength,  which  may 
be  outside  the  band  defining  the  magnitude),  or 
scattered  away  from  the  line  of  sight.  All  these 
radiation  losses  are  called  the  extinction. 

Now  we  want  to  find  out  how  the  extinction 
depends  on  the  distance.  Assume  we  have  a  star 
radiating  a  flux  Lo  into  a  solid  angle  co  in  some 
wavelength  range.  Since  the  medium  absorbs  and 
scatters  radiation,  the  flux  L  will  now  decrease 
with  increasing  distance  r  (Fig.  4.8).  In  a  short 
distance  interval  [r,  r  +  dr],  the  extinction  dL  is 
proportional  to  the  flux  L  and  the  distance  trav¬ 
elled  in  the  medium: 

dL  =  —aLdr.  (4.15) 


or 


Mboi  Mboi,© 


Mboi  Mboi,© 


-2.5  lg 


F 


-2.5  lg 


L/Anr 2 
L©/47rr2  ’ 


(4.14) 


The  factor  a  tells  how  effectively  the  medium  can 
obscure  radiation.  It  is  called  the  opacity.  From 
(4.15)  we  see  that  its  dimension  is  [a]  =  m-1. 
The  opacity  is  zero  for  a  perfect  vacuum  and  ap¬ 
proaches  infinity  when  the  substance  becomes  re¬ 
ally  murky.  We  can  now  define  a  dimensionless 
quantity,  the  optical  thickness  x  by 


dr  =  a  dr.  (4.16) 


The  absolute  bolometric  magnitude  Mboi  =  0  cor-  Substituting  this  into  (4.15)  we  get 
responds  to  a  luminosity  Lo  =  3.0  x  1028  w. 

dL  =  —L  dr. 


4.5  Extinction  and  Optical 
Thickness 

Equation  (4.12)  shows  how  the  apparent  magni¬ 
tude  increases  (and  brightness  decreases!)  with 
increasing  distance.  If  the  space  between  the  ra¬ 
diation  source  and  the  observer  is  not  completely 
empty,  but  contains  some  interstellar  medium, 


Next  we  integrate  this  from  the  source  (where 
L  =  Lo  and  r  =  0)  to  the  observer: 

rL  dL  r 

which  gives 


L  =  Lqq  t  . 


(4.17) 
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Here,  r  is  the  optical  thickness  of  the  material  be¬ 
tween  the  source  and  the  observer  and  L,  the  ob¬ 
served  flux.  Now,  the  flux  L  falls  off  exponen¬ 
tially  with  increasing  optical  thickness.  Empty 
space  is  perfectly  transparent,  i.e.  its  opacity  is 
a  =  0;  thus  the  optical  thickness  does  not  in¬ 
crease  in  empty  space,  and  the  flux  remains  con¬ 
stant. 

Let  Fq  be  the  flux  density  on  the  surface  of 
a  star  and  F(r),  the  flux  density  at  a  distance  r. 
We  can  express  the  fluxes  as 

L  =  cor2  F  (r) ,  Lq  =  coR2Fq, 

where  R  is  the  radius  of  the  star.  Substitution  into 
(4.16)  gives 


R 2 

F(r)  =  F0^e-T. 

For  the  absolute  magnitude  we  need  the  flux  den¬ 
sity  at  a  distance  of  10  parsecs,  E(10),  which  is 
still  evaluated  without  extinction: 


F(  10)  =  F0 


R2 

(10  pc)2' 


The  distance  modulus  m  —  M  is  now 


m  —  M  —  —2.5  lg 


=  51g 


—  5  lg 


10  pc 


10  pc 


F(r) 
F  (10) 


-  2.5  lge 


+  (2.5  lg  e)r 


or 


m  —  M  =  5  lg 


10  pc 


+  A, 


(4.18) 


where  A  >  0  is  the  extinction  in  magnitudes  due 
to  the  entire  medium  between  the  star  and  the  ob¬ 
server.  If  the  opacity  is  constant  along  the  line  of 
sight,  we  have 


where  the  constant  a  =  2.5a  lge  gives  the  extinc¬ 
tion  in  magnitudes  per  unit  distance. 

Colour  Excess  Another  effect  caused  by  the  in¬ 
terstellar  medium  is  the  reddening  of  light :  blue 
light  is  scattered  and  absorbed  more  than  red. 
Therefore  the  colour  index  B  —  V  increases.  The 
visual  magnitude  of  a  star  is,  from  (4.18), 

V  =  M\  +  5  lg  -J—  +  Ay,  (4.20) 

10  pc 

where  My  is  the  absolute  visual  magnitude  and 
Ay  is  the  extinction  in  the  V  passband.  Similarly, 
we  get  for  the  blue  magnitudes 

B  =  Mb  +  5  lg  — - h  Ab  . 

10  pc 

The  observed  colour  index  is  now 

B  —  V  =  Mb  —  My  +  Ab  —  Ay, 


or 


£-!/  =  (£- V)o  +  £B-v,  (4.21) 

where  ( B  —  V)o  =  Mb  —  My  is  the  intrinsic 
colour  of  the  star  and  Eb-v  =  (B  —  V)  —  (B  — 
V)0  is  the  colour  excess.  Studies  of  the  interstel¬ 
lar  medium  show  that  the  ratio  of  the  visual  ex¬ 
tinction  Ay  to  the  colour  excess  Eb-v  is  almost 
constant  for  all  stars: 


E  b— v 


This  makes  it  possible  to  find  the  visual  extinction 
if  the  colour  excess  is  known: 

Ay  ~  3.0  Eb-v-  (4.22) 

When  Ay  is  obtained,  the  distance  can  be  solved 
directly  from  (4.19),  when  V  and  My  are  known. 

We  shall  study  interstellar  extinction  in  more 
detail  in  Sect.  15.1  (“Interstellar  Dust”). 


r  =  a 


dr  =  ar , 


and  (4.18)  becomes 


r 

m  —  M  =  5  lg - 1-  ar , 

&  10  pc 


Atmospheric  Extinction  As  we  mentioned  in 
Sect.  3.1,  the  Earth’s  atmosphere  also  causes  ex¬ 
tinction.  The  observed  magnitude  m  depends  on 
the  location  of  the  observer  and  the  zenith  dis¬ 
tance  of  the  object,  since  these  factors  deter¬ 
mine  the  distance  the  light  has  to  travel  in  the 
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Fig.  4.9  If  the  zenith  distance  of  a  star  is  z,  the  light  of 
the  star  travels  a  distance  H /  cosz  in  the  atmosphere;  H  is 
the  height  of  the  atmosphere 


atmosphere.  To  compare  different  observations, 
we  must  first  reduce  them,  i.e.  remove  the  at¬ 
mospheric  effects  somehow.  The  magnitude  mo 
thus  obtained  can  then  be  compared  with  other 
observations. 

If  the  zenith  distance  z  is  not  too  large,  we  can 
approximate  the  atmosphere  by  a  plane  layer  of 
constant  thickness  (Fig.  4.9).  If  the  thickness  of 
the  atmosphere  is  used  as  a  unit,  the  light  must 
travel  a  distance 

X  =  1  /  cos  z  =  sec  z  (4.23) 

in  the  atmosphere.  The  quantity  X  is  the  air  mass. 
According  to  (4.18),  the  magnitude  increases  lin¬ 
early  with  the  distance  X : 

m  =  mo  +  kX,  (4.24) 

where  k  is  the  extinction  coefficient. 


The  extinction  coefficient  can  be  determined 
by  observing  the  same  source  several  times  dur¬ 
ing  a  night  with  as  wide  a  zenith  distance  range 
as  possible.  The  observed  magnitudes  are  plot¬ 
ted  in  a  diagram  as  a  function  of  the  air  mass  X . 
The  points  lie  on  a  straight  line  the  slope  of 
which  gives  the  extinction  coefficient  k.  When 
this  line  is  extrapolated  to  X  =  0,  we  get  the  mag¬ 
nitude  mo,  which  is  the  apparent  magnitude  out¬ 
side  the  atmosphere. 

In  practice,  observations  with  zenith  distances 
higher  than  70°  (or  altitudes  less  than  20°)  are 
not  used  to  determine  k  and  mo,  since  at  low  al¬ 
titudes  the  curvature  of  the  atmosphere  begins  to 
complicate  matters.  The  value  of  the  extinction 
coefficient  k  depends  on  the  observation  site  and 
time  and  also  on  the  wavelength,  since  extinction 
increases  strongly  towards  short  wavelengths. 


4.6  Examples 

Example  4.1  Show  that  intensity  is  independent 
of  distance. 

Suppose  we  have  some  radiation  leaving  the 
surface  element  d A  in  the  direction  0 .  The  energy 
entering  the  solid  angle  d co  in  time  d t  is 

d E  =  I  cos  6  d  A  d co  dt , 

where  I  is  the  intensity.  If  we  have  another  sur¬ 
face  d A'  at  a  distance  r  receiving  this  radiation 
from  direction  O',  we  have 

d  co  =  dA'  cosO' /r2. 


The  definition  of  the  intensity  gives 


d  E  =  I'  cos  0'  dA'  dco'  dt, 


where  I'  is  the  intensity  at  dA'  and 

dco'  =  d  A  cos  0/r2. 
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Substitution  of  dco  and  d co'  into  the  expressions 
of  d E  gives 


dA'  cos  6' 

I  cos  0  d 0  d  A - z - dt 


,  ,  .dAcos# 

=  /  cos  0  d A - « —  dt 


=>  r  =  l 


The  solid  angle  subtended  by  the  Sun  is 

co  =  7i  ^  =  n  x  0.004652 

=  6.81  x  10-5  sterad, 
and  the  surface  brightness 


Thus  the  intensity  remains  constant  in  empty 
space. 

Example  4.2  (Surface  Brightness  of  the  Sun) 
Assume  that  the  Sun  radiates  isotropically.  Let  R 
be  the  radius  of  the  Sun,  Fq  the  flux  density  on 
the  surface  of  the  Sun  and  F  the  flux  density  at 
a  distance  r.  Since  the  luminosity  is  constant, 

L  =  4n  R2  Fq  =  4nr2F, 


s 

B  =  —  =  2.01  x  107  W  m2  sterad-1 . 
co 

Example  4.3  (Magnitude  of  a  Binary  Star) 
Since  magnitudes  are  logarithmic  quantities,  they 
can  be  a  little  awkward  for  some  purposes.  For 
example,  we  cannot  add  magnitudes  like  flux 
densities.  If  the  magnitudes  of  the  components 
of  a  binary  star  are  1  and  2,  the  total  magnitude 
is  certainly  not  3.  To  find  the  total  magnitude,  we 
must  first  solve  the  flux  densities  from 


the  flux  density  equals 


R 2 

F  =  Fq  —  . 

y  ^ 

At  a  distance  r  R,  the  Sun  subtends  a  solid 
angle 


1  =  — 2.51g 


which  give 


Fi=  F0x  IQ"0-4, 


F2  =  F0x  lO-0'8. 


Thus  the  total  flux  density  is 


A 


where  A  =  ttR2  is  the  cross  section  of  the  Sun. 
The  surface  brightness  B  is 


F  =  Fi  +  F2  =  F0(  IQ-04  +  lO-08) 


and  the  total  magnitude, 


m  =  —2.5  lg 


F0(10-0-4  +  lO-08) 

To 


=  —2.5  lgO.5566  =  0.64. 


Applying  (4.4)  we  get 


Thus  the  surface  brightness  is  independent  of  dis¬ 
tance  and  equals  the  intensity.  We  have  found 
a  simple  interpretation  for  the  somewhat  abstract 
concept  of  intensity. 

The  flux  density  of  the  Sun  on  the  Earth,  the 
solar  constant ,  is  Sq  ~  1370  W  m-2.  The  angular 
diameter  of  the  Sun  is  a  =  32r,  whence 


a  1  32  it 

—  =  -  x  —  x - =  0.00465  rad. 

2  2  60 


Example  4.4  The  distance  of  a  star  is  r  = 
100  pc  and  its  apparent  magnitude  m  =  6.  What 
is  its  absolute  magnitude? 

Substitution  into  (4.12) 


r 

m  —  M  =  5  lg - 

10  pc 


gives 

100 

M  =  6  —  5  lg - =  1. 

&  10 

Example  4.5  The  absolute  magnitude  of  a  star 
is  M  =  —  2  and  the  apparent  magnitude  m  =  8. 
What  is  the  distance  of  the  star? 


4.7  Exercises 
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We  can  solve  the  distance  r  from  (4.12): 
r  =  10  pc  X  10 (>”-M)/5  =  1Q  x  J  Q 10/5  pc 
=  1000  pc  =  1  kpc. 

Example  4.6  Although  the  amount  of  interstel¬ 
lar  extinction  varies  considerably  from  place  to 
place,  we  can  use  an  average  value  of  2  mag/kpc 
near  the  galactic  plane.  Find  the  distance  of  the 
star  in  Example  4.5,  assuming  such  extinction. 
Now  the  distance  must  be  solved  from  (4.19): 

8  —  (—2)  =  5  lg  —  +  0.002  r, 

where  r  is  in  parsecs.  This  equation  cannot  be 
solved  analytically,  but  we  can  always  use  a  nu¬ 
merical  method.  We  try  a  simple  iteration  (Ap¬ 
pendix  A.7),  rewriting  the  equation  as 

r  =  10  x  io2-°  0004r. 

The  value  r  =  1000  pc  found  previously  is  a  good 
initial  guess: 

ro  =  1000, 

ri  =  10  x  io2-0  0004><  1000  =  398, 
r2  =  693, 


ri2  =  ri3  =  584. 

The  distance  is  r  ~  580  pc,  which  is  much  less 
than  our  earlier  value  1000  pc.  This  should  be 
quite  obvious,  since  due  to  extinction,  radiation 
is  now  reduced  much  faster  than  in  empty  space. 

Example  4.7  What  is  the  optical  thickness  of 
a  layer  of  fog,  if  the  Sun  seen  through  the  fog 
seems  as  bright  as  a  full  moon  in  a  cloudless  sky? 

The  apparent  magnitudes  of  the  Sun  and  the 
Moon  are  —26.8  and  —12.5,  respectively.  Thus 
the  total  extinction  in  the  cloud  must  be  A  =  14.3. 
Since 

A  =  (2.5  lge)r, 

we  get 

r  =  A/(2.51ge)  =  14.3/1.086  =  13.2. 


The  optical  thickness  of  the  fog  is  13.2.  In  reality, 
a  fraction  of  the  light  scatters  several  times,  and 
a  few  of  the  multiply  scattered  photons  leave  the 
cloud  along  the  line  of  sight,  reducing  the  total 
extinction.  Therefore  the  optical  thickness  must 
be  slightly  higher  than  our  value. 

Example  4.8  (Reduction  of  Observations)  The 
altitude  and  magnitude  of  a  star  were  measured 
several  times  during  a  night.  The  results  are  given 
in  the  following  table. 


Altitude 

Zenith 

distance 

Air 

mass 

Magnitude 

50° 

40° 

1.31 

0.90 

35° 

55° 

1.74 

0.98 

25° 

65° 

2.37 

1.07 

20° 

70° 

2.92 

1.17 

By  plotting  the  observations  as  in  the  follow¬ 
ing  figure,  we  can  determine  the  extinction  coef¬ 
ficient  k  and  the  magnitude  mo  outside  the  atmo¬ 
sphere.  This  can  be  done  graphically  (as  here)  or 
using  a  least-squares  fit. 


Extrapolation  to  the  air  mass  X  =  0  gives 
mo  =  0.68.  The  slope  of  the  line  gives  k  =  0.17. 


4.7  Exercises 

Exercise  4.1  The  total  magnitude  of  a  triple  star 
is  0.0.  Two  of  its  components  have  magnitudes 
1.0  and  2.0.  What  is  the  magnitude  of  the  third 
component? 

Exercise  4.2  The  absolute  magnitude  of  a  star 
in  the  Andromeda  galaxy  (distance  690  kpc)  is 
M  =  5.  It  explodes  as  a  supernova,  becoming  one 
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billion  (109)  times  brighter.  What  is  its  apparent 
magnitude? 

Exercise  4.3  Assume  that  all  stars  have  the 
same  absolute  magnitude  and  stars  are  evenly 
distributed  in  space.  Let  N(m)  be  the  number  of 
stars  brighter  than  m  magnitudes.  Find  the  ratio 
N(m  +  \)/N(m). 

Exercise  4.4  The  V  magnitude  of  a  star  is  15.1, 
B  —  V  =  1.6,  and  absolute  magnitude  M\  =  1.3. 
The  extinction  in  the  direction  of  the  star  in  the 


visual  band  is  ay  =  1  mag  kpc  1 .  What  is  the  in¬ 
trinsic  colour  of  the  star? 

Exercise  4.5  Stars  are  observed  through  a  triple 
window.  Each  surface  reflects  away  15  %  of  the 
incident  light. 

(a)  What  is  the  magnitude  of  Regulus  (My  = 
1.36)  seen  through  the  window? 

(b)  What  is  the  optical  thickness  of  the  window? 


Radiation  Mechanisms 


In  the  previous  chapters  we  have  studied  the 
physical  properties  and  detection  of  electromag¬ 
netic  radiation.  Next  we  shall  briefly  discuss  con¬ 
cepts  related  to  emission  and  absorption  of  radi¬ 
ation.  Since  we  can  give  here  only  a  summary  of 
some  essential  results  without  delving  into  quan¬ 
tum  mechanical  explanations,  the  reader  inter¬ 
ested  in  the  details  is  advised  to  consult  any  good 
physics  textbook. 


5.1  Radiation  of  Atoms  and 
Molecules 

Electromagnetic  radiation  is  emitted  or  absorbed 
when  an  atom  or  a  molecule  moves  from  one  en¬ 
ergy  level  to  another.  If  the  energy  of  the  atom 
decreases  by  an  amount  A  E,  the  atom  emits  or 
radiates  a  quantum  of  electromagnetic  radiation, 
called  a  photon ,  whose  frequency  v  is  given  by 
the  equation 

A  E  =  hv,  (5.1) 

where  h  is  the  Planck  constant ,  h  =  6.6256  x 
10-34  Js.  Similarly,  if  the  atom  receives  or  ab¬ 
sorbs  a  photon  of  a  frequency  v,  its  energy  in¬ 
creases  by  A E  =  hv. 

The  classical  model  describes  an  atom  as 
a  nucleus  surrounded  by  a  swarm  of  electrons. 
The  nucleus  consists  of  Z  protons,  each  hav¬ 
ing  a  charge  +e  and  N  electrically  neutral  neu¬ 
trons;  Z  is  the  charge  number  of  the  atom  and 
A  =  Z  +  N  is  its  mass  number.  A  neutral  atom 
has  as  many  electrons  (charge  —  e)  as  protons. 


An  energy  level  of  an  atom  usually  refers  to 
an  energy  level  of  its  electrons.  The  energy  E  of 
an  electron  cannot  take  arbitrary  values;  only  cer¬ 
tain  energies  are  allowed:  the  energy  levels  are 
quantised.  An  atom  can  emit  or  absorb  radiation 
only  at  certain  frequencies  v#  corresponding  to 
energy  differences  between  some  initial  and  final 
states  i  and  f:  \E{  —  Ef  \  =  hv if.  This  gives  rise 
to  the  line  spectrum ,  specific  for  each  element 
(Fig.  5.1).  Hot  gas  under  low  pressure  produces 
an  emission  spectrum  consisting  of  such  discrete 
lines.  If  the  same  gas  is  cooled  down  and  ob¬ 
served  against  a  source  of  white  light  (which  has 
a  continuous  spectrum),  the  same  lines  are  seen 
as  dark  absorption  lines. 

At  low  temperatures  most  atoms  are  in  their 
lowest  energy  state,  the  ground  state.  Higher  en¬ 
ergy  levels  are  excitation  states ;  a  transition  from 
lower  to  higher  state  is  called  excitation.  Usually 
the  excited  atom  will  return  to  the  lower  state  very 
rapidly,  radiating  a  photon  (, spontaneous  emis¬ 
sion );  a  typical  lifetime  of  an  excited  state  might 
be  1CU8  seconds.  The  frequency  of  the  emitted 
photon  is  given  by  (5.1).  The  atom  may  return  to 
the  lower  state  directly  or  through  some  interme¬ 
diate  states,  emitting  one  photon  in  each  transi¬ 
tion. 

Downward  transitions  can  also  be  induced 
by  radiation.  Suppose  our  atom  has  swallowed 
a  photon  and  become  excited.  Another  photon, 
whose  frequency  v  corresponds  to  some  possi¬ 
ble  downward  transition  from  the  excited  state, 
can  now  irritate  the  atom,  causing  it  to  jump  to 
a  lower  state,  emitting  a  photon  with  the  same 
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Fig.  5.1  Origin  of  line  spectra,  (a)  Emission  spectrum. 
Atoms  of  glowing  gas  returning  from  excited  states  to 
lower  states  emit  photons  with  frequencies  corresponding 
to  the  energy  difference  of  the  states.  Each  element  emits 
its  own  characteristic  wavelengths,  which  can  be  mea- 

frequency  v.  This  is  called  induced  or  stimulated 
emission.  Photons  emitted  spontaneously  leave 
the  atom  randomly  in  all  directions  with  random 
phases:  the  radiation  is  isotropic  and  incoherent. 
Induced  radiation,  on  the  other  hand,  is  coherent ; 
it  propagates  in  the  same  direction  as  and  in  phase 
with  the  inducing  radiation. 

Also  collisions  of  atoms  cause  transitions 
up  and  down.  In  these  transitions  photons  are 
not  emitted,  instead  the  kinetic  energies  of  the 
atoms  are  changed.  (An  upward  transition  can,  of 
course,  cause  e.g.  a  spontaneous  emission.)  The 
higher  the  density  of  the  gas  the  more  dominant 
the  collisional  transitions  are. 

The  zero  level  of  the  energy  states  is  usually 
chosen  so  that  a  bound  electron  has  negative  en¬ 
ergy  and  a  free  electron  positive  energy  (cf.  the 
energy  integral  of  planetary  orbits,  Chap.  6).  If 
an  electron  with  energy  E  <  0  receives  more  en¬ 
ergy  than  \E\,  it  will  leave  the  atom,  which  be¬ 
comes  an  ion.  In  astrophysics  ionisation  is  often 
called  a  bound-free  transition  (Fig.  5.2).  Unlike 


sured  by  spreading  the  light  into  a  spectrum  with  a  prism 
or  diffraction  grating,  (b)  Absorption  spectrum.  When 
white  light  containing  all  wavelengths  travels  through  gas, 
the  wavelengths  characteristic  of  the  gas  are  absorbed 


Fig.  5.2  Different  kinds  of  transitions  between  energy 
levels.  Absorption  and  emission  occur  between  two  bound 
states,  whereas  ionisation  and  recombination  occur  be¬ 
tween  a  bound  and  a  free  state.  Interaction  of  an  atom  with 
an  free  electron  can  result  in  a  free-free  transition 

in  excitation  all  values  of  energy  (E  >  0)  are  now 
possible.  The  extraneous  part  of  the  absorbed  en¬ 
ergy  goes  to  the  kinetic  energy  of  the  liberated 
electron.  The  inverse  process,  in  which  an  atom 
captures  a  free  electron,  is  the  recombination  or 
free-bound  transition. 


5.2  The  Hydrogen  Atom 
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Fig.  5.3  Polarisation  of  light.  The  light  of  an  incandes¬ 
cent  bulb  contains  all  possible  directions  of  vibration  and 
is  therefore  unpolarised.  Some  crystals,  for  example,  pass 
electric  fields  oscillating  only  in  certain  directions,  and  the 
transmitted  part  of  the  light  becomes  linearly  polarised. 
E  is  the  electric  held  and  B  the  magnetic  held 

When  an  electron  scatters  from  a  nucleus  or  an 
ion  without  being  captured,  the  electromagnetic 
interaction  can  change  the  kinetic  energy  of  the 
electron  producing  free-free  radiation.  In  a  very 
hot  gas  (T  >  106  K)  hydrogen  is  fully  ionised, 
and  the  free-free  radiation  is  the  most  important 
source  of  emission.  It  is  then  usually  called  ther¬ 
mal  bremsstrahlung.  The  latter  part  of  the  name 
derives  from  the  fact  that  decelerating  electrons 
hitting  the  anode  of  an  X-ray  tube  emit  similar  ra¬ 
diation.  In  an  analogous  way  the  absorption  pro¬ 
cess  can  be  called  a  bound-bound  transition. 

Electromagnetic  radiation  is  transverse  wave 
motion;  the  electric  and  magnetic  fields  oscillate 
perpendicular  to  each  other  and  also  perpendic¬ 
ular  to  the  direction  of  propagation.  The  light  of 
an  ordinary  incandescent  lamp  has  a  random  dis¬ 
tribution  of  electric  fields  vibrating  in  all  direc¬ 
tions.  If  the  directions  of  electric  fields  in  the 
plane  perpendicular  to  the  direction  of  propaga¬ 
tion  are  not  evenly  distributed,  the  radiation  is 
polarised  (Fig.  5.3).  The  direction  of  polarisation 
of  linearly  polarised  light  means  the  plane  deter¬ 
mined  by  the  electric  vector  and  the  direction  of 
the  light  ray.  If  the  electric  vector  describes  a  cir¬ 
cle,  the  radiation  is  circularly  polarised.  If  the 
amplitude  of  the  electric  field  varies  at  the  same 
time,  the  polarisation  is  elliptic. 

If  polarised  radiation  travels  through  a  mag¬ 
netic  field,  the  direction  of  the  polarisation  will 
rotate.  The  amount  of  such  Faraday  rotation  is 
proportional  to  the  component  of  the  magnetic 
field  parallel  to  the  line  of  sight,  number  of  elec¬ 
trons  along  the  line  of  sight,  distance  travelled, 
and  square  of  the  wavelength  of  the  radiation. 

Scattering  is  an  absorption  followed  by  an 
instantaneous  emission  at  the  same  wavelength 


but  usually  in  a  new  direction.  On  the  macro¬ 
scopic  scale,  radiation  seems  to  be  reflected  by 
the  medium.  The  light  coming  from  the  sky  is 
sunlight  scattered  from  atmospheric  molecules. 
Scattered  light  is  always  polarised,  the  degree  of 
polarisation  being  highest  in  the  direction  perpen¬ 
dicular  to  the  direction  of  the  original  radiation. 


5.2  The  Hydrogen  Atom 

The  hydrogen  atom  is  the  simplest  atom,  consist¬ 
ing  of  a  proton  and  an  electron.  According  to  the 
Bohr  model  the  electron  orbits  the  proton  in  a  cir¬ 
cular  orbit.  (In  spite  of  the  fact  that  this  model  has 
very  little  to  do  with  reality,  it  can  be  successfully 
used  to  predict  some  properties  of  the  hydrogen 
atom.)  Bohr’s  first  postulate  says  that  the  angu¬ 
lar  momentum  of  the  electron  must  be  a  multiple 
of  h\ 

mvr=nh ,  (5.2) 

where 

m  =  mass  of  the  electron, 
v  =  speed  of  the  electron, 
r  =  radius  of  the  orbit, 
n  =  the  principal  quantum  number, 
n  =  1 , 2,  3, ... , 
h  —  h/2n, 

h  =  the  Planck  constant. 

The  quantum  mechanical  interpretation  of 
Bohr’s  first  postulate  is  obvious:  the  electron  is 
represented  as  a  standing  wave,  and  the  “length 
of  the  orbit”  must  be  a  multiple  of  the  de  Broglie 
wavelength,  X  =  h/ p  =  h/mv. 

A  charged  particle  in  a  circular  orbit  (and  thus 
in  accelerated  motion)  should  emit  electromag¬ 
netic  radiation,  losing  energy,  were  it  to  obey  the 
rules  of  classical  electrodynamics.  Therefore  our 
electron  should  spiral  down  towards  the  nucleus. 
But  obviously,  Nature  does  not  behave  this  way, 
and  we  have  to  accept  Bohr’s  second  postulate, 
which  says  that  an  electron  moving  in  an  allowed 
orbit  around  a  nucleus  does  not  radiate.  Radia- 
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Fig.  5.4  Transitions  of 
a  hydrogen  atom.  The 
lower  picture  shows  a  part 
of  the  spectrum  of  the  star 
HD  193 182.  On  both  sides 
of  the  stellar  spectrum  we 
see  an  emission  spectrum 
of  iron.  The  wavelengths  of 
the  reference  lines  are 
known,  and  they  can  be 
used  to  calibrate  the 
wavelengths  of  the 
observed  stellar  spectrum. 
The  hydrogen  B  aimer  lines 
are  seen  as  dark  absorption 
lines  converging  towards 
the  B aimer  ionisation  limit 
(also  called  the  B aimer 
discontinuity)  at 
X  =  364.7  nm  to  the  left. 
The  numbers  (15, ... ,  40) 
refer  to  the  quantum 
number  n  of  the  higher 
energy  level.  (Photo  by 
Mt.  Wilson  Observatory) 


364.7  nm  373.5  nm 


B  aimer  H40  H30  H20  HI  5 


discontinuity 


tion  is  emitted  only  when  the  electron  jumps  from 
a  higher  energy  state  to  a  lower  one.  The  emitted 
quantum  has  an  energy  hv ,  equal  to  the  energy 
difference  of  these  states: 

hv  —  En2  En  i .  (5.3) 

We  shall  now  try  to  find  the  energy  of  an  electron 
in  the  state  En.  Coulomb’s  law  gives  the  force 
pulling  the  electron  towards  the  proton: 


The  acceleration  of  a  particle  moving  in  a  cir¬ 
cular  orbit  of  radius  rn  is 


a  = 


and  applying  Newton’s  second  law  (F  =  m a),  we 
get 


mv2  1  e 2 
rn  An€f)  r2  ' 


(5.5) 


1  e2 

F=  - - 

4jt€o  r~ 

where 

=  the  vacuum  permittivity 


From  (5.2)  and  (5.5)  i 

(5.4) 

e 2  1 

n  47T6o  hn 

The  total  energy  of  a 
now 


follows  that 

47T6o^2  o 
rn  =  y  77  . 
mez 

electron  in  the  orbit  n  is 


=  8.85  x  10-12  N-1  m_2C2, 
e  =  the  charge  of  the  electron  =  1.6  x  10“ 19  C, 


En  — 


1  9 

T  +  V  =  -mv; 

2 


1 


me 


4 


1 


4jt€o  rn 

1 


32jr262fr2  n2 


=  -C 


n 


2  ’ 


rn  =  the  distance  between  the  electron  and  the 
proton. 


(5.6) 
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where  C  is  a  constant.  For  the  ground  state  ( n  = 
1),  we  get  from  (5.6) 


Ei  =  -2.18  x  10"18  J  =  -13.6  eV. 

From  (5.3)  and  (5.6)  we  get  the  energy  of  the 
quantum  emitted  in  the  transition  Eni  — >►  Eni: 


In  terms  of  the  wavelength  A  this  can  be  ex¬ 
pressed  as 


(5.8) 

where  R  is  the  Rydberg  constant ,  R  =  1.097  x 
107  m-1. 

Equation  (5.8)  was  derived  experimentally  for 
n\  =  2  by  Johann  Jakob  B  aimer  as  early  as  1885. 
That  is  why  we  call  the  set  of  lines  produced  by 
transitions  En  — >►  E2  the  B aimer  series .  These 
lines  are  in  the  visible  part  of  the  spectrum.  For 
historical  reasons  the  B  aimer  lines  are  often  de¬ 
noted  by  symbols  Ha,  H^,  Hy  etc.  If  the  elec¬ 
tron  returns  to  its  ground  state  (En  ->  E 1),  we 
get  the  Lyman  series ,  which  is  in  the  ultravio¬ 
let.  The  other  series  with  specific  names  are  the 
Paschen  series  (n\  =  3),  Bracket  series  (n\  =4) 
and  Pfund  series  (n\  =5)  (see  Fig.  5.4). 


5.3  Line  Profiles 

The  previous  discussion  suggests  that  spectral 
lines  would  be  infinitely  narrow  and  sharp.  In  re¬ 
ality,  however,  they  are  somewhat  broadened.  We 
will  now  consider  briefly  the  factors  affecting  the 
shape  of  a  spectral  line,  called  a  line  profile.  An 
exact  treatment  would  take  us  too  deep  into  quan¬ 
tum  mechanics,  so  we  cannot  go  into  the  details 
here. 

According  to  quantum  mechanics  everything 
cannot  be  measured  accurately  at  the  same  time. 
For  example,  even  in  principle,  there  is  no  way 
to  determine  the  x  coordinate  and  the  momen¬ 
tum  px  in  the  direction  of  the  x  axis  with  arbi¬ 
trary  precision  simultaneously.  These  quantities 


Fig.  5.5  Each  spectral  line  has  its  characteristic  natural 
width  (solid  line).  Motions  of  particles  broaden  the  line 
further  due  to  the  Doppler  effect,  resulting  in  the  Voigt 
profile  (dashed  line).  Both  profiles  have  the  same  area 


have  small  uncertainties  Av  and  A px,  such  that 

AxApx  ~  h. 

A  similar  relation  holds  for  other  directions,  too. 
Time  and  energy  are  also  connected  by  an  uncer¬ 
tainty  relation, 


AEAt  ss  h. 

The  natural  width  of  spectral  lines  is  a  conse¬ 
quence  of  this  Heisenberg  uncertainty  principle. 

If  the  average  lifetime  of  an  excitation  state 
is  T,  the  energy  corresponding  to  the  transi¬ 
tion  can  only  be  determined  with  an  accuracy 
of  A E  =  h/T  =  h/(2nT).  From  (5.1)  it  follows 
that  Av  =  A E/h.  In  fact,  the  uncertainty  of  the 
energy  depends  on  the  lifetimes  of  both  the  ini¬ 
tial  and  final  states.  The  natural  width  of  a  line  is 
defined  as 


AE{  +  AEf 
h 


1  1 

Ti  +  T{ 


(5.9) 


It  can  be  shown  that  the  corresponding  line  profile 
is 


y_ _ ^0 _ 

2n  (v  -  vq)2  +  y2/ 4’ 


(5.10) 
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where  vo  is  the  frequency  at  the  centre  of  the  line 
and  Io  the  total  intensity  of  the  line.  At  the  centre 
of  the  line  the  intensity  per  frequency  unit  is 

2 

A>o  —  An 

ny 

and  at  the  frequency  v  =  vo  +  y  /  2, 

1  _  1 

ho+y/2  ~  ——  h)  ~  -Iv0' 

Tty  z 

Thus  the  width  y  is  the  width  of  the  line  profile 
at  a  depth  where  the  intensity  is  half  of  the  max¬ 
imum.  This  is  called  the  full  width  at  half  maxi¬ 
mum  (FWHM). 

Doppler  Broadening  Atoms  of  a  gas  are  mov¬ 
ing  the  faster  the  higher  the  temperature  of  the 
gas.  Thus  spectral  lines  arising  from  individual 
atoms  are  shifted  by  the  Doppler  effect.  The  ob¬ 
served  line  consists  of  a  collection  of  lines  with 
different  Doppler  shifts,  and  the  shape  of  the  line 
depends  on  the  number  of  atoms  with  different 
velocities. 

Each  Doppler  shifted  line  has  its  character¬ 
istic  natural  width.  The  resulting  line  profile  is 
obtained  by  giving  each  Doppler  shifted  line 
a  weight  proportional  to  the  number  of  atoms 
given  by  the  velocity  distribution  and  integrat¬ 
ing  over  all  velocities.  This  gives  rise  to  the  Voigt 
profile  (Fig.  5.5),  which  already  describes  most 
spectral  lines  quite  well.  The  shapes  of  different 
profiles  don’t  seem  very  different;  the  most  ob¬ 
vious  consequence  of  the  broadening  is  that  the 
maximum  depth  decreases. 

One  way  to  describe  the  width  of  a  line  is  to 
give  its  full  width  at  half  maximum  (Fig.  5.6). 
Due  to  Doppler  broadening  this  is  usually  greater 
than  the  natural  width.  The  equivalent  width  is 
another  measure  of  a  line  strength.  It  is  the  area 
of  a  rectangular  line  that  has  the  same  area  as 
the  line  profile  and  that  emits  no  light  at  all.  The 
equivalent  width  can  be  used  to  describe  the  en¬ 
ergy  corresponding  to  a  line  independently  of  the 
shape  of  the  line  profile. 


Fig.  5.6  The  full  width  at  half  maximum  (FWHM)  of 
a  spectral  line  is  the  width  at  the  depth  where  the  inten¬ 
sity  is  half  of  the  maximum.  The  equivalent  width  W  is 
defined  so  that  the  line  and  the  shaded  rectangle  have  the 
same  area  in  the  picture.  The  two  measures  are  not  gen¬ 
erally  the  same,  although  they  usually  are  close  to  each 
other 


5.4  Quantum  Numbers,  Selection 
Rules,  Population  Numbers 

Quantum  Numbers  The  Bohr  model  needs 
only  one  quantum  number,  n,  to  describe  all  the 
energy  levels  of  the  electron.  This  can  explain 
only  the  coarse  features  of  an  atom  with  a  single 
electron. 

Quantum  mechanics  describes  the  electron  as 
a  three  dimensional  wave,  which  only  gives  the 
probability  of  finding  the  electron  in  a  certain 
place.  Quantum  mechanics  has  accurately  pre¬ 
dicted  all  the  energy  levels  of  hydrogen  atoms. 
The  energy  levels  of  heavier  atoms  and  molecules 
can  also  be  computed;  however,  such  calculations 
are  very  complicated.  Also  the  existence  of  quan¬ 
tum  numbers  can  be  understood  from  the  quan¬ 
tum  mechanical  point  of  view. 

The  quantum  mechanical  description  involves 
four  quantum  numbers,  one  of  which  is  our  n,  the 
principal  quantum  number.  The  principal  quan¬ 
tum  number  describes  the  quantised  energy  lev¬ 
els  of  the  electron.  The  classical  interpretation  of 
discrete  energy  levels  allows  only  certain  orbits 
given  by  (5.6).  The  orbital  angular  momentum  of 
the  electron  is  also  quantised.  This  is  described  by 
the  angular  momentum  quantum  number  /.  The 
angular  momentum  corresponding  to  a  quantum 
number  /  is 


L  =  y/l(l  +  1  )H. 
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The  classical  analogy  would  be  to  allow  some  el¬ 
liptic  orbits.  The  quantum  number  l  can  take  only 
the  values 


/  =  0,  1, . . . ,  n  —  1. 

For  historical  reasons,  these  are  often  denoted  by 
the  letters  s,  p,  d,  /,  g ,  h ,  /,  j . 

Although  /  determines  the  magnitude  of  the 
angular  momentum,  it  does  not  give  its  direction. 
In  a  magnetic  field  this  direction  is  important, 
since  the  orbiting  electron  also  generates  a  tiny 
magnetic  field.  In  any  experiment,  only  one  com¬ 
ponent  of  the  angular  momentum  can  be  mea¬ 
sured  at  a  time.  In  a  given  direction  z  (e.g.  in  the 
direction  of  the  applied  magnetic  field),  the  pro¬ 
jection  of  the  angular  momentum  can  have  only 
the  values 


Lz=m;h, 

where  ra/  is  the  magnetic  quantum  number 

mi  =  0,  dzl,  d=2, . . . ,  d =/. 

The  magnetic  quantum  number  is  responsible 
for  the  splitting  of  spectral  lines  in  strong  mag¬ 
netic  fields,  known  as  the  Zeeman  effect.  For  ex¬ 
ample,  if  l  =  1,  mi  can  have  21  +  1  =  3  different 
values.  Thus,  the  line  arising  from  the  transition 
l  =  1  l  =  0  will  split  into  three  components  in 
a  magnetic  field  (Fig.  5.7). 

The  fourth  quantum  number  is  the  spin  de¬ 
scribing  the  intrinsic  angular  momentum  of  the 
electron.  The  spin  of  the  electron  is 

S  =  y/s(s  +  T)h, 

where  the  spin  quantum  number  is  a  =  In 
a  given  direction  z,  the  spin  is 

Sz  =  msH, 

where  ms  can  have  one  of  the  two  values: 


1 


All  particles  have  a  spin  quantum  number.  Par¬ 
ticles  with  an  integral  spin  are  called  bosons  (pho¬ 
ton,  mesons);  particles  with  a  half-integral  spin 
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Fig.  5.7  The  Zeeman  effect.  In  strong  magnetic  fields 
each  energy  level  of  a  hydrogen  atom  splits  into  (2/  +  1) 
separate  levels,  which  correspond  to  different  values  of 
the  magnetic  quantum  number  mi  =1,1  —  1,...,—  /.  The 
energy  differences  of  the  successive  levels  have  the  same 
constant  value  A  E.  For  example  the  p  state  (/  =  1)  splits 
into  three  and  the  d  state  (/  =  2)  into  five  sublevels.  The 
selection  rules  require  that  in  electric  dipole  transitions 
A  mi  equals  0  or  ±1,  and  only  nine  different  transitions 
between  p  and  d  states  are  possible.  Moreover,  the  transi¬ 
tions  with  the  same  Am/  have  the  same  energy  difference. 
Thus  the  spectrum  has  only  three  separate  lines 


ar e  fermions  (proton,  neutron,  electron,  neutrino 
etc.). 

Classically,  spin  can  be  interpreted  as  the  ro¬ 
tation  of  a  particle;  this  analogy,  however,  should 
not  be  taken  too  literally. 

The  total  angular  momentum  J  of  an  electron 
is  the  sum  of  its  orbital  and  spin  angular  momen¬ 
tum: 


J  =  L  +  S. 

Depending  on  the  mutual  orientation  of  the  vec¬ 
tors  L  and  S  the  quantum  number  j  of  total  an¬ 
gular  momentum  can  have  one  of  two  possible 
values, 


(except  if  /  =  0,  when  j  =  ^).  The  z  component 
of  the  total  angular  momentum  can  have  the  val¬ 
ues 


mj  =0,  dzl,  d=2,  .  .  .  d=  j. 
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Spin  also  gives  rise  to  the  fine  structure  of 
spectral  lines.  Lines  appear  as  close  pairs  or  dou¬ 
blets. 

Selection  Rules  The  state  of  an  electron  can¬ 
not  change  arbitrarily;  transitions  are  restricted 
by  selection  rules,  which  follow  from  certain  con¬ 
servation  laws.  The  selection  rules  express  how 
the  quantum  numbers  must  change  in  a  transi¬ 
tion.  Most  probable  are  the  electric  dipole  transi¬ 
tions which  make  the  atom  behave  like  an  oscil¬ 
lating  dipole.  The  conservation  laws  require  that 
in  a  transition  we  have 

AZ  =  ±1, 

A  mi  =0,  dzl. 

In  terms  of  the  total  angular  momentum  the  se¬ 
lection  rules  are 


A/  =  ±1, 

A j  =  0,  dzl, 

A mj  =  0,  ±1. 

The  probabilities  of  all  other  transitions  are 
much  smaller,  and  they  are  called  forbidden  tran¬ 
sitions',  examples  are  magnetic  dipole  transitions 
and  all  quadrupole  and  higher  multipole  transi¬ 
tions. 

Spectral  lines  originating  in  forbidden  transi¬ 
tions  are  called  forbidden  lines.  The  probability 
of  such  a  transition  is  so  low  that  under  normal 
circumstances,  the  transition  cannot  take  place 
before  collisions  force  the  electron  to  change 
state.  Forbidden  lines  are  possible  only  if  the  gas 
is  extremely  rarefied  (like  in  auroras  and  plane¬ 
tary  nebulae). 

The  spins  of  an  electron  and  nucleus  of  a  hy¬ 
drogen  atom  can  be  either  parallel  or  antipar¬ 
allel  (Fig.  5.8).  The  energy  of  the  former  state 
is  0.0000059  eV  higher.  But  the  selection  rules 
make  an  electric  dipole  transition  between  these 
states  impossible.  The  transition,  which  is  a  mag¬ 
netic  dipole  transition,  has  a  very  low  probabil¬ 
ity,  A  =  2.8  x  10-15  s_1.  This  means  that  the  av¬ 
erage  lifetime  of  the  higher  state  is  T  =  1/  A  = 
11  x  106  years.  Usually  collisions  change  the 
state  of  the  electron  well  before  this  period  of 
time  has  elapsed.  But  in  interstellar  space  the  den¬ 
sity  of  hydrogen  is  so  low  and  the  total  amount  of 
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Fig.  5.8  The  origin  of  the  hydrogen  21  cm  line.  The  spins 
of  the  electron  and  the  proton  may  be  either  parallel  or 
opposite.  The  energy  of  the  former  state  is  slightly  larger. 
The  wavelength  of  a  photon  corresponding  to  a  transition 
between  these  states  is  21  cm 


hydrogen  so  great  that  a  considerable  number  of 
these  transitions  can  take  place. 

The  wavelength  of  the  radiation  emitted  by 
this  transition  is  21  cm,  which  is  in  the  radio  band 
of  the  spectrum.  Extinction  at  radio  wavelengths 
is  very  small,  and  we  can  observe  more  distant 
objects  than  by  using  optical  wavelengths.  The 
21  cm  radiation  has  been  of  crucial  importance 
for  surveys  of  interstellar  hydrogen. 

Population  Numbers  The  population  num¬ 
ber  ni  of  an  energy  state  i  means  the  number 
of  atoms  in  that  state  per  unit  volume.  In  ther¬ 
mal  equilibrium,  the  population  numbers  obey 
the  Boltzmann  distribution : 

Hi  =8ie-AE/(kT)'  (5.n) 

no  go 

where  T  is  the  temperature,  k  is  the  Boltzmann 
constant,  A E  —  Ei  —  Eo  =  hv  is  the  energy  dif¬ 
ference  between  the  excited  and  ground  state,  and 
gi  is  the  statistical  weight  of  the  level  i  (it  is 
the  number  of  different  states  with  the  same  en¬ 
ergy  E().  The  subscript  0  always  refers  to  the 
ground  state.  Often  the  population  numbers  dif¬ 
fer  from  the  values  given  by  (5.11),  but  still  we 
can  define  an  excitation  temperature  Texc  in  such 
a  way  that  (5.11)  gives  correct  population  num¬ 
bers,  when  T  is  replaced  by  TQXC.  The  excitation 
temperature  may  be  different  for  different  energy 
levels. 
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Fig.  5.9  Spectrum  of  carbon  monoxide  CO  from  430  nm 
to  670  nm.  The  various  bands  correspond  to  different  vi¬ 
brational  transitions.  Each  band  is  composed  of  numerous 
rotational  lines.  Near  the  right  edge  of  each  band  the  lines 


5.5  Molecular  Spectra 

The  energy  levels  of  an  atom  are  determined  by 
its  electrons.  In  the  case  of  a  molecule,  there 
are  many  more  possibilities:  atoms  can  vibrate 
around  their  equilibria  and  the  molecule  can  ro¬ 
tate  around  some  axis.  Both  vibrational  and  rota¬ 
tional  states  are  quantised.  Transitions  between 
successive  vibrational  states  typically  involve 
photons  in  the  infrared  band,  while  transitions  be¬ 
tween  rotational  states  involve  photons  in  the  mi¬ 
crowave  band.  These  combined  with  transitions 
of  electrons  produce  a  band  spectrum,  character¬ 
istic  for  molecules  (Fig.  5.9).  The  spectrum  has 
several  narrow  bands  composed  of  a  great  num¬ 
ber  of  lines. 


5.6  Continuous  Spectra 

We  have  already  mentioned  some  processes  that 
produce  continuous  spectra.  Continuous  emission 
spectra  can  originate  in  recombinations  and  free- 
free  transitions.  In  recombination,  an  atom  cap¬ 
tures  a  free  electron  whose  energy  is  not  quan¬ 
tised;  in  free-free  transitions,  both  initial  and  fi¬ 
nal  states  are  unquantised.  Thus  the  emission  line 
can  have  any  frequency  whatsoever.  Similarly, 
ionisations  and  free-free  transitions  can  give  rise 
to  a  continuous  absorption  spectrum. 

Each  spectrum  contains  a  continuous  compo¬ 
nent,  or  continuum ,  and  spectral  lines.  Some¬ 
times,  however,  the  lines  are  so  closely  packed 
and  so  broad  that  they  seem  to  form  a  nearly  con¬ 
tinuous  spectrum. 

When  the  pressure  of  hot  gas  is  increased,  the 
spectral  lines  begin  to  broaden.  At  high  pressure, 
atoms  bump  into  each  other  more  frequently,  and 
the  close  neighbours  disturb  the  energy  levels. 


are  so  closely  packed  that  they  overlap  and  at  this  res¬ 
olution,  the  spectrum  looks  continuous.  (R.W.B.  Pearse, 
A.G.  Gaydon:  The  Identification  of  Molecular  Spectra 
(Chapman  &  Hall  Ltd.,  London  1976)  p.  394) 

When  the  pressure  is  high  enough,  the  lines  be¬ 
gin  to  overlap.  Thus  the  spectrum  of  hot  gas  at 
high  pressure  is  continuous.  Electric  fields  also 
broaden  spectral  lines  (the  Stark  effect). 

In  liquids  and  solids  the  atoms  are  more 
densely  packed  than  in  gaseous  substances.  Their 
mutual  perturbations  broaden  the  energy  levels, 
producing  a  continuous  spectrum. 


5.7  Blackbody  Radiation 

A  blackbody  is  defined  as  an  object  that  does 
not  reflect  or  scatter  radiation  shining  upon  it, 
but  absorbs  and  re-emits  the  radiation  completely. 
A  blackbody  is  a  kind  of  an  ideal  radiator,  which 
cannot  exist  in  the  real  world.  Yet  many  objects 
behave  very  much  as  if  they  were  blackbodies. 

The  radiation  of  a  blackbody  depends  only  on 
its  temperature,  being  perfectly  independent  of 
its  shape,  material  and  internal  constitution.  The 
wavelength  distribution  of  the  radiation  follows 
Planck’s  law ,  which  is  a  function  of  temperature 
only.  The  intensity  at  a  frequency  v  of  a  black¬ 
body  at  temperature  T  is 

2  hv3  1 

B,.(T)  =  B(kT)  =  -g-  e>,/|tr|  _  l  ■  (5.12) 

where 

_ Q  A 

h  =  the  Planck  constant  =  6. 63  x  10  Js, 

o  _ 1 

c  =  the  speed  of  light  ~3x  10  ms  , 
k  =  the  Boltzmann  constant 
=  1.38  x  10~23  JK-1. 

By  definition  of  the  intensity,  the  dimension  of  Bv 
is  W  m-2  Hz- 1  sterad- 1 . 
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Blackbody  radiation  can  be  produced  in  a 
closed  cavity  whose  walls  absorb  all  radiation  in¬ 
cident  upon  them  (and  coming  from  inside  the 
cavity).  The  walls  and  the  radiation  in  the  cavity 
are  in  equilibrium;  both  are  at  the  same  temper¬ 
ature,  and  the  walls  emit  all  the  energy  they  re¬ 
ceive.  Since  radiation  energy  is  constantly  trans¬ 
formed  into  thermal  energy  of  the  atoms  of  the 
walls  and  back  to  radiation,  the  blackbody  radia¬ 
tion  is  also  called  thermal  radiation. 

The  spectrum  of  a  blackbody  given  by  Planck’s 
law  (5.12)  is  continuous.  This  is  true  if  the  size  of 
the  radiator  is  very  large  compared  with  the  dom¬ 
inant  wavelengths.  In  the  case  of  the  cavity,  this 
can  be  understood  by  considering  the  radiation  as 
standing  waves  trapped  in  the  cavity.  The  num¬ 
ber  of  different  wavelengths  is  larger,  the  shorter 
the  wavelengths  are  compared  with  the  size  of 
the  cavity.  We  already  mentioned  that  spectra  of 
solid  bodies  are  continuous;  very  often  such  spec¬ 
tra  can  be  quite  well  approximated  by  Planck’s 
law. 

We  can  also  write  Planck’s  law  as  a  function  of 
the  wavelength.  We  require  that  Bv  dv  =  —  B\  dX. 
The  wavelength  decreases  with  increasing  fre¬ 
quency;  hence  the  minus  sign.  Since  v  =  c/X,  we 
have 


dv  c 
dX 


(5.13) 


whence 


dv  c 

BA  =  —  By  —  =  By  — 


dX 


A2’ 


(5.14) 


Let  us  now  try  to  find  the  total  intensity  using 
the  first  of  these  integrals: 


B(T)  = 


BV(T)  dv 


2 h  f°°  v3  dv 
C 2  J0  ehv/(kT)  _  1  ’ 


We  now  change  the  integration  variable  to  r  = 
hv/(kT ),  whence  dv  =  (kT / h)  dv: 


B(T)  = 


2 h  k 4  4  r°°  v3  dv 

c2  h4  Jo  ex  —  1 


The  definite  integral  in  this  expression  is  just 
a  real  number,  independent  of  the  temperature. 
Thus  we  find  that 


B(T)  =  AT4,  (5.16) 


where  the  constant  A  has  the  value 

2k4  7 r4 

A=CPJ5' 


(5.17) 


(In  order  to  get  the  value  of  A  we  have  to  eval¬ 
uate  the  integral.  There  is  no  elementary  way  to 
do  that.  We  can  tell  those  who  are  familiar  with 
all  the  exotic  functions  so  beloved  by  theoreti¬ 
cal  physicists,  that  the  integral  can  rather  easily 
be  expressed  as  L(4)£(4),  where  f  is  the  Rie- 
mann  zeta  function  and  r  is  the  gamma  func¬ 
tion.  For  integral  values,  r(n)  is  simply  the  fac¬ 
torial  ( n  —  1)!.  The  difficult  part  is  showing  that 
£(4)  =  7r4/90.  This  can  be  done  by  expanding 
x4  —  x2  as  a  Fourier- series  and  evaluating  the  se¬ 
ries  at  v  =  7T.) 

The  flux  density  F  for  isotropic  radiation  of 
intensity  B  is  (Sect.  4.1): 


or 


F  =  ttB 


Bx(T)  = 


2  hcx 

u 


[B} J  =  Wm 


-2 


1 

Qhc/(kkT)  _  j  ’ 

m-1  sterad-1. 


(5.15) 


The  functions  Bv  and  B\  are  defined  in  such 
a  way  that  the  total  intensity  can  be  obtained  in 
the  same  way  using  either  of  them: 


B(T)  = 


Bv  dv 


Bx  dX. 


or 

F=oTa.  (5.18) 

This  is  the  Stefan-Boltzmann  law ,  and  the  con¬ 
stant  o  (=  it  A)  is  the  Stefan-Boltzmann  constant , 

a  =5.61  x  10“8  Wm“2K“4. 

From  the  Stefan-Boltzmann  law  we  get  a  re¬ 
lation  between  the  luminosity  and  temperature  of 
a  star.  If  the  radius  of  the  star  is  R,  its  surface 
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area  is  4tzR2,  and  if  the  flux  density  on  the  sur¬ 
face  is  F,  we  have 

L  =  4ttR2F. 

If  the  star  is  assumed  to  radiate  like  a  blackbody, 
we  have  F  =  a  T4,  which  gives 

L=AncrR2T4.  (5.19) 

In  fact  this  defines  the  effective  temperature  of  the 
star,  discussed  in  more  detail  in  the  next  section. 

The  luminosity,  radius  and  temperature  of 
a  star  are  interdependent  quantities,  as  we  can  see 
from  (5.19).  They  are  also  related  to  the  absolute 
bolometric  magnitude  of  the  star.  Equation  (4.13) 
gives  the  difference  of  the  absolute  bolometric 
magnitude  of  the  star  and  the  Sun: 

A/bol  M\ boi,© =  2.5  lg  —  .  (5.20) 

Lq 

But  we  can  now  use  (5.19)  to  express  the  lumi¬ 
nosities  in  terms  of  the  radii  and  temperatures: 

Mbol-Mboi,o  =  -51g^--101g^.  (5.21) 

Kq 


Fig.  5.10  Intensity  distributions  of  blackbodies  at  tem¬ 
perature  12,000  K,  9000  K  and  6000  K.  Since  the  ratios 
of  the  temperatures  are  4:3:2,  the  wavelengths  of  inten¬ 
sity  maxima  given  by  the  Wien  displacement  law  are  in 
the  proportions  1:4,  1:3  and  1:2,  or  3,  4  and  6.  The  ac¬ 
tual  wavelengths  of  the  maxima  are  241.5  nm,  322  nm  and 
483  nm.  The  total  intensities  or  the  areas  below  the  curves 
are  proportional  to  44,  34  and  24 

approximated  by  simpler  expressions.  When  X  ~ 
A, max  (or  he /(XkT)  1),  we  have 


As  we  can  see  in  Fig.  5.10,  the  wavelength  of 
the  maximum  intensity  decreases  with  increas¬ 
ing  total  intensity  (equal  to  the  area  below  the 
curve).  We  can  find  the  wavelength  Amax  corre¬ 
sponding  to  the  maximum  intensity  by  differen¬ 
tiating  Planck’s  function  B\(T )  with  respect  to  X 
and  finding  zero  of  the  derivative.  The  result  is 
the  Wien  displacement  law : 


ofc/ (XkT)  ^  ^ 

In  this  case  we  get  the  Wien  approximation 

B\(T)  SS  7^Q-hcl(kkT) _  (5.23) 

X ^ 

When  he /{XkT)  <<C  1  (X  XmSLX ),  we  have 


XmSLXT  =  b  =  const,  (5.22) 

where  the  Wien  displacement  constant  b  is 

b  =  0.0028978  Km. 

We  can  use  the  same  procedure  to  find  the 
maximum  of  Bv .  But  the  frequency  vmax  thus  ob¬ 
tained  is  different  from  vmax  =  c/XmdLX  given  by 
(5.22).  The  reason  for  this  is  the  fact  that  the 
intensities  are  given  per  unit  frequency  or  unit 
wavelength,  and  the  dependence  of  frequency  on 
wavelength  is  nonlinear. 

When  the  wavelength  is  near  the  maximum  or 
much  longer  than  kmax  Planck’s  function  can  be 


ehc/XkT  %  1  +  he/ (XkT), 

which  gives  the  Rayleigh- Jeans  approximation 

2hc2  XkT  2 ckT 

Bx(T)  ss  — ; - =  — (5.24) 

X5  he  A4 

This  is  particularly  useful  in  radio  astronomy. 

Classical  physics  predicted  only  the  Rayleigh- 
Jeans  approximation.  Were  (5.24)  true  for  all 
wavelengths,  the  intensity  would  grow  beyond 
all  limits  when  the  wavelength  approaches  zero, 
contrary  to  observations.  This  contradiction  was 
known  as  the  ultraviolet  catastrophe. 
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5.8  Temperatures 

Temperatures  of  astronomical  objects  range  from 
almost  absolute  zero  to  millions  of  degrees.  Tem¬ 
perature  can  be  defined  in  a  variety  of  ways,  and 
its  numerical  value  depends  on  the  specific  def¬ 
inition  used.  All  these  different  temperatures  are 
needed  to  describe  different  physical  phenomena, 
and  often  there  is  no  unique  ‘true’  temperature. 

Often  the  temperature  is  determined  by  com¬ 
paring  the  object,  a  star  for  instance,  with  a  black- 
body.  Although  real  stars  do  not  radiate  exactly 
like  blackbodies,  their  spectra  can  usually  be  ap¬ 
proximated  by  blackbody  spectra  after  the  effect 
of  spectral  lines  has  been  eliminated.  The  result¬ 
ing  temperature  depends  on  the  exact  criterion 
used  to  fit  Planck’s  function  to  observations. 

The  most  important  quantity  describing  the 
surface  temperature  of  a  star  is  the  effective  tem¬ 
perature  Tq.  It  is  defined  as  the  temperature  of 
a  blackbody  which  radiates  with  the  same  total 
flux  density  as  the  star.  Since  the  effective  temper¬ 
ature  depends  only  on  the  total  radiation  power 
integrated  over  all  frequencies,  it  is  well  defined 
for  all  energy  distributions  even  if  they  deviate  far 
from  Planck’s  law. 

In  the  previous  section  we  derived  the  Stefan- 
Boltzmann  law,  which  gives  the  total  flux  den¬ 
sity  as  a  function  of  the  temperature.  If  we  now 
find  a  value  TQ  of  the  temperature  such  that  the 
Stefan-Boltzmann  law  gives  the  correct  flux  den¬ 
sity  F  on  the  surface  of  the  star,  we  have  found 
the  effective  temperature.  The  flux  density  on  the 
surface  is 

F  =  oT*.  (5.25) 

The  total  flux  is  L  =  4it  RrF ,  where  R  is  the  ra- 
dius  of  the  star,  and  the  flux  density  at  a  distance  r 
is 


(5.26) 


where  a  =  2R/r  is  the  observed  angular  diame¬ 
ter  of  the  star.  For  direct  determination  of  the  ef¬ 
fective  temperature,  we  have  to  measure  the  total 
flux  density  and  the  angular  diameter  of  the  star. 
This  is  possible  only  in  the  few  cases  in  which  the 
diameter  has  been  found  by  interferometry. 


If  we  assume  that  at  some  wavelength  A  the 
flux  density  F\  on  the  surface  of  the  star  is  ob¬ 
tained  from  Planck’s  law,  we  get  the  brightness 
temperature  7b .  In  the  isotropic  case  we  have  then 
Fx  =  nBx(T\f).  If  the  radius  of  the  star  is  R  and 
distance  from  the  Earth  r,  the  observed  flux  den¬ 
sity  is 


Again  F\  can  be  determined  only  if  the  angu¬ 
lar  diameter  a  is  known.  The  brightness  tempera¬ 
ture  7b  can  then  be  solved  from 


2 

7tBx(Tb). 


(5.27) 


Since  the  star  does  not  radiate  like  a  blackbody, 
its  brightness  temperature  depends  on  the  partic¬ 
ular  wavelength  used  in  (5.27). 

In  radio  astronomy,  brightness  temperature  is 
used  to  express  the  intensity  (or  surface  bright¬ 
ness)  of  the  source.  If  the  intensity  at  frequency  v 
is  7V ,  the  brightness  temperature  is  obtained  from 


Iv  —  Fv{T\f). 


7b  gives  the  temperature  of  a  blackbody  with  the 
same  surface  brightness  as  the  observed  source. 

Since  radio  wavelengths  are  very  long,  the 
condition  hv  <^kT  of  the  Rayleigh- Jeans  ap¬ 
proximation  is  usually  satisfied  (except  for  mil¬ 
limetre  and  submillimetre  bands),  and  we  can 
write  Planck’s  law  as 

D  FT.  \  2/zv3  1 

Bv(Tb)  =  ~2  ehv/(kTb)  _  l 

_  2 hv3  1 

_  ~cr  l+/nV(£7b)  + - 1 

2  kv2 

~  — 2 
cz 

Thus  we  get  the  following  expression  for  the  ra¬ 
dio  astronomical  brightness  temperature: 


(5.28) 


A  measure  of  the  signal  registered  by  a  radio 
telescope  is  the  antenna  temperature  T\.  After 
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Fig.  5.1 1  Determination  of  the  colour  temperature.  The 
ratio  of  the  flux  densities  at  wavelengths  X\  and  X2  gives 
the  temperature  of  a  blackbody  with  the  same  ratio.  In  gen¬ 
eral  the  result  depends  on  the  wavelengths  chosen 


the  antenna  temperature  is  measured,  we  get  the 
brightness  temperature  from 


TA  =  rjTb,  (5.29) 

where  77  is  the  beam  efficiency  of  the  antenna 
(typically  0.4<77<0.8).  Equation  (5.29)  holds 
if  the  source  is  wide  enough  to  cover  the  whole 
beam,  i.e.  the  solid  angle  £2 a  from  which  the  an¬ 
tenna  receives  radiation.  If  the  solid  angle  sub¬ 
tended  by  the  source,  £2s,  is  smaller  than  the 
observed  antenna  temperature  is 

TA  =  r,yfTh  (f?s  <  &a)-  (5.30) 

The  colour  temperature  Tc  can  be  determined 
even  if  the  angular  diameter  of  the  source  is  un¬ 
known  (Fig.  5.11).  We  only  have  to  know  the 
relative  energy  distribution  in  some  wavelength 
range  [Ai ,  A2] ;  the  absolute  value  of  the  flux  is  not 
needed.  The  observed  flux  density  as  a  function 
of  wavelength  is  compared  with  Planck’s  func¬ 
tion  at  different  temperatures.  The  temperature 
giving  the  best  fit  is  the  colour  temperature  in  the 
interval  [Ai,  A.2].  The  colour  temperature  is  usu¬ 
ally  different  for  different  wavelength  intervals, 
since  the  shape  of  the  observed  energy  distribu¬ 
tion  may  be  quite  different  from  the  blackbody 
spectrum. 

A  simple  method  for  finding  a  colour  tem¬ 
perature  is  the  following.  We  measure  the  flux 
density  Ffx  at  two  wavelengths  X\  and  A2.  If 
we  assume  that  the  intensity  distribution  follows 
Planck’s  law,  the  ratio  of  these  flux  densities  must 


be  the  same  as  the  ratio  obtained  from  Planck’s 
law: 

KST)  Bkl{T)  eyyym-i 

F'k2(T)  BXl(T)  *5  eAc/(W)  -  r  <  • 

The  temperature  T  solved  from  this  equation  is 
a  colour  temperature. 

The  observed  flux  densities  correspond  to  cer¬ 
tain  magnitudes  m\x  and  m\2.  The  definition  of 
magnitudes  gives 


mx  1  -  mx2  =  -2.5  lg  —  +  const, 

Fx2 

where  the  constant  term  is  a  consequence  of  the 
different  zero  points  of  the  magnitude  scales.  If 
the  temperature  is  not  too  high,  we  can  use  the 
Wien  approximation  in  the  optical  part  of  the 
spectrum: 


BX] 

mx  1  —  mx2  =  —2.5  lg  — - f  const 


B 


x2 


=  -2.51g 


A2 


+  2.5 


Ai 

he  f  1 


1 


kT  \ Ai  A2 


lg  e  +  const. 


This  can  be  written  as 


mx  1  -  mx2  =  a  +  b/Tc,  (5.32) 

where  a  and  b  are  constants.  This  shows  that 
there  is  a  simple  relationship  between  the  differ¬ 
ence  of  two  magnitudes  and  the  colour  tempera¬ 
ture. 

Strictly  speaking,  the  magnitudes  in  (5.32)  are 
monochromatic,  but  the  same  relation  can  be  also 
used  with  broadband  magnitudes  like  B  and  V .  In 
that  case,  the  two  wavelengths  are  essentially  the 
effective  wavelengths  of  the  B  and  V  bands.  The 
constant  is  chosen  so  that  B  —  V  =  0  for  stars 
of  the  spectral  type  AO  (see  Chap.  9).  Thus  the 
colour  index  B  —  V  also  gives  a  colour  tempera¬ 
ture. 

The  kinetic  temperature  7k,  is  related  to  the 
average  speed  of  gas  molecules.  The  kinetic  en¬ 
ergy  of  an  ideal  gas  molecule  as  a  function  of 
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temperature  follows  from  the  kinetic  gas  theory: 


1  2 

Kinetic  energy  =  -mv 

2> 


Solving  for  7k  we  get 


mv2 


(5.33) 


where  m  is  the  mass  of  the  molecule,  v  its  average 
velocity  (or  rather  its  r.m.s  velocity,  which  means 
that  v2  is  the  average  of  the  squared  velocities), 
and  k ,  the  Boltzmann  constant.  For  ideal  gases 
the  pressure  is  directly  proportional  to  the  kinetic 
temperature  (cf.  Box  11.1): 


Excitation  state 


Metastable 

state 


Ground  state 


P  =  nkTk ,  (5.34) 

where  n  is  the  number  density  of  the  molecules 
(molecules  per  unit  volume).  We  previously  de¬ 
fined  the  excitation  temperature  TQXC  as  a  tem¬ 
perature  which,  if  substituted  into  the  Boltzmann 
distribution  (5.11),  gives  the  observed  population 
numbers.  If  the  distribution  of  atoms  in  different 
levels  is  a  result  of  mutual  collisions  of  the  atoms 
only,  the  excitation  temperature  equals  the  kinetic 
temperature,  Texc  =  Tk. 

The  ionisation  temperature  7j  is  found  by 
comparing  the  number  of  atoms  in  different  states 
of  ionisation.  Since  stars  are  not  exactly  black- 
bodies,  the  values  of  excitation  and  ionisation 
temperatures  usually  vary,  depending  on  the  el¬ 
ement  whose  spectral  lines  were  used  for  temper¬ 
ature  determination. 

In  thermodynamic  equilibrium  all  these  vari¬ 
ous  temperatures  are  equal. 


5.9  Other  Radiation  Mechanisms 

The  radiation  of  a  gas  in  thermodynamic  equi¬ 
librium  depends  on  the  temperature  and  den¬ 
sity  only.  In  astrophysical  objects  deviations  from 
thermodynamic  equilibrium  are,  however,  quite 
common.  Some  examples  of  non-thermal  radia¬ 
tion  arising  under  such  conditions  are  mentioned 
in  the  following. 

Maser  and  Laser  (See  Fig.  5.12.)  The  Boltz¬ 
mann  distribution  (5.11)  shows  that  usually  there 


Fig.  5.12  The  operational  principle  of  the  maser  and  the 
laser.  A  metastable  state  (a  state  with  a  relatively  long  av¬ 
erage  lifetime)  stores  atoms  where  they  accumulate;  there 
are  more  atoms  in  the  metastable  state  than  in  the  ground 
state.  This  population  inversion  is  maintained  by  radia- 
tively  exciting  atoms  to  a  higher  excitation  state  (“pump¬ 
ing”),  from  which  they  spontaneously  jump  down  to  the 
metastable  state.  When  the  atoms  are  illuminated  by  pho¬ 
tons  with  energies  equal  to  the  excitation  energy  of  the 
metastable  state,  the  photons  will  induce  more  radiation 
of  the  same  wavelength,  and  the  radiation  is  amplified  in 
geometric  progression 


are  fewer  atoms  in  excited  states  than  in  the 
ground  state.  There  are,  however,  means  to  pro¬ 
duce  a  population  inversion ,  an  excited  state  con¬ 
taining  more  atoms  than  the  ground  state.  This 
inversion  is  essential  for  both  the  maser  and  the 
laser  (Microwave/Light  Amplification  by  Stimu¬ 
lated  Emission  of  Radiation).  If  the  excited  atoms 
are  now  illuminated  with  photons  having  ener¬ 
gies  equal  to  the  excitation  energy,  the  radia¬ 
tion  will  induce  downward  transitions.  The  num¬ 
ber  of  photons  emitted  greatly  exceeds  the  num¬ 
ber  of  absorbed  photons,  and  radiation  is  ampli¬ 
fied.  Typically  the  excited  state  is  a  metastable 
state ,  a  state  with  a  very  long  average  lifetime, 
which  means  that  the  contribution  of  spontaneous 
emission  is  negligible.  Therefore  the  resulting 
radiation  is  coherent  and  monochromatic.  Sev¬ 
eral  maser  sources  have  been  found  in  interstel¬ 
lar  molecular  clouds  and  dust  envelopes  around 
stars. 
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Synchrotron  Radiation  A  free  charge  in  ac¬ 
celerated  motion  will  emit  electromagnetic  radi¬ 
ation.  Charged  particles  moving  in  a  magnetic 
field  follow  helices  around  the  field  lines.  As  seen 
from  the  direction  of  the  field,  the  motion  is  circu¬ 
lar  and  therefore  accelerated.  The  moving  charge 
will  radiate  in  the  direction  of  its  velocity  vector. 
Such  radiation  is  called  synchrotron  radiation.  It 
will  be  further  discussed  in  Chap.  16. 


5.1 0  Radiative  Transfer 

Propagation  of  radiation  in  a  medium,  also  called 
radiative  transfer,  is  one  of  the  basic  problems  of 
astrophysics.  The  subject  is  too  complicated  to 
be  discussed  here  in  any  detail.  The  fundamental 
equation  of  radiative  transfer  is,  however,  easily 
derived. 

Assume  we  have  a  small  cylinder,  the  bottom 
of  which  has  an  area  dA  and  the  length  of  which 
is  dr .  Let  Iv  be  the  intensity  of  radiation  perpen¬ 
dicular  to  the  bottom  surface  going  into  a  solid 
angle  d co  ([/v]  =  Wm-2  Hz-1  sterad-1).  If  the 
intensity  changes  by  an  amount  d Iv  in  the  dis¬ 
tance  dr,  the  energy  changes  by 

d  E  =  d/v  dA  dv  d  to  d  t 

in  the  cylinder  in  time  d t.  This  equals  the  emis¬ 
sion  minus  absorption  in  the  cylinder.  The  ab¬ 
sorbed  energy  is  (cf.  (4.14)) 


or 


d  / 


V 


—  —Iv  + 


Jv 


(5.37) 


av  dr  av 

We  shall  denote  the  ratio  of  the  emission  coeffi¬ 
cient  jv  to  the  absorption  coefficient  or  opacity  av 
by  Sv: 


Sv  =  —.  (5.38) 

Otv 

Sv  is  called  the  source  function.  Because  av  dr  = 
drv,  where  ry  is  the  optical  thickness  at  fre¬ 
quency  v,  (5.37)  can  be  written  as 


—  —Iv  +  Sv 


(5.39) 


Equation  (5.39)  is  the  basic  equation  of  radia¬ 
tive  transfer.  Without  solving  the  equation,  we  see 
that  if  Iv  <  Sv,  then  dlv/drv  >  0,  and  the  inten¬ 
sity  tends  to  increase  in  the  direction  of  propa¬ 
gation.  And,  if  Iv  >  Sv ,  then  dlv/drv  <  0,  and 
Iv  will  decrease.  In  an  equilibrium  the  emitted 
and  absorbed  energies  are  equal,  in  which  case 
we  find  from  (5.35)  and  (5.36) 


Iy  -  jv/Wy  -  Sy  .  (5.40) 

Substituting  this  into  (5.39),  we  see  that  d/v/drv 
=  0.  In  thermodynamic  equilibrium  the  radiation 
of  the  medium  is  blackbody  radiation,  and  the 
source  function  is  given  by  Planck’s  law: 


dEabs  =  ctyly  dr  dA  dv  dcodt,  (5.35) 

where  c^y  is  the  opacity  of  the  medium  at  fre¬ 
quency  v.  Let  the  amount  of  energy  emitted  per 
hertz  at  frequency  v  into  unit  solid  angle  from 
unit  volume  and  per  unit  time  be  jv  ([jv]  = 
Wm-3  Hz-1  sterad-1).  This  is  called  the  emis- 
sion  coefficient  of  the  medium.  The  energy  emit¬ 
ted  into  solid  angle  d co  from  the  cylinder  is  then 

d£em  =  jv  dr  d A  dv  d co  dt.  (5.36) 

The  equation 


d E  —  dEabs  T  dEem 


gives  then 


dlv  =  -avIvdr  +  jvdr 


Sy=By(T)  = 


2  hv3  1 

q2  oJiv/ikT)  _  j  * 


Even  if  the  system  is  not  in  thermodynamic  equi¬ 
librium,  it  may  be  possible  to  find  an  excitation 
temperature  Texc  such  that  Bv(Texc)  =  Sv.  This 
temperature  may  depend  on  frequency. 

A  formal  solution  of  (5.39)  is 

7v(rv)  =  /v(0)e-r“+  f  V(r”-f)Sv(t)  dt. 

Jo 

(5-41) 

Here  Iv  (0)  is  the  intensity  of  the  background  radi¬ 
ation,  coming  through  the  medium  (e.g.  an  inter¬ 
stellar  cloud)  and  decaying  exponentially  in  the 
medium.  The  second  term  gives  the  emission  in 
the  medium.  The  solution  is  only  formal,  since  in 
general,  the  source  function  Sv  is  unknown  and 
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must  be  solved  simultaneously  with  the  intensity. 
If  Sv(tv)  is  constant  in  the  cloud  and  the  back¬ 
ground  radiation  is  ignored,  we  get 

Iv(tv)  =  Sv  f  V(w)df  =  Sy(l-e-M. 

Jo 

(5.42) 

If  the  cloud  is  optically  thick  (rv  1),  we  have 

Iv  =  Sv,  (5.43) 

i.e.  the  intensity  equals  the  source  function,  and 
the  emission  and  absorption  processes  are  in 
equilibrium. 

An  important  field  of  application  of  the  theory 
of  radiative  transfer  is  in  the  study  of  planetary 
and  stellar  atmospheres.  In  this  case,  to  a  good 
approximation,  the  properties  of  the  medium  only 
vary  in  one  direction,  say  along  the  z  axis.  The 
intensity  will  then  depend  only  on  z  and  6 ,  where 
6  is  the  angle  between  the  z  axis  and  the  direction 
of  propagation  of  the  radiation. 

In  applications  to  atmospheres  it  is  customary 
to  define  the  optical  depth  rv  in  the  vertical  direc¬ 
tion  as 

d Ty  -  Ciy  6-Z. 

Conventionally  z  increases  upwards  and  the  opti¬ 
cal  depth  inwards  in  the  atmosphere.  The  vertical 
line  element  d z  is  related  to  that  along  the  light 
ray,  dr,  according  to 


This  expression  will  be  used  later  in  Chap.  8  on 
the  interpretation  of  stellar  spectra. 


5.11  Examples 

Example  5.1  Find  the  wavelength  of  the  photon 
emitted  in  the  transition  of  a  hydrogen  atom  from 
n2  =  1 10  to  n\  =  109. 

Equation  (5.8)  gives 

k  «2  / 

=  1.097  x  107  m~ 1  f  — h_ 

\  1092 

=  16.71  m-1, 
whence 

X  =  0.060  m. 

This  is  in  the  radio  band.  Such  radiation  was  ob¬ 
served  for  the  first  time  in  1965  by  an  NRAO  ra¬ 
dio  telescope. 

Example  5.2  The  effective  temperature  of  a  star 
is  12,000  K  and  the  absolute  bolometric  magni¬ 
tude  0.0.  Find  the  radius  of  the  star,  when  the  ef¬ 
fective  temperature  of  the  Sun  is  5000  K  and  the 
absolute  bolometric  magnitude  4.7. 

We  can  apply  (5.21): 


d z  =  dr  cos  6 . 

With  these  notational  conventions,  (5.39)  now 
yields 

dlv(z,0) 

cos  0  =  Iv-sv.  (5.44) 

drv 

This  is  the  form  of  the  equation  of  radiative  trans¬ 
fer  usually  encountered  in  the  study  of  stellar  and 
planetary  atmospheres. 

A  formal  expression  for  the  intensity  emerging 
from  an  atmosphere  can  be  obtained  by  integrat¬ 
ing  (5.44)  from  rv  =  oo  (we  assume  that  the  bot¬ 
tom  of  the  atmosphere  is  at  infinite  optical  depth) 
to  rv  =  0  (corresponding  to  the  top  of  the  atmo¬ 
sphere).  This  yields 


R  T 

Mboi  -  Afbol,©  =  —5  lg  — - 10  lg 


R 


Terr 


Rq 

2 


Tt 


o 


R 


o 


eO  \  1()-0.2(Mbol-MboljO) 
/ 

5800  \  ^  iq— 0.2(0. 0—4. 7) 

12,000  ) 


=  2.0. 


Thus  the  radius  is  twice  the  Solar  radius. 


Example  5.3  Derive  the  Wien  displacement 
laws. 

Let  us  denote  x  =  hc/(XkT).  Planck’s  law 
then  becomes 


/v(O,0) 


— rv  sec  0 


sec  0  drv . 


(5.45) 


2k5  T5  x5 

B\(T) = 


h4c^  eA  —  1 
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For  a  given  temperature,  the  first  factor  is  con¬ 
stant.  Thus,  it  is  sufficient  to  find  the  maximum 
of  the  function  f(x)  =  x5/(e*  —  1). 

First  we  must  evaluate  the  derivative  of  /: 


f\x) 


5x  (ex  —  1)  —  jre 
(e*  -  l)2 


x4ex 


(e*  -  l)2 


(5  —  5e  x  —  x) 


- =  1.701  x  10"11  Ks. 

^max 

Note  that  the  wavelength  corresponding  to  vmax 
is  different  from  Amax.  The  reason  is  that  we  have 
used  two  different  forms  of  Planck’s  function, 
one  giving  the  intensity  per  unit  wavelength,  the 
other  per  unit  frequency. 


By  definition,  v  is  always  strictly  positive. 
Hence  f'(x )  can  be  zero  only  if  the  factor  5  — 
5e-x  —  v  is  zero.  This  equation  cannot  be  solved 
analytically.  Instead  we  write  the  equation  as  v  = 
5  —  5e-x  and  solve  it  by  iteration: 

xq  =  5  (this  is  just  a  guess), 
xi  =  5-  5e“x°  =4.96631, 


Example  5.4  (a)  Find  the  fraction  of  radiation 
that  a  blackbody  emits  in  the  range  [A.1,^2], 
where  Ai  and  A  2  ^  Amax.  (b)  How  much  energy 
does  a  100  W  incandescent  light  bulb  radiate  in 
the  radio  wavelengths,  A  >  1  cm?  Assume  the 
temperature  is  2500  K. 

Since  the  wavelengths  are  much  longer  than 
A  max  we  can  use  the  Rayleigh-Jeans  approxima¬ 
tion  B\(T)  ~  2 ckT /A4.  Then 


*5  =4.96511. 

Thus  the  result  is  x  =  4.965.  The  Wien  displace¬ 
ment  law  is  then 

h  c 

A maxT  =  —  =  b  =  2.898  x  10  Km. 


xk 


In  terms  of  frequency  Planck’s  law  is 


BV(T)  = 


2  hv 


1 


c2  ghv/ikT )  1 


Substituting  x  =  hv / (kT)  we  get 


BV(T) = 


2  k3T3  x: 


h2c2  ex  —  1 


Now  we  study  the  function  f(x)=x3/(ex  —  1) 


3v2(ex  —  1)  —  vJe 
(ex  -  l)2 


/'(*)  = 


xV 


(ex  -  l)2 


^-Se--1  -x). 


This  vanishes,  when  3  —  3e  x  —  x  =  0.  The  solu¬ 
tion  of  this  equation  is  x  =  2.821.  Hence 


B\(T)  dA 


2ckT 


2 ckT  /  1  1  \ 

ivq'ql 


and  hence 


B'  _  5 c3h3  1/1  1  \ 

=  k2ix 4  ^3  vXf _  q)' 


Now  the  temperature  is  T  =  2500  K  and  the 
wavelength  range  [0.01  m,  00),  and  so 


B'  =  100  Wx  1.529  x  1(T7 - T - j 

25003  0.013 


=  9.8  x  KT10  W. 


It  is  quite  difficult  to  listen  to  the  radio  emission 
of  a  light  bulb  with  an  ordinary  radio  receiver. 


Example  5.5  (Determination  of  Effective  Tem¬ 
perature)  The  observed  flux  density  of  Arcturus 
is 

F'  =  4.5  x  1(T8  Wm-2. 


cT 

Hnax 


h  c 

—  =*'  =  5.100  x  KT3  Km 
kx 


Interferometric  measurements  give  an  angular  di¬ 
ameter  of  a  =  0.020".  Thus,  a/2  =  4.85  x  10-8 
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radians.  From  (5.26)  we  get 


4.5  x  10“8 


=  4300  K. 


Exercise  5.2  If  the  transition  of  the  hydrogen 
atom  n  +  1  ->  n  were  to  correspond  to  the  wave¬ 
length  21.05  cm,  what  would  the  quantum  num¬ 
ber  n  be?  The  interstellar  medium  emits  strong 
radiation  at  this  wavelength.  Can  this  radiation  be 
due  to  such  transitions? 


Example  5.6  Flux  densities  at  the  wavelengths 
440  nm  and  550  nm  are  1 .30  and  1 .00  W  m-2  m_  1 , 
respectively.  Find  the  colour  temperature. 

If  the  flux  densities  at  the  wavelengths  X\  and 
A 2  are  F\  and  F2,  respectively,  the  colour  temper¬ 
ature  can  be  solved  from  the  equation 

Fi  _  Bkl(Tc )  _  / \2\5  ch^2kTc)  _  l 
F2  ~  Bk2(Tc )  “  Ci  )  ef'cmkTo)  -  1 ' 

If  we  denote 


£1 /M5 

F2U2/  ’ 


he 
X\k  ’ 


he 
A  2 


we  get  the  equation 

tB2/Tc  _  1 
A=  qBUTc  _  \ 


for  the  colour  temperature  Tc.  This  equation  must 
be  solved  numerically. 

In  our  example  the  constants  have  the  follow¬ 
ing  values: 


1.00  /550\5 
F30  \440 / 


2.348, 


Bi  =32,700  K, 


B2  =  26,160  K. 


Exercise  5.3  The  space  is  filled  with  back¬ 
ground  radiation,  remnant  of  the  early  age  of  the 
universe.  Currently  the  distribution  of  this  radi¬ 
ation  is  similar  to  the  radiation  of  a  blackbody 
at  the  temperature  of  2.7  K.  What  is  Amax  corre¬ 
sponding  to  this  radiation?  What  is  its  total  in¬ 
tensity?  Compare  the  intensity  of  the  background 
radiation  to  the  intensity  of  the  Sun  at  the  visual 
wavelengths. 

Exercise  5.4  The  temperature  of  a  red  giant  is 
T  =  2500  K  and  radius  100  times  the  solar  ra¬ 
dius. 

(a)  Find  the  total  luminosity  of  the  star,  and  the 
luminosity  in  the  visual  band  400  nm  <  A  < 
700  nm. 

(b)  Compare  the  star  with  a  100  W  lamp  that  ra¬ 
diates  5  %  of  its  energy  in  the  visual  band. 
What  is  the  distance  of  the  lamp  if  it  looks  as 
bright  as  the  star? 

Exercise  5.5  The  effective  temperature  of  Sir¬ 
ius  is  10,000  K,  apparent  visual  magnitude  —1.5, 
distance  2.67  kpc  and  bolometric  correction  0.5. 
What  is  the  radius  of  Sirius? 

Exercise  5.6  The  observed  flux  density  of  the 
Sun  at  A  =  300  nm  is  0.59  W  m2  nm-1 .  Find  the 
brightness  temperature  of  the  Sun  at  this  wave¬ 
length. 


By  substituting  different  values  for  Tc,  we  find 
that  Tc  =  7545  K  satisfies  our  equation. 


Exercise  5.7  The  colour  temperature  can  be  de¬ 
termined  from  two  magnitudes  corresponding  to 
two  different  wavelengths.  Show  that 


5.12  Exercises 

Exercise  5.1  Show  that  in  the  Wien  approxima¬ 
tion  the  relative  error  of  B\  is 

_  p.~hc/ (XkT) 

-  v  • 

Bi 


7000  K 

e_  (B-V)  +  0A7' 

The  wavelengths  of  the  B  and  V  bands  are 
440  nm  and  548  nm,  respectively,  and  we  assume 
that  B  =  V  for  stars  of  the  spectral  class  A0,  the 
colour  temperature  of  which  is  about  15,000  K. 
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Exercise  5.8  The  kinetic  temperature  of  the  Find  the  average  speed  of  the  electrons  in  such 
plasma  in  the  solar  corona  can  reach  106  K.  a  plasma. 


Celestial  Mechanics 


6 


Celestial  mechanics,  the  study  of  motions  of  ce¬ 
lestial  bodies,  together  with  spherical  astronomy, 
was  the  main  branch  of  astronomy  until  the  end 
of  the  19th  century,  when  astrophysics  began 
to  evolve  rapidly.  The  primary  task  of  classical 
celestial  mechanics  was  to  explain  and  predict 
the  motions  of  planets  and  their  satellites.  Sev¬ 
eral  empirical  models,  like  epicycles  and  Kepler’s 
laws,  were  employed  to  describe  these  motions. 
But  none  of  these  models  explained  why  the  plan¬ 
ets  moved  the  way  they  did.  It  was  only  in  the 
1680’s  that  a  simple  explanation  was  found  for  all 
these  motions — Newton’s  law  of  universal  gravi¬ 
tation.  In  this  chapter,  we  will  derive  some  prop¬ 
erties  of  orbital  motion.  The  physics  we  need  for 
this  is  simple  indeed,  just  Newton’s  laws.  (For 
a  review,  see  Box  6.1.) 

This  chapter  is  mathematically  slightly  more 
involved  than  the  rest  of  the  book.  We  shall  use 
some  vector  calculus  to  derive  our  results,  which, 
however,  can  be  easily  understood  with  very  el¬ 
ementary  mathematics.  A  summary  of  the  basic 
facts  of  vector  calculus  is  given  in  Appendix  A. 4. 


6.1  Equations  of  Motion 

We  shall  concentrate  on  systems  of  only  two  bod¬ 
ies.  In  fact,  this  is  the  most  complicated  case  that 
allows  a  neat  analytical  solution.  For  simplicity, 
let  us  call  the  bodies  the  Sun  and  a  planet,  al¬ 
though  they  could  quite  as  well  be  a  planet  and 
its  moon,  or  the  two  components  of  a  binary  star. 

Let  the  masses  of  the  two  bodies  be  m  i  and  m2 
and  the  radius  vectors  in  some  fixed  inertial  coor¬ 


Fig.  6.1  The  radius  vectors  of  the  Sun  and  a  planet  in  an 
arbitrary  inertial  frame  are  r  \  and  r 2,  and  r  —  —  r  \  is 

the  position  of  the  planet  relative  to  the  Sun 


dinate  frame  r  1  and  r2  (Fig.  6.1).  The  position 
of  the  planet  relative  to  the  Sun  is  denoted  by 
r  =  r2  —  r\ .  According  to  Newton’s  law  of  gravi¬ 
tation  the  planet  feels  a  gravitational  pull  propor¬ 
tional  to  the  masses  mi  and  m2  and  inversely  pro¬ 
portional  to  the  square  of  the  distance  r.  Since  the 
force  is  directed  towards  the  Sun,  it  can  be  ex¬ 
pressed  as 


Gm  1  m2  —  r 

y2  y 


r 

—  Gm  1  m2  —7 , 

r$ 


(6.1) 


where  G  is  the  gravitational  constant .  (More 
about  this  in  Sect.  6.5.) 

Newton’s  second  law  tells  us  that  the  accelera¬ 
tion  r2  of  the  planet  is  proportional  to  the  applied 
force: 


F=m2r2 •  (6.2) 

Combining  (6.1)  and  (6.2),  we  get  the  equation  of 
motion  of  the  planet 

r 

m2r2  =  —  Gm  \  m2  —7 .  (6.3) 

r 3 
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Since  the  Sun  feels  the  same  gravitational  pull, 
but  in  the  opposite  direction,  we  can  immediately 
write  the  equation  of  motion  of  the  Sun: 

r 

m\'r\  =  +Gm  i  m2  —7 .  (6.4) 

r5 

We  are  mainly  interested  in  the  relative  mo¬ 
tion  of  the  planet  with  respect  to  the  Sun.  To  find 
the  equation  of  the  relative  orbit,  we  cancel  the 
masses  appearing  on  both  sides  of  (6.3)  and  (6.4), 
and  subtract  (6.4)  from  (6.3)  to  get 

r 

r  =  —/i  —  ,  (6.5) 

where  we  have  denoted 

/x  =  G(m\  +  m2).  (6.6) 

The  solution  of  (6.5)  now  gives  the  relative  or¬ 
bit  of  the  planet.  The  equation  involves  the  radius 
vector  and  its  second  time  derivative.  In  princi¬ 
ple,  the  solution  should  yield  the  radius  vector  as 
a  function  of  time,  r  =  r(t).  Unfortunately  things 
are  not  this  simple  in  practice;  in  fact,  there  is 
no  way  to  express  the  radius  vector  as  a  function 
of  time  in  a  closed  form  (i.e.  as  a  finite  expres¬ 
sion  involving  familiar  elementary  functions).  Al¬ 
though  there  are  several  ways  to  solve  the  equa¬ 
tion  of  motion,  we  must  resort  to  mathematical 
manipulation  in  one  form  or  another  to  figure  out 
the  essential  properties  of  the  orbit.  Next  we  shall 
study  one  possible  method. 


6.2  Solution  of  the  Equation  of 
Motion 

The  equation  of  motion  (6.5)  is  a  second-order 
(i.e.  contains  second  derivatives)  vector  valued 
differential  equation.  Therefore  we  need  six  in¬ 
tegration  constants  or  integrals  for  the  complete 
solution.  The  solution  is  an  infinite  family  of  or¬ 
bits  with  different  sizes,  shapes  and  orientations. 
A  particular  solution  (e.g.  the  orbit  of  Jupiter)  is 
selected  by  fixing  the  values  of  the  six  integrals. 
The  fate  of  a  planet  is  unambiguously  determined 
by  its  position  and  velocity  at  any  given  moment; 
thus  we  could  take  the  position  and  velocity  vec¬ 
tors  at  some  moment  as  our  integrals.  Although 


Fig.  6.2  The  angular  momentum  vector  k  is  perpendicu¬ 
lar  to  the  radius  and  velocity  vectors  of  the  planet.  Since 
A:  is  a  constant  vector,  the  motion  of  the  planet  is  restricted 
to  the  plane  perpendicular  to  k 

they  do  not  tell  us  anything  about  the  geome¬ 
try  of  the  orbit,  they  can  be  used  as  initial  val¬ 
ues  when  integrating  the  orbit  numerically  with 
a  computer.  Another  set  of  integrals,  the  orbital 
elements ,  contains  geometric  quantities  describ¬ 
ing  the  orbit  in  a  very  clear  and  concrete  way.  We 
shall  return  to  these  later.  A  third  possible  set  in¬ 
volves  certain  physical  quantities,  which  we  shall 
derive  next. 

We  begin  by  showing  that  the  angular  momen¬ 
tum  remains  constant.  The  angular  momentum  of 
the  planet  in  the  heliocentric  frame  is 

L  ~  m2V  x  r .  (6.7) 

Celestial  mechanicians  usually  prefer  to  use  the 
angular  momentum  divided  by  the  planet’s  mass 

k  =  r  x  r.  (6.8) 

Let  us  find  the  time  derivative  of  this: 

k  =  rxr-\-rxr. 

The  latter  term  vanishes  as  a  vector  product  of 
two  parallel  vectors.  The  former  term  contains  r, 
which  is  given  by  the  equation  of  motion: 

k  =  r  x  (— /xr/r3)  =  —  (/x/r3)r  x  r  =  0. 

Thus  A:  is  a  constant  vector  independent  of  time 
(as  is  L,  of  course). 

Since  the  angular  momentum  vector  is  always 
perpendicular  to  the  motion  (this  follows  from 
(6.8)),  the  motion  is  at  all  times  restricted  to  the 
invariable  plane  perpendicular  to  k  (Fig.  6.2). 


6.2  Solution  of  the  Equation  of  Motion 
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Fig.  6.3  The  radial  velocity  r  is  the  projection  of  the  ve¬ 
locity  vector  r  in  the  direction  of  the  radius  vector  r 

To  find  another  constant  vector,  we  compute 
the  vector  product  k  x  r: 

k  x  r  =  (r  x  r)  x  (— /xr/r3) 

=  -^[(r-r)r-(r-r)r]. 

The  time  derivative  of  the  distance  r  is  equal  to 
the  projection  of  r  in  the  direction  of  r  (Fig.  6.3); 
thus,  using  the  properties  of  the  scalar  product, 
we  get  r  =r  •  r/r,  which  gives 

r  •  r  =rr .  (6.9) 


Hence, 


Fig.  6.4  The  orbit  of  an  object  in  the  gravitational  field  of 
another  object  is  a  conic  section:  ellipse,  parabola  or  hy¬ 
perbola.  Vector  e  points  to  the  direction  of  the  pericentre, 
where  the  orbiting  object  is  closest  to  central  body.  If  the 
central  body  is  the  Sun,  this  direction  is  called  the  perihe¬ 
lion;  if  some  other  star,  periastron;  if  the  Earth,  perigee, 
etc.  The  true  anomaly  /  is  measured  from  the  pericentre 


k  x  r  =  — /x(r/r  —  rr /r2)  =  — (— /xr/r). 

d  t 

The  vector  product  can  also  be  expressed  as 


k  x  r  =  — (k  x  r), 
d  t 


where  the  planet  is  closest  to  the  Sun  in  its  orbit. 
This  point  is  called  the  perihelion. 

One  more  constant  is  found  by  computing  r  •  r : 

r  •  r  =  —  fir  •  r/r3  =  — /xrr/r3 


since  A;  is  a  constant  vector.  Combining  this  with 
the  previous  equation,  we  have 


d 
d  t 


(fi/r). 


—  (k  x  r  +  iir /r)  =  0 
df  ^  ' 


and 


Since  we  also  have 


•  • 
r 


d 
d  t 


k  x  r  +  pcr/r  =  const  =  —  pie.  (6.10)  we  get 

Since  k  is  perpendicular  to  the  orbital  plane, 
k  x  r  must  lie  in  that  plane.  Thus,  e  is  a  linear 
combination  of  two  vectors  in  the  orbital  plane;  or 
so  e  itself  must  be  in  the  orbital  plane  (Fig.  6.4). 

Later  we  shall  see  that  it  points  to  the  direction 


d 
d  t 


1  9 

-v  —  u/r  =  const  =  h. 

2 


(6.11) 
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Here  v  is  the  speed  of  the  planet  relative  to  the 
Sun.  The  constant  h  is  called  the  energy  integral ; 
the  total  energy  of  the  planet  is  m2h.  We  must  not 
forget  that  energy  and  angular  momentum  depend 
on  the  coordinate  frame  used.  Here  we  have  used 
a  heliocentric  frame,  which  in  fact  is  in  acceler¬ 
ated  motion. 

So  far,  we  have  found  two  constant  vectors  and 
one  constant  scalar.  It  looks  as  though  we  already 
have  seven  integrals,  i.e.  one  too  many.  But  not  all 
of  these  constants  are  independent;  specifically, 
the  following  two  relations  hold: 

k  •  e  =  0,  (6.12) 

li2(e2-  1)  =2hk2,  (6.13) 

where  e  and  k  are  the  lengths  of  e  and  k.  The 
first  equation  is  obvious  from  the  definitions  of  e 
and  k.  To  prove  (6.13),  we  square  both  sides  of 
(6.10)  to  get 

2  2  /»  •  wl  -x  ,  2r  '  r  ,  .x 

/x  e  =  {k  x  r)  •  (k  x  r)  +  /x  — ^-+2  (kxr) - . 

y  Z  y 


planet  is !  To  fix  its  position  in  the  orbit,  we  have 
to  determine  where  the  planet  is  at  some  given  in¬ 
stant  of  time  t  =  to,  or  alternatively,  at  what  time 
it  is  in  some  given  direction.  We  use  the  latter 
method  by  specifying  the  time  of  perihelion  pas¬ 
sage,  the  time  of  perihelion  r. 


6.3  Equation  of  the  Orbit  and 
Kepler's  First  Law 

In  order  to  find  the  geometric  shape  of  the  orbit, 
we  now  derive  the  equation  of  the  orbit.  Since  e  is 
a  constant  vector  lying  in  the  orbital  plane,  we 
choose  it  as  the  reference  direction.  We  denote  the 
angle  between  the  radius  vector  r  and  e  by  /.  The 
angle  /  is  called  the  true  anomaly.  (There  is  noth¬ 
ing  false  or  anomalous  in  this  and  other  anomalies 
we  shall  meet  later.  Angles  measured  from  the 
perihelion  point  are  called  anomalies  to  distin¬ 
guish  them  from  longitudes  measured  from  some 
other  reference  point,  usually  the  vernal  equinox.) 
Using  the  properties  of  the  scalar  product  we  get 


Since  k  is  perpendicular  to  r,  the  length  of 
k  x  r  is  |A:| \r  |  =  kv  and  (k  x  r)  •  (k  x  r)  =  k2v2. 
Thus,  we  have 

li2e2  =  k2v2  +  /x2  H — —{k  x  r  •  r). 

r 

The  last  term  contains  a  scalar  triple  product, 
where  we  can  exchange  the  dot  and  cross  to  get 
k  •  r  x  r.  Next  we  reverse  the  order  of  the  two 
last  factors.  Because  the  vector  product  is  anti- 
commutative,  we  have  to  change  the  sign  of  the 
product: 


r  •  e  =  re  cos  /. 

But  the  product  r  •  e  can  also  be  evaluated  using 
the  definition  of  e: 

r  •  e  = - ( r  •  k  x  r  +  /xr  •  r/r) 

/x 

= - (k  r  x  r  +  fir)  = - , k2  +  /xr) 

/x  /x 

k2 

= - r. 

/x 


p2(e2  —  l)  =  k2v2 - —(k  •  r  x  r) 


r 

=  2 k2  ^-t/2  —  — ^  =  2k2 h. 

This  completes  the  proof  of  (6.13). 

The  relations  (6.12)  and  (6.13)  reduce  the 
number  of  independent  integrals  by  two,  so  we 
still  need  one  more.  The  constants  we  have  de¬ 
scribe  the  size,  shape  and  orientation  of  the  or¬ 
bit  completely,  but  we  do  not  yet  know  where  the 


Equating  the  two  expressions  of  r  •  e  we  get 


k2  /  /x 

r  = - . 

1  +  e  cos  / 


(6.14) 


This  is  the  general  equation  of  a  conic  section  in 
polar  coordinates  (Fig.  6.4;  see  Appendix  A. 2  for 
a  brief  summary  of  conic  sections).  The  magni¬ 
tude  of  e  gives  the  eccentricity  of  the  conic: 


e  =  0  circle, 

0  <  e  <  1  ellipse, 
e  =  1  parabola, 

e  >  1  hyperbola. 
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Fig.  6.5  Six  integration  constants  are  needed  to  describe 
a  planet’s  orbit.  These  constants  can  be  chosen  in  vari¬ 
ous  ways,  (a)  If  the  orbit  is  to  be  computed  numerically, 
the  simplest  choice  is  to  use  the  initial  values  of  the  ra¬ 
dius  and  velocity  vectors,  (b)  Another  possibility  is  to  use 
the  angular  momentum  k,  the  direction  of  the  perihelion  e 


(the  length  of  which  gives  the  eccentricity),  and  the  peri¬ 
helion  time  r .  (c)  The  third  method  best  describes  the  ge¬ 
ometry  of  the  orbit.  The  constants  are  the  longitude  of  the 
ascending  node  Q ,  the  argument  of  perihelion  co,  the  in¬ 
clination  i,  the  semimajor  axis  a,  the  eccentricity  e  and 
the  time  of  perihelion  r 


Inspecting  (6.14),  we  find  that  r  attains  its  mini¬ 
mum  when  /  =  0,  i.e.  in  the  direction  of  the  vec¬ 
tor  e.  Thus,  e  indeed  points  to  the  direction  of  the 
perihelion. 

Starting  with  Newton’s  laws,  we  have  thus 
managed  to  prove  Kepler’s  first  law: 

The  orbit  of  a  planet  is  an  ellipse,  one  focus  of 

which  is  in  the  Sun. 

Without  any  extra  effort,  we  have  shown  that 
also  other  conic  sections,  the  parabola  and  hyper¬ 
bola,  are  possible  orbits. 


orbit  (6.14),  we  see  that  the  parameter  (or  semi- 
latus  rectum)  of  the  orbit  is  p  =  k2  /  pc.  But  the 
parameter  of  a  conic  section  is  always  a\l  —  e2 \ , 
which  gives  the  semimajor  axis,  if  e  and  k  are 
known: 


a  = 


k2  /  pi 


(6.15) 


By  applying  (6.13),  we  get  an  important  relation 
between  the  size  of  the  orbit  and  the  energy  inte¬ 
gral  h : 


6.4  Orbital  Elements 

We  have  derived  a  set  of  integrals  convenient  for 
studying  the  dynamics  of  orbital  motion.  We  now 
turn  to  another  collection  of  constants  more  ap¬ 
propriate  for  describing  the  geometry  of  the  orbit. 
The  following  six  quantities  are  called  the  orbital 
elements  (Fig.  6.5): 

-  semimajor  axis  a , 

-  eccentricity  e, 

-  inclination  i  (or  i), 

-  longitude  of  the  ascending  node  Q , 

-  argument  of  the  perihelion  co , 

-  time  of  the  perihelion  r . 

The  eccentricity  is  obtained  readily  as  the 
length  of  the  vector  e.  From  the  equation  of  the 


if  the  orbit  is  an  ellipse, 


/x/2 h,  if  the  orbit  is  a  hyperbola. 

(6.16) 

For  a  bound  system  (elliptical  orbit),  the  total 
energy  and  the  energy  integral  are  negative.  For 
a  hyperbolic  orbit  h  is  positive;  the  kinetic  en¬ 
ergy  is  so  high  that  the  particle  can  escape  the  sys¬ 
tem  (or  more  correctly,  recede  without  any  limit). 
The  parabola,  with  h  =  0,  is  a  limiting  case  be¬ 
tween  elliptical  and  hyperbolic  orbits.  In  reality 
parabolic  orbits  do  not  exist,  since  hardly  any 
object  can  have  an  energy  integral  exactly  zero. 
However,  if  the  eccentricity  is  very  close  to  one 
(as  with  many  comets),  the  orbit  is  usually  con¬ 
sidered  parabolic  to  simplify  calculations. 

The  orientation  of  the  orbit  is  determined  by 
the  directions  of  the  two  vectors  k  (perpendicular 
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to  the  orbital  plane)  and  e  (pointing  towards  the 
perihelion).  The  three  angles  i,  Q  and  co  contain 
the  same  information. 

The  inclination  i  gives  the  obliquity  of  the  or¬ 
bital  plane  relative  to  some  fixed  reference  plane. 
For  bodies  in  the  solar  system,  the  reference  plane 
is  usually  the  ecliptic.  For  objects  moving  in  the 
usual  fashion,  i.e.  counterclockwise,  the  inclina¬ 
tion  is  in  the  interval  [0°,  90°];  for  retrograde  or¬ 
bits  (clockwise  motion),  the  inclination  is  in  the 
range  (90°,  180°].  For  example,  the  inclination  of 
Halley’s  comet  is  162°,  which  means  that  the  mo¬ 
tion  is  retrograde  and  the  angle  between  its  orbital 
plane  and  the  ecliptic  is  180°  —  162°  =  18°. 

The  longitude  of  the  ascending  node,  +2 ,  indi¬ 
cates  where  the  object  crosses  the  ecliptic  from 
south  to  north.  It  is  measured  counterclockwise 
from  the  vernal  equinox.  The  orbital  elements  i 
and  Q  together  determine  the  orientation  of  the 
orbital  plane,  and  they  correspond  to  the  direction 
of  k ,  i.e.  the  ratios  of  its  components. 

The  argument  of  the  perihelion  co  gives  the 
direction  of  the  perihelion,  measured  from  the 
ascending  node  in  the  direction  of  motion.  The 
same  information  is  contained  in  the  direction 
of  e.  Very  often  another  angle,  the  longitude  of 
the  perihelion  m  (pronounced  as  pi),  is  used  in¬ 
stead  of  It  is  defined  as 

m  =  +2  +  oo.  (6.17) 

This  is  a  rather  peculiar  angle,  as  it  is  measured 
partly  along  the  ecliptic,  partly  along  the  orbital 
plane.  However,  it  is  often  more  practical  than 
the  argument  of  perihelion,  since  it  is  well  de¬ 
fined  even  when  the  inclination  is  close  to  zero 
in  which  case  the  direction  of  the  ascending  node 
becomes  indeterminate. 

We  have  assumed  up  to  this  point  that  each 
planet  forms  a  separate  two-body  system  with  the 
Sun.  In  reality  planets  interfere  with  each  other  by 
disturbing  each  other’s  orbits.  Still  their  motions 
do  not  deviate  very  far  from  the  shape  of  conic 
sections,  and  we  can  use  orbital  elements  to  de¬ 
scribe  the  orbits.  But  the  elements  are  no  longer 
constant;  they  vary  slowly  with  time.  Moreover, 
their  geometric  interpretation  is  no  longer  quite 
as  obvious  as  before.  Such  elements  are  osculat¬ 
ing  elements  that  would  describe  the  orbit  if  all 


Fig.  6.6  Unit  vectors  er  and  e  f  of  the  polar  coordinate 
frame.  The  directions  of  these  change  while  the  planet 
moves  along  its  orbit 

perturbations  were  to  suddenly  disappear.  They 
can  be  used  to  find  the  positions  and  velocities  of 
the  planets  exactly  as  if  the  elements  were  con¬ 
stants.  The  only  difference  is  that  we  have  to  use 
different  elements  for  each  moment  of  time. 

Table  C.12  (at  the  end  of  the  book)  gives 
the  mean  orbital  elements  for  the  nine  planets 
for  the  epoch  J2000.0  as  well  as  their  first  time 
derivatives.  In  addition  to  these  secular  varia¬ 
tions  the  orbital  elements  suffer  from  periodic 
disturbations,  which  are  not  included  in  the  ta¬ 
ble.  Thus  only  approximate  positions  can  be  cal¬ 
culated  with  these  elements.  Instead  of  the  time 
of  perihelion  the  table  gives  the  mean  longitude 

L  =  M  +  co  +  C2,  (6.18) 

which  gives  directly  the  mean  anomaly  M  (which 
will  be  defined  in  Sect.  6.7). 


6.5  Kepler's  Second  and  Third  Law 

The  radius  vector  of  a  planet  in  polar  coordinates 
is  simply 

r  =  rer,  (6.19) 

where  er  is  a  unit  vector  parallel  with  r  (Fig.  6.6). 
If  the  planet  moves  with  angular  velocity  /,  the 
direction  of  this  unit  vector  also  changes  at  the 
same  rate: 

Xer  =  fef,  (6.20) 
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where  ey  is  a  unit  vector  perpendicular  to  er .  The 
velocity  of  the  planet  is  found  by  taking  the  time 
derivative  of  (6.19): 

r  =  rer  -\-rer  =rer  -\-rfef.  (6.21) 

The  angular  momentum  k  can  now  be  evaluated 
using  (6.19)  and  (6.21): 

k  =  rxr  =  r2fez,  (6.22) 

where  ez  is  a  unit  vector  perpendicular  to  the  or¬ 
bital  plane.  The  magnitude  of  k  is 


k  =  r2f.  (6.23) 

The  surface  velocity  of  a  planet  means  the  area 
swept  by  the  radius  vector  per  unit  of  time.  This 
is  obviously  the  time  derivative  of  some  area,  so 
let  us  call  it  A.  In  terms  of  the  distance  r  and  true 
anomaly  /,  the  surface  velocity  is 

A  =  l-r2f.  (6.24) 

By  comparing  this  with  the  length  of  k  (6.23),  we 
find  that 

.  1 

A  =  -k.  (6.25) 

Since  k  is  constant,  so  is  the  surface  velocity. 
Hence  we  have  Kepler’s  second  law: 


Fig.  6.7  The  areas  of  the  shaded  sectors  of  the  ellipse  are 
equal.  According  to  Kepler’s  second  law,  it  takes  equal 
times  to  travel  distances  AB,  CD  and  EF 

where  P  is  the  orbital  period.  Since  the  area  of 
the  ellipse  is 


nab  =  na2s/l 


(6.28) 


where  a  and  b  are  the  semimajor  and  semiminor 
axes  and  e  the  eccentricity,  we  get 


na 


2 


(6.29) 


To  find  the  length  of  A;,  we  substitute  the  energy 
integral  h  as  a  function  of  semimajor  axis  (6.16) 
into  (6.13)  to  get 


The  radius  vector  of  a  planet  sweeps  equal  areas  in 
equal  amounts  of  time. 


k  =  yj G(m\  +  m2)a  (l  —  e 2) 


(6.30) 


Since  the  Sun-planet  distance  varies,  the  or¬ 
bital  velocity  must  also  vary  (Fig.  6.7).  From  Ke¬ 
pler’s  second  law  it  follows  that  a  planet  must 
move  fastest  when  it  is  closest  to  the  Sun  (near 
perihelion).  Motion  is  slowest  when  the  planet  is 
farthest  from  the  Sun  at  aphelion. 

We  can  write  (6.25)  in  the  form 


When  this  is  substituted  into  (6.29)  we  have 


G{m\  +  m2) 


(6.31) 


This  is  the  exact  form  of  Kepler’s  third  law  as 
derived  from  Newton’s  laws.  The  original  version 
was 


1  The  ratio  of  the  cubes  of  the  semimajor  axes  of 

dA  =  —kdt,  (6.26)  the  orbits  of  two  planets  is  equal  to  the  ratio  of  the 

2  squares  of  their  orbital  periods. 


and  integrate  over  one  complete  period: 


/ 

a  orbi 


1  f 

d A=  -k  I  cl/ , 

orbital  ellipse  2  Jq 


(6.27) 


In  this  form  the  law  is  not  exactly  valid,  even 
for  planets  of  the  solar  system,  since  their  own 
masses  influence  their  periods.  The  errors  due  to 
ignoring  this  effect  are  very  small,  however. 
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Kepler’s  third  law  becomes  remarkably  simple 
if  we  express  distances  in  astronomical  units  (au), 
times  in  sidereal  years  (the  abbreviation  is  unfor¬ 
tunately  a,  not  to  be  confused  with  the  semimajor 
axis,  denoted  by  a  somewhat  similar  symbol  a) 
and  masses  in  solar  masses  (M0).  Then  G  =  4n2 
and 

a3  =  (m\  +  ru2)P2.  (6.32) 

The  masses  of  objects  orbiting  around  the  Sun 
can  safely  be  ignored  (except  for  the  largest  plan¬ 
ets),  and  we  have  the  original  law  P 2  =  a3.  This 
is  very  useful  for  determining  distances  of  var¬ 
ious  objects  whose  periods  have  been  observed. 
For  absolute  distances  we  have  to  measure  at  least 
one  distance  in  metres  to  find  the  length  of  one 
au.  Earlier,  triangulation  was  used  to  measure  the 
parallax  of  the  Sun  or  a  minor  planet,  such  as 
Eros,  that  comes  very  close  to  the  Earth.  Nowa¬ 
days,  radiotelescopes  are  used  as  radar  to  very 
accurately  measure,  for  example,  the  distance  to 
Venus.  Since  changes  in  the  value  of  one  au  also 
change  all  other  distances,  the  International  As¬ 
tronomical  Union  decided  in  1968  to  adopt  the 
value  1  au  =  1 .496000  x  1011  m.  The  semimajor 
axis  of  Earth’s  orbit  is  then  slightly  over  one  au. 
But  constants  tend  to  change.  And  so,  after  1984, 
the  astronomical  unit  has  a  new  value, 

1  au=  1.49597870  x  1011  m. 

This  corresponds  to  the  radius  of  an  object  that 
has  the  same  orbital  period  as  the  Earth  but  that 
has  no  mass.  Since  also  the  Earth’s  mass  affects 
its  period,  the  actual  semimajor  axis  of  the  Earth 
must  be  slightly  bigger  than  one  au. 

Another  important  application  of  Kepler’s 
third  law  is  the  determination  of  masses.  By  ob¬ 
serving  the  period  of  a  natural  or  artificial  satel¬ 
lite,  the  mass  of  the  central  body  can  be  obtained 
immediately.  The  same  method  is  used  to  deter¬ 
mine  masses  of  binary  stars  (more  about  this  sub¬ 
ject  in  Chap.  10). 

Although  the  values  of  the  au  and  year  are  ac¬ 
curately  known  in  Si-units,  the  gravitational  con¬ 
stant  is  known  only  approximately.  Astronomical 
observations  give  the  product  G{m\  +  m2),  but 
there  is  no  way  to  distinguish  between  the  con¬ 
tributions  of  the  gravitational  constant  and  those 


of  the  masses.  The  gravitational  constant  must  be 
measured  in  the  laboratory;  this  is  very  difficult 
because  of  the  weakness  of  gravitation.  There¬ 
fore,  if  a  precision  higher  than  2-3  significant 
digits  is  required,  the  Si-units  cannot  be  used. 
Instead  we  have  to  use  the  solar  mass  as  a  unit 
of  mass  (or,  for  example,  the  Earth’s  mass  after 
Gm®  has  been  determined  from  observations  of 
satellite  orbits). 


6.6  Systems  of  Several  Bodies 


This  far  we  have  discussed  systems  consisting  of 
only  two  bodies.  In  fact  it  is  the  most  complex 
system  for  which  a  complete  solution  is  known. 
The  equations  of  motion  are  easily  generalised, 
though.  As  in  (6.5)  we  get  the  equation  of  motion 
for  the  body  k,  k  =  1 , . . . ,  n : 


i  =n 

rk=  y]  Gnii 


rt  ~  rk 

iS  ’ 

Ti  -TkV 


(6.33) 


where  m;  is  the  mass  of  the  /  th  body  and  r;  its 
radius  vector.  On  the  right  hand  side  of  the  equa¬ 
tion  we  now  have  the  total  gravitational  force 
due  to  all  other  objects,  instead  of  the  force  of 
just  one  body.  If  there  are  more  than  two  bodies, 
these  equations  cannot  be  solved  analytically  in 
a  closed  form  (Fig.  6.8).  The  only  integrals  that 
can  be  easily  derived  in  the  general  case  are  the 
total  energy,  total  momentum,  and  total  angular 
momentum. 

If  the  radius  and  velocity  vectors  of  all  bodies 
are  known  for  a  certain  instant  of  time,  the  posi¬ 
tions  at  some  other  time  can  easily  be  calculated 
numerically  from  the  equations  of  motion.  For 
example,  the  planetary  positions  needed  for  as¬ 
tronomical  yearbooks  are  computed  by  integrat¬ 
ing  the  equations  numerically. 

Another  method  can  be  applied  if  the  grav¬ 
ity  of  one  body  dominates  like  in  the  solar  sys¬ 
tem.  Planetary  orbits  can  then  be  calculated  as 
in  a  two-body  system,  and  the  effects  of  other 
planets  taken  into  account  as  small  perturbations. 
For  these  perturbations  several  series  expansions 
have  been  derived. 

The  restricted  three-body  problem  is  an  exten¬ 
sively  studied  special  case.  It  consists  of  two  mas¬ 
sive  bodies  or  primaries ,  moving  on  circular  or- 
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Fig.  6.8  When  a  system  consists  of  more  than  two  bodies, 
the  equations  of  motion  cannot  be  solved  analytically.  In 
the  solar  system  the  mutual  disturbances  of  the  planets  are 
usually  small  and  can  be  taken  into  account  as  small  per¬ 
turbations  in  the  orbital  elements.  K.F.  Sundman  designed 
a  machine  to  carry  out  the  tedious  integration  of  the  pertur¬ 
bation  equations.  This  machine,  called  the  perturbograph, 
is  one  of  the  earliest  analogue  computers;  unfortunately 
it  was  never  built.  Shown  is  a  design  for  one  component 
that  evaluates  a  certain  integral  occurring  in  the  equations. 
(The  picture  appeared  in  K.F.  Sundman’ s  paper  in  Fest- 
skrift  tillegnad  Anders  Donner  in  1915) 

bits  around  each  other,  and  a  third,  massless  body, 
moving  in  the  same  plane  with  the  primaries.  This 
small  object  does  in  no  way  disturb  the  motion  of 
the  primaries.  Thus  the  orbits  of  the  massive  bod¬ 
ies  are  as  simple  as  possible,  and  their  positions 
are  easily  computed  for  all  times.  The  problem  is 
to  find  the  orbit  of  the  third  body.  It  turns  out  that 
there  is  no  finite  expression  for  this  orbit. 

The  Finnish  astronomer  Karl  Frithiof  Sund¬ 
man  (1873-1949)  managed  to  show  that  a  solu¬ 
tion  exists  and  derive  a  series  expansion  for  the 
orbit.  The  series  converges  so  slowly  that  it  has 


Fig.  6.9  The  Lagrangian  points  of  the  restricted  three- 
body  problem.  The  points  L\,  L2  and  L3  are  on  the  same 
line  with  the  primaries,  but  the  numbering  may  vary.  The 
points  L4  and  L5  form  equilateral  triangles  with  the  pri¬ 
maries 

no  practical  use,  but  as  a  mathematical  result  it 
was  remarkable,  since  many  mathematicians  had 
for  a  long  time  tried  to  attack  the  problem  without 
success. 

The  three-body  problem  has  some  interesting 
special  solutions.  It  can  be  shown  that  in  certain 
points  the  third  body  can  remain  at  rest  with  re¬ 
spect  to  the  primaries.  There  are  five  such  points, 
known  as  the  Lagrangian  points  L\,...,Ls 
(Fig.  6.9).  Three  of  them  are  on  the  straight  line 
determined  by  the  primaries.  These  points  are  un¬ 
stable:  if  a  body  in  any  of  these  points  is  dis¬ 
turbed,  it  will  escape.  The  two  other  points,  on 
the  other  hand,  are  stable.  These  points  together 
with  the  primaries  form  equilateral  triangles.  For 
example,  some  asteroids  have  been  found  around 
the  Lagrangian  points  L4  and  L5  of  Jupiter.  The 
first  of  them  were  named  after  heroes  of  the  Tro¬ 
jan  war,  and  so  they  are  called  Trojan  asteroids. 
They  move  around  the  Lagrangian  points  and 
can  actually  travel  quite  far  from  them,  but  they 
cannot  escape.  Figure  8.38  shows  two  distinct 
condensations  around  the  Lagrangian  points  of 
Jupiter.  Later  similar  Trojan  asteroids  have  been 
found  also  on  the  orbits  of  other  planets. 


6.7  Orbit  Determination 

Celestial  mechanics  has  two  very  practical  tasks: 
to  determine  orbital  elements  from  observations 
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and  to  predict  positions  of  celestial  bodies  with 
known  elements.  Planetary  orbits  are  already 
known  very  accurately,  but  new  comets  and  mi¬ 
nor  planets  are  found  frequently,  requiring  orbit 
determination. 

The  first  practical  methods  for  orbit  determi¬ 
nation  were  developed  by  Johann  Karl  Friedrich 
Gauss  (1777-1855)  at  the  beginning  of  the  19th 
century.  By  that  time  the  first  minor  planets  had 
been  discovered,  and  thanks  to  Gauss’s  orbit  de¬ 
terminations,  they  could  be  found  and  observed 
at  any  time. 

At  least  three  observations  are  needed  for 
computing  the  orbital  elements.  The  directions 
are  usually  measured  from  pictures  taken  a  few 
nights  apart.  Using  these  directions,  it  is  possible 
to  find  the  corresponding  absolute  positions  (the 
rectangular  components  of  the  radius  vector).  To 
be  able  to  do  this,  we  need  some  additional  con¬ 
straints  on  the  orbit;  we  must  assume  that  the  ob¬ 
ject  moves  along  a  conic  section  lying  in  a  plane 
that  passes  through  the  Sun.  When  the  three  ra¬ 
dius  vectors  are  known,  the  ellipse  (or  some  other 
conic  section)  going  through  these  three  points 
can  be  determined.  In  practice,  more  observations 
are  used.  The  elements  determined  are  more  ac¬ 
curate  if  there  are  more  observations  and  if  they 
cover  the  orbit  more  completely. 

Although  the  calculations  for  orbit  determi¬ 
nation  are  not  too  involved  mathematically,  they 
are  relatively  long  and  laborious.  Several  meth¬ 
ods  can  be  found  in  textbooks  of  celestial  me¬ 
chanics. 


Fig.  6.10  Definition  of  the  eccentric  anomaly  E.  The 
planet  is  at  P,  and  r  is  its  radius  vector 

/v 

where  i  and  j  are  unit  vectors  parallel  with  the 
major  and  minor  axes,  respectively.  The  angle  E 
is  the  eccentric  anomaly ;  its  slightly  eccentric 
definition  is  shown  in  Fig.  6.10.  Many  formulas 
of  elliptical  motion  become  very  simple  if  either 
time  or  true  anomaly  is  replaced  by  the  eccentric 
anomaly.  As  an  example,  we  take  the  square  of 
(6.34)  to  find  the  distance  from  the  Sun: 

r2  =  r  r 

=  a2 (cos  E  —  e)2  +  h2  sin2  E 
=  a2[( cos  E  —  e)2  +  (l  —  e2) (l  —  cos2  £)] 

=  a2[  1  —  2e  cos  E  +  e2  cos2  is], 


6.8  Position  in  the  Orbit 

Although  we  already  know  everything  about  the 
geometry  of  the  orbit,  we  still  cannot  find  the 
planet  at  a  given  time,  since  we  do  not  know  the 
radius  vector  rasa  function  of  time.  The  variable 
in  the  equation  of  the  orbit  is  an  angle,  the  true 
anomaly  /,  measured  from  the  perihelion.  From 
Kepler’s  second  law  it  follows  that  /  cannot  in¬ 
crease  at  a  constant  rate  with  time.  Therefore  we 
need  some  preparations  before  we  can  find  the  ra¬ 
dius  vector  at  a  given  instant. 

The  radius  vector  can  be  expressed  as 

r  =  a(cosE  —  e)i  +  bsinEj,  (6.34) 


whence 

r  =  a  (1  —  ^cos E).  (6.35) 

Our  next  problem  is  to  find  how  to  calculate  E 
for  a  given  moment  of  time.  According  to  Ke¬ 
pler’s  second  law,  the  surface  velocity  is  constant. 
Thus  the  area  of  the  shaded  sector  in  Fig.  6.10  is 

t  —  x 

A  =  7i  ab  — — — ,  (6.36) 

where  t  —  x  is  the  time  elapsed  since  the  peri¬ 
helion,  and  P  is  the  orbital  period.  But  the  area 
of  a  part  of  an  ellipse  is  obtained  by  reducing 
the  area  of  the  corresponding  part  of  the  circum¬ 
scribed  circle  by  the  axial  ratio  h/ a.  (As  the  math- 
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ematicians  say,  an  ellipse  is  an  affine  transfor¬ 
mation  of  a  circle.)  Hence  the  area  of  SPX  is 
(Fig.  6.11) 

A  =  -  (area  of  SP'X) 
a 


-  (area  of  the  sector  CP  X 
a 

—  area  of  the  triangle  CP'S) 

b( 1  1  .  N 

-I  -a  ■  aE - ae  •  a  sin  E 

a\  2  2 

1 

-ab(E  —  esin  E). 


By  equating  these  two  expressions  for  the  area  A, 
we  get  the  famous  Kepler's  equation , 


Fig.  6.11  The  area  of  the  shaded  sector  equals  b / a  times 
the  area  SP'X.  S  =  the  Sun,  P  =  the  planet,  X  =  the  peri¬ 
helion 


E  —  e  sin  e  —  M ,  (6.37) 


where 

2tz* 

M=-^-(t-r)  (6.38) 

is  the  mean  anomaly  of  the  planet  at  time  t.  The 
mean  anomaly  increases  at  a  constant  rate  with 
time.  It  indicates  where  the  planet  would  be  if  it 
moved  in  a  circular  orbit  of  radius  a.  For  circular 
orbits  all  three  anomalies  /,  £,  and  M  are  always 
equal. 

If  we  know  the  period  and  the  time  elapsed 
after  the  perihelion,  we  can  use  (6.38)  to  find  the 
mean  anomaly.  Next  we  must  solve  for  the  eccen¬ 
tric  anomaly  from  Kepler’s  equation  (6.37).  Fi¬ 
nally  the  radius  vector  is  given  by  (6.35).  Since 
the  components  of  r  expressed  in  terms  of  the 
true  anomaly  are  r  cos  /  and  r  sin  /,  we  find 


a(cosE  —  e)  cos  E  —  e 

cos  f  = - = - , 

r  1  —  e  cos  E 

b  sin  E  r - -  sin  E 

sin  /  = - =  V  l  —  ez - . 

r  1  —  e  cos  E 


(6.39) 


These  determine  the  true  anomaly,  should  it  be  of 
interest. 

Now  we  know  the  position  in  the  orbital  plane. 
This  must  usually  be  transformed  to  some  other 
previously  selected  reference  frame.  For  exam¬ 
ple,  we  may  want  to  know  the  ecliptic  longitude 
and  latitude,  which  can  later  be  used  to  find  the 


right  ascension  and  declination.  These  transfor¬ 
mations  belong  to  the  realm  of  spherical  astron¬ 
omy  and  are  briefly  discussed  in  Examples  6.5- 
6.7. 
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If  an  object  moves  fast  enough,  it  can  escape  from 
the  gravitational  field  of  the  central  body  (to  be 
precise:  the  field  extends  to  infinity,  so  the  object 
never  really  escapes,  but  is  able  to  recede  without 
any  limit).  If  the  escaping  object  has  the  mini¬ 
mum  velocity  allowing  escape,  it  will  have  lost 
all  its  velocity  at  infinity  (Fig.  6.12).  There  its  ki¬ 
netic  energy  is  zero,  since  v  =  0,  and  the  potential 
energy  is  also  zero,  since  the  distance  r  is  infinite. 
At  infinite  distance  the  total  energy  as  well  as  the 
energy  integral  h  are  zero.  The  law  of  conserva¬ 
tion  of  energy  gives,  then: 

-v2-^=0,  (6.40) 

Z  K 

where  R  is  the  initial  distance  at  which  the  object 
is  moving  with  velocity  v.  From  this  we  can  solve 
the  escape  velocity : 


2G(m\  +  m2) 


R 


(6.41) 


For  example  on  the  surface  of  the  Earth,  ne  is 
about  1 1  km/s  (if  m2  <^m@). 
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Fig.  6.1 2  A  projectile  is 
shot  horizontally  from 
a  mountain  on  an 
atmosphereless  planet.  If 
the  initial  velocity  is  small, 
the  orbit  is  an  ellipse  whose 
pericentre  is  inside  the 
planet,  and  the  projectile 
will  hit  the  surface  of  the 
planet.  When  the  velocity 
is  increased,  the  pericentre 
moves  outside  the  planet. 
When  the  initial  velocity 
is  vc,  the  orbit  is  circular.  If 
the  velocity  is  increased 
further,  the  eccentricity  of 
the  orbit  grows  again  and 
the  pericentre  is  at  the 
height  of  the  cannon.  The 
apocentre  moves  further 
away  until  the  orbit 
becomes  parabolic  when 
the  initial  velocity  is  ve . 
With  even  higher 
velocities,  the  orbit 
becomes  hyperbolic 


\ 


The  escape  velocity  can  also  be  expressed  us¬ 
ing  the  orbital  velocity  of  a  circular  orbit.  The  or¬ 
bital  period  P  as  a  function  of  the  radius  R  of  the 
orbit  and  the  orbital  velocity  vc  is 

2n  R 
P  = - . 

Vc 

Substitution  into  Kepler’s  third  law  yields 

4 jt2R2  _  4 n2R? 

G(m\  +  m2) 

From  this  we  can  solve  the  velocity  vc  in  a  circu¬ 
lar  orbit  of  radius  R : 


vc  = 


G(m\  +  m2) 


R 


(6.42) 


Comparing  this  with  the  expression  (6.41)  of  the 
escape  velocity,  we  see  that 


Vc 


V2vc. 


6.1 0  Virial  Theorem 

If  a  system  consists  of  more  than  two  objects,  the 
equations  of  motion  cannot  in  general  be  solved 
analytically  (Fig.  6.12).  Given  some  initial  val¬ 
ues,  the  orbits  can,  of  course,  be  found  by  numer¬ 
ical  integration,  but  this  does  not  tell  us  anything 
about  the  general  properties  of  all  possible  orbits. 
The  only  integration  constants  available  for  an  ar¬ 
bitrary  system  are  the  total  momentum,  angular 
momentum  and  energy.  In  addition  to  these,  it  is 
possible  to  derive  certain  statistical  results,  like 
the  virial  theorem.  It  concerns  time  averages  only, 
but  does  not  say  anything  about  the  actual  state  of 
the  system  at  some  specified  moment. 

Suppose  we  have  a  system  of  n  point  masses 
mi  with  radius  vectors  #r  and  velocities  /*;.  We 
define  a  quantity  A  (the  “virial”  of  the  system)  as 
follows: 


n 

A  =  'Y^miri  ■ r, . 

i  =  1 


(6.43) 


(6.44) 
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The  time  derivative  of  this  is 


n 

A  =  7>,r,-  •  r,  +  mifi  ■  r,)- 

i  =  1 


(6.45) 


The  first  term  equals  twice  the  kinetic  energy 
of  the  i  th  particle,  and  the  second  term  contains 
a  factor  m/r/  which,  according  to  Newton’s  laws, 
equals  the  force  applied  to  the  i  th  particle.  Thus 
we  have 


n 


A  =  2T  +  YJFi  -n. 


(6.46) 


i  =  1 


where  T  is  the  total  kinetic  energy  of  the  system. 
If  (x)  denotes  the  time  average  of  v  in  the  time 
interval  [0,  r],  we  have 

(A)  =  I  jT  A  dt  =  (2 T)  +  4 

(6.47) 

If  the  system  remains  bounded,  i.e.  none  of  the 
particles  escapes,  all  rz-  ’s  as  well  as  all  velocities 
will  remain  bounded.  In  such  a  case,  A  does  not 
grow  without  limit,  and  the  integral  of  the  pre¬ 
vious  equation  remains  finite.  When  the  time  in- 
terval  becomes  longer  (r  — >  oo),  (A)  approaches 
zero,  and  we  get 


n 


<2 t)  +  (  J2Fi-rn= o. 


(6.48) 


i  =  1 


This  is  the  general  form  of  the  virial  theorem.  If 
the  forces  are  due  to  mutual  gravitation  only,  they 
have  the  expressions 


n 


Fi  =  - Gm i  Y' 

./  =  I  - ./  4 ' 


ri  ~  r  j 

mi — — 
r. 
ij 


(6.49) 


where  r\j  =  |  r  /  —  r  j  \ .  The  latter  term  in  the  virial 
theorem  is  now 


n  n 

=  ~gY  J2  mimJ 

i  =  1  j  =  i  +  1 


where  the  latter  form  is  obtained  by  rearranging 
the  double  sum,  combining  the  terms 


ri  ~  r  j 

miirij - r - rt 

r  ■ 
ij 


and 


mjmi 


ntinij 


Since  (r/  —  r  /)  •  (r*  —  r  /)  =  r2.  the  sum  reduces 

J  J  IJ 

to 


n 


n 


-gE  e 


miirij 


i  —  1  j  =  i  +  1 


where  1/  is  the  potential  energy  of  the  system. 
Thus,  the  virial  theorem  becomes  simply 


{T)  =  ~{U).  (6.50) 


6.1 1  The  Jeans  Limit 

We  shall  later  study  the  birth  of  stars  and  galax¬ 
ies.  The  initial  stage  is,  roughly  speaking,  a  gas 
cloud  that  begins  to  collapse  due  to  its  own  gravi¬ 
tation.  If  the  mass  of  the  cloud  is  high  enough,  its 
potential  energy  exceeds  the  kinetic  energy  and 
the  cloud  collapses.  From  the  virial  theorem  we 
can  deduce  that  the  potential  energy  must  be  at 
least  twice  the  kinetic  energy.  This  provides  a  cri¬ 
terion  for  the  critical  mass  necessary  for  the  cloud 
of  collapse.  This  criterion  was  first  suggested  by 
Sir  James  Jeans  in  1902. 

The  critical  mass  will  obviously  depend  on  the 
pressure  P  and  density  p .  Since  gravitation  is  the 
compressing  force,  the  gravitational  constant  G 
will  probably  also  enter  our  expression.  Thus  the 
critical  mass  is  of  the  form 


n 


HF 

i  =  1 


i  •  rt 


n  n 


=  -G  E  E  mimj  A1  ■  ri 


i  =  1  j  = 


rl 

‘J 


M  =  CPaGbpc,  (6.51) 

where  C  is  a  dimensionless  constant,  and  the  con¬ 
stants  a ,  b  and  c  are  determined  so  that  the  right- 
hand  side  has  the  dimension  of  mass.  The  dimen¬ 
sion  of  pressure  is  kgm-1s-2,  of  gravitational 
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constant  kg  1  m3  s  2  and  of  density  kg  m  3 . 
Thus  the  dimension  of  the  right-hand  side  is 

k g(a-&+c)  m(-a+3b-3c)  ^(-2a-2b) 

Since  this  must  be  kilograms  ultimately,  we  get 
the  following  set  of  equations: 


a  —  b  +  c  =  1 ,  —a  +  3b  —  3c  =  0, 
—  2a  —  2b  =  0. 


The  solution  of  this  is  a  =  3/2,  b  =  —  3/2  and 
c  =  —  2.  Hence  the  critical  mass  is 


Mj  =  C 


p  3/2 

G^p2' 


(6.52) 


This  is  called  the  Jeans  mass.  In  order  to  deter¬ 
mine  the  constant  C,  we  naturally  must  calcu¬ 
late  both  kinetic  and  potential  energy.  Another 
method  based  on  the  propagation  of  waves  de¬ 
termines  the  diameter  of  the  cloud,  the  Jeans 
length  Aj,  by  requiring  that  a  disturbance  of  size 
Aj  grow  unbounded.  The  value  of  the  constant  C 
depends  on  the  exact  form  of  the  perturbation,  but 
its  typical  values  are  in  the  range  [l/n,  2tv].  We 
can  take  C  =  1  as  well,  in  which  case  (6.52)  gives 
a  correct  order  of  magnitude  for  the  critical  mass. 
If  the  mass  of  a  cloud  is  much  higher  than  Mj,  it 
will  collapse  by  its  own  gravitation. 

In  (6.52)  the  pressure  can  be  replaced  by  the 
kinetic  temperature  7k  of  the  gas  (see  Sect.  5.8  for 
a  definition).  According  to  the  kinetic  gas  theory, 
the  pressure  is 


P=nkTk ,  (6.53) 

where  n  is  the  number  density  (particles  per  unit 
volume)  and  k  is  Boltzmann’s  constant.  The  num¬ 
ber  density  is  obtained  by  dividing  the  density  of 
the  gas  p  by  the  average  molecular  weight  \i\ 

n  =  pfri, 


Box  6.1  (Newton’s  Laws) 

1.  In  the  absence  of  external  forces,  a  particle 
will  remain  at  rest  or  move  along  a  straight 
line  with  constant  speed. 

2.  The  rate  of  change  of  the  momentum  of 
a  particle  is  equal  to  the  applied  force  F  : 

d 

p  =  — (mv)  =  F. 
d  t 

3.  If  particle  A  exerts  a  force  F  on  another  par¬ 
ticle  B ,  B  will  exert  an  equal  but  opposite 
force  —F  on  A. 

If  several  forces  F\,  F 2, ...  are  applied  on 
a  particle,  the  effect  is  equal  to  that  caused  by 
one  force  F  which  is  the  vector  sum  of  the  in¬ 
dividual  forces  ( F  =  F\  +  F2  -\ - ). 

Law  of  gravitation:  If  the  masses  of  particles 
A  and  B  are  m  a  and  m  b  and  their  mutual  dis¬ 
tance  r,  the  force  exerted  on  A  by  B  is  directed 
towards  B  and  has  the  magnitude  Gm a  ms /r2, 
where  G  is  a  constant  depending  on  the  units 
chosen. 

Newton  denoted  the  derivative  of  a  func- 
tion  f  by  f  and  the  integral  function  by  f .  The 
corresponding  notations  used  by  Leibniz  were 
df/dt  and  f  fdx.  Of  Newton’s  notations,  only 
the  dot  is  still  used,  always  signifying  the  time 
derivative:  /  =  d  f/dt.  For  example,  the  veloc¬ 
ity  r  is  the  time  derivative  of  r,  the  acceleration 
r  its  second  derivative,  etc. 


6.12  Examples 

Example  6.1  Find  the  orbital  elements  of  Jupiter 
on  August  23,  1996. 

The  Julian  date  is  2,450,319,  hence  from 
(6.17),  T  =  —0.0336.  By  substituting  this  into 
the  expressions  of  Table  C.12,  we  get 


whence 

P  =  pkTk/gL. 

By  substituting  this  into  (6.52)  we  get 

kTk\3/2  1 
V-Gj 


(6.54) 


a  =  5.2033, 
e  =  0.0484, 
i  =  1.3053°, 

Q  =  100.5448°, 
xu  =  14.7460°, 


Mj  =  C 
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L  =  -67.460°  =  292.540°. 

From  these  we  can  compute  the  argument  of  per¬ 
ihelion  and  mean  anomaly: 

CD  =  zu  -  Q  =  -85.7988°  =  274.201°, 

M  =  L  —  zu  =  -82.2060°  =  277.794°. 

Example  6.2  (Orbital  Velocity)  (a)  Comet  Aus¬ 
tin  (1982g)  moves  in  a  parabolic  orbit.  Find  its 
velocity  on  October  8,  1982,  when  the  distance 
from  the  Sun  was  1.10  au. 

The  energy  integral  for  a  parabola  is  h  =  0. 
Thus  (6.11)  gives  the  velocity  v : 

2  /x  I2GMq 
r  V  r 

2  x  47T2  x  1 

- =  8.47722  au/a 

1.10 

8.47722  x  1.496  x  1011  m 

- &  40  km/s. 

365.2564  x  24  x  3600  s 


The  period  is  obtained  from  Kepler’s  third 
law: 

2  47 r2a2 

P2  = - 

G(m  i  +  m2) 

Atx2\  2 

“  6.67  x  10— 1 1  (5  +  5)  S 

=  5.9  x  1010  s2, 

whence 

P  =  243,000  s  =  2.8  d. 

Example  6.4  The  period  of  the  Martian  moon 
Phobos  is  0.3189  d  and  the  radius  of  the  orbit 
9370  km.  What  is  the  mass  of  Mars? 

First  we  change  to  more  appropriate  units: 

P  =  0.3189  d  =  0.0008731  sidereal  years, 
a  —  9370  km  =  6.2634  x  10-5  au. 

Equation  (6.32)  gives  (it  is  safe  to  assume  that 

^Phobos  777 Mars) 


(b)  The  semimajor  axis  of  the  minor  planet 
1982  RA  is  1.568  au  and  the  distance  from  the 
Sun  on  October  8,  1982,  was  1.17  au.  Find  its 
velocity. 

The  energy  integral  (6.16)  is  now 

h  =  —/i/2a. 

Hence 

1  9  /X  /X 

2  r  2a ’ 

which  gives 


OTMars  =  a3  /  P2  =  0.000000322  Mq 

(ss  0.107  Mq). 

Example  6.5  Derive  formulas  for  a  planet’s  he¬ 
liocentric  longitude  and  latitude,  given  its  orbital 
elements  and  true  anomaly. 

We  apply  the  sine  formula  to  the  spherical  tri¬ 
angle  of  the  figure: 

sin  sin(<x>  +  /) 
sin  i  sin(7r/2) 

or 

sin  P  =  sin  i  sin(&>  +  /) . 


=  6.5044  au/a  ^31  km/s. 


The  sine-cosine  formula  gives 
cos(7r/2)  sin/3 

=  —  cos  i  sin(&>  +  /)  cos(A  —  £2) 
+  cos(&>  +  /)  sin(k  —  *$2), 


Example  6.3  In  an  otherwise  empty  universe,  whence 
two  rocks  of  5  kg  each  orbit  each  other  at  a  dis¬ 
tance  of  1  m.  What  is  the  orbital  period? 


tan(k  —  Q)  =  cos  i  tan(<x>  +  /). 
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Example  6.6  Find  the  radius  vector  and  helio¬ 
centric  longitude  and  latitude  of  Jupiter  on  Au¬ 
gust  23,  1996. 

The  orbital  elements  were  computed  in  Exam¬ 
ple  6.1: 

a  =  5.2033  au, 
e  =  0.0484, 
i  =  1.3053°, 

Q  =  100.5448°, 
go  =  274.2012°, 

M  =  277.7940°  =  4.8484  rad. 

Since  the  mean  anomaly  was  obtained  directly, 
we  need  not  compute  the  time  elapsed  since  peri¬ 
helion. 

Now  we  have  to  solve  Kepler’s  equation.  It 
cannot  be  solved  analytically,  and  we  are  obliged 
to  take  the  brute  force  approach  (also  called  nu¬ 
merical  analysis)  in  the  form  of  iteration.  For  it¬ 
eration,  we  write  the  equation  as 

En+ 1  =  M  +  e  sin  En, 

where  En  is  the  value  found  in  the  nth  iteration. 
The  mean  anomaly  is  a  reasonable  initial  guess, 
Eq.  (N.B.:  Here,  all  angles  must  be  in  radians; 
otherwise,  nonsense  results!)  The  iteration  pro¬ 
ceeds  as  follows: 

Eq  =  M  =  4.8484, 

E\  =  M  +  e  sin  Eq  =  4.8004, 

E2  =  M  +  e  sin  E\  =  4.8002, 


E3  =  M  +  e  sin  E2  =  4.8002, 

after  which  successive  approximations  no  longer 
change,  which  means  that  the  solution,  accurate 
to  four  decimal  places,  is 

E  =  4.8002  =  275.0°. 

The  radius  vector  is 

r  =  ci  {cos  E  —  e)i  +  a\J  1  —  e1  sin  E  j 
=  0.2045  i  —  5.1772  j 

and  the  distance  from  the  Sun, 

r  =  a(  1  —  e cos  E)  =  5.1813  au. 

The  signs  of  the  components  of  the  radius  vector 
show  that  the  planet  is  in  the  fourth  quadrant.  The 
true  anomaly  is 

-5.1772 

/  =  arctan - =  272.3  . 

J  0.2045 

Applying  the  results  of  the  previous  example,  we 
find  the  latitude  and  longitude: 

sin  p  =  sin  i  sin(o;  +  /) 

=  sin  1.3°  sin(274.2°  +  272.3°) 

=  -0.0026 
=>  P  =  —0.15°, 
tan(A  —  £2)  =  cos  i  tan (co  +  /) 

=  cos  1 .3°  tan(274.2°  +  272.3°) 

=  0.1139 
=►  A  =  +  186.5° 

=  100.5°  +  186.5° 

=  287.0°. 

(We  must  be  careful  here;  the  equation  for  tan  (A  — 
Q)  allows  two  solutions.  If  necessary,  a  figure 
can  be  drawn  to  decide  which  is  the  correct  one.) 

Example  6.7  Find  Jupiter’s  right  ascension  and 
declination  on  August  23,  1996. 


6.12  Examples 
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In  Example  6.6,  we  found  the  longitude  and 
latitude,  A  =  287.0°,  /3  =  —0.15°.  The  corre¬ 
sponding  rectangular  (heliocentric)  coordinates 
are: 


x  =  r  cos  A  cos  /3  =  1 .5 154  au, 
y  =  r  sin  A  cos  /3  =  —4.9547  au, 
z  =  r  sin/3  =  —0.0133  au. 

Jupiter’s  ecliptic  coordinates  must  be  trans¬ 
formed  to  equatorial  ones  by  rotating  them  around 
the  v-axis  by  an  angle  e,  the  obliquity  of  the 
ecliptic  (see  Box  2.1): 

Xj  =  x  =  1.5154  au, 

Yj  =  y  cos  £  —  z  sin  s  =  —4.5405  au, 

Zj  =  y  sin  s  +  z  cos  £  =  —1.9831  au. 

To  find  the  direction  relative  to  the  Earth,  we 
have  to  find  where  the  Earth  is.  In  principle,  we 
could  repeat  the  previous  procedure  with  the  or¬ 
bital  elements  of  the  Earth.  Or,  if  we  are  lazy, 
we  could  pick  up  the  nearest  Astronomical  Al¬ 
manac ,  which  lists  the  equatorial  coordinates  of 
the  Earth: 


A0  =  0.8815  au, 
y0  =  —0.4543  au, 

Z0  =  -0.1970  au. 

Then  the  position  relative  to  the  Earth  is 

X0  =  Xj  -  X0  =  0.6339  au, 
y0  =  7j  —  T0  =  -4.0862  au, 

Z()  =  Zj  —  Z0  =  —1.7861  au. 

And  finally,  the  right  ascension  and  declination 
are 

a  =  arctan(To/2fo)  =  278.82°  =  18  h  35  min, 

8  =  arctan  0  —  =  —23.4°. 

A« + n 

If  the  values  given  by  the  Astronomical  Almanac 
are  rounded  to  the  same  accuracy,  the  same  result 


is  obtained.  We  should  not  expect  a  very  precise 
position  since  we  have  neglected  all  short-period 
perturbations  in  Jupiter’s  orbital  elements. 

Example  6.8  Which  is  easier,  to  send  a  probe  to 
the  Sun  or  away  from  the  Solar  system? 

The  orbital  velocity  of  the  Earth  is  about 
30  km/s.  Thus  the  escape  velocity  from  the  So¬ 
lar  system  is  \fl  x  30  ~  42  km/s.  A  probe  that  is 
sent  from  the  Earth  already  has  a  velocity  equal 
to  the  orbital  velocity  of  the  Earth.  Hence  an  extra 
velocity  of  only  12  km/s  is  needed.  In  addition, 
the  probe  has  to  escape  from  the  Earth,  which  re¬ 
quires  11  km/s.  Thus  the  total  velocity  changes 
are  about  23  km/s. 

If  the  probe  has  to  fall  to  the  Sun  it  has  to  get 
rid  of  the  orbital  velocity  of  the  Earth  30  km/s.  In 
this  case,  too,  the  probe  has  first  to  be  lifted  from 
the  Earth.  Thus  the  total  velocity  change  needed 
is  41  km/s.  This  is  nearly  impossible  with  current 
technology.  Therefore  a  probe  to  be  sent  to  the 
Sun  is  first  directed  close  to  some  planet,  and  the 
gravitational  field  of  the  planet  is  used  to  acceler¬ 
ate  the  probe  towards  its  final  destination. 


Example  6.9  An  interstellar  hydrogen  cloud 
contains  10  atoms  per  cm3.  How  big  must  the 
cloud  be  to  collapse  due  to  its  own  gravitation? 
The  temperature  of  the  cloud  is  100  K. 

The  mass  of  one  hydrogen  atom  is  1.67  x 
10~27  kg,  which  gives  a  density 

p  =n/ji=  107  m-3  x  1.67  x  10-27  kg 

=  1.67  x  10-20  kg/m3. 

The  critical  mass  is 


Mj 


1.38  x  10-23  J/Kx  100  K 


3/2 


1.67  x  10-27  kg  x  6.67  x  lO"11  Nm2kg“2 

1 


x 


a/1.67  x  10  20  kg/m3 
1  x  1034  kg  ~  5000  Mq. 


The  radius  of  the  cloud  is 


,  /  3  M  17 

R  =  H - «  5  x  1017  m  20  pc. 

471  p 
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6.13  Exercises 

Exercise  6.1  Find  the  ratio  of  the  orbital  veloci¬ 
ties  at  aphelion  and  perihelion  ua/up.  What  is  this 
ratio  for  the  Earth? 

Exercise  6.2  The  perihelion  and  aphelion  of  the 
orbit  of  Eros  are  1.1084  and  1.8078  astronomical 
units  from  the  Sun.  What  is  the  velocity  of  Eros 
when  its  distance  from  the  Sun  equals  the  mean 
distance  of  Mars? 

Exercise  6.3  Find  the  radius  of  the  orbit  of 
a  geostationary  satellite;  such  a  satellite  remains 
always  over  the  same  point  of  the  equator  of  the 
Earth.  Are  there  areas  on  the  surface  of  the  Earth 
that  cannot  be  seen  from  any  geostationary  satel¬ 
lite?  If  so,  what  fraction  of  the  total  surface  area? 

Exercise  6.4  From  the  angular  diameter  of  the 
Sun  and  the  length  of  the  year,  derive  the  mean 
density  of  the  Sun. 


Exercise  6.5  Find  the  mean,  eccentric  and  true 
anomalies  of  the  Earth  one  quarter  of  a  year  after 
the  perihelion. 

Exercise  6.6  The  velocity  of  a  comet  is  5  m/s, 
when  it  is  very  far  from  the  Sun.  If  it  moved  along 
a  straight  line,  it  would  pass  the  Sun  at  a  distance 
of  1  au.  Find  the  eccentricity,  semimajor  axis  and 
perihelion  distance  of  the  orbit.  What  will  happen 
to  the  comet? 

Exercise  6.7  (a)  Find  the  ecliptic  geocentric  ra¬ 
dius  vector  of  the  Sun  on  May  1,  1997  (/  = 
2,450,570). 

(b)  What  are  the  declination  and  right  ascen¬ 
sion  of  the  Sun  then? 


The  Solar  System 


The  solar  system  consists  of  a  central  star,  called 
the  Sun ,  planets  orbiting  the  Sun  and  several 
smaller  objects.  By  the  solar  system  we  mean 
here  and  in  the  next  chapter  the  system  around 
our  own  Sun.  Also  other  stars  have  similar  sys¬ 
tems.  They  will  be  discussed  later  in  Chap.  22. 
This  chapter  deals  with  general  properties  of  the 
solar  system.  Individual  objects  will  be  discussed 
in  the  next  chapter. 

Research  of  the  solar  system  has  evolved 
rapidly  since  the  1960’s  when  space  probes  have 
made  it  possible  to  study  planets  from  short  dis¬ 
tances.  Many  methods  used  in  geosciences  are 
nowadays  applied  also  in  planetary  studies.  Lan¬ 
ders  have  been  sent  to  the  Moon,  Venus,  Mars, 
and  several  smaller  bodies. 

The  most  convenient  way  to  describe  distances 
in  the  solar  system  is  to  use  astronomical  units 
(au),  the  mean  distance  of  the  Sun  and  Earth.  One 
auis  =  1.495  97870  x  1011  m  (see  Sect.  6.5).  The 
distance  to  the  nearest  star,  Proxima  Centauri  is 
over  270,000  au. 

Actually  one  au  is  the  semimajor  axis  of  a 
massless  planet  whose  orbital  period  equals  that 
of  the  Earth.  Since  also  the  mass  of  the  Earth  af¬ 
fects  its  motion,  the  semimajor  axis  of  the  Earth 
is  slightly  larger  than  one  au  (6.32).  The  small 
difference  is,  however,  important  only  in  precise 
calculations. 


7.1  Classification  of  Objects 

In  addition  to  the  Sun  and  Moon  five  objects  mov¬ 
ing  relative  to  the  stars  were  already  known  in 


the  antiquity:  Mercury,  Venus,  Mars,  Jupiter  and 
Saturn.  They  were  called  planets  from  the  Greek 
word  meaning  a  wanderer.  At  that  time  also  the 
Sun  and  Moon  were  considered  planets,  and  the 
names  of  these  seven  objects  are  still  reflected  in 
the  names  of  the  days  of  the  week. 

After  the  invention  of  the  telescope  three  more 
planets  were  found:  Uranus,  Neptune  and  Pluto. 
When  observational  instruments  and  methods 
have  evolved,  more  objects  orbiting  behind  Nep¬ 
tune  and  of  the  same  size  as  Pluto  have  been 
found.  Since  there  was  no  unambiguous  defini¬ 
tion  of  a  planet,  some  of  these  new  objects  should 
have  been  called  planets.  Therefore  the  Interna¬ 
tional  Astronomical  Union  (IAU)  in  its  General 
Assembly  in  2006  defined  three  distinct  cate¬ 
gories  to  clarify  the  situation. 

According  to  the  new  definition,  an  object  is 
a  planet  if  it  satisfies  the  following  three  condi¬ 
tions: 

(1)  It  orbits  the  Sun. 

(2)  It  has  sufficient  mass  for  its  self-gravity 
to  overcome  rigid  body  forces  so  that  it 
assumes  a  hydrostatic  equilibrium  (nearly 
round)  shape. 

(3)  Its  perturbations  have  cleared  away  other  ob¬ 
jects  in  the  neighbourhood  of  its  orbit. 

If  a  body  satisfies  the  conditions  (1)  and  (2)  but 
not  (3),  it  is  a  dwarf  planet.  Since  Pluto  does  not 
fulfil  the  last  requirement  its  status  was  reduced 
to  a  dwarf  planet. 

All  other  objects  orbiting  the  Sun  shall  be  re¬ 
ferred  to  collectively  as  Small  Solar  System  Bod¬ 
ies.  These  include  most  of  the  asteroids,  Trans- 
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Neptunian  Objects,  comets,  and  other  small  bod¬ 
ies.  If  the  central  body  is  not  the  Sun  the  object  is 
a  moon  or  a  satellite  independently  of  its  proper¬ 
ties. 

A  satellite  is  a  body  which  orbits  the  primary 
body  so  that  the  centre  of  mass  (barycentre)  is 
inside  the  primary.  If  this  is  not  the  case,  then  the 
system  is  called  a  binary  system.  For  example,  in 
the  case  of  the  Earth  and  Moon  the  barycentre  of 
the  system  is  inside  the  Earth,  and  the  Moon  is 
Earth’s  satellite.  In  the  Pluto-Charon  system  the 
centre  of  mass  is  outside  Pluto,  and  therefore  they 
are  called  a  binary  system. 

The  rules  (2)  and  (3)  are  somewhat  problem¬ 
atic.  How  can  the  shape  of  a  very  distant  object 
be  established?  How  wide  an  area  a  planet  should 
clean  to  be  a  planet,  and  how  clean  that  region 
should  be? 

Also  the  definition  of  a  satellite  is  slightly 
problematic,  since  the  largest  planets  are  sur¬ 
rounded  by  ring  systems  consisting  of  small  par¬ 
ticles.  What  is  the  distinction  between  the  largest 
ring  particles  and  smallest  satellites?  In  practice, 
an  object  could  be  considered  a  moon  if  it  has 
been  observed  as  a  separate  body  whose  orbital 
elements  have  been  determined.  In  any  case  the 
question  about  the  exact  number  of  satellites  is 
no  more  meaningful,  since  more  and  more  will  be 
found  when  observations  become  more  accurate. 
More  interesting  questions  are  e.g.  the  statistical 
properties  of  the  moons. 

Other  unresolved  questions  concern  the  classi¬ 
fication  of  the  minor  bodies  and  exoplanets.  Pos¬ 
sibly  the  definitions  need  some  fine-tuning  in  the 
future. 

According  the  current  definition  there  are  now 
eight  planets  in  the  solar  system:  Mercury ,  Venus , 
Earth ,  Mars ,  Jupiter ,  Saturn ,  Uranus ,  and  Nep¬ 
tune.  It  looks  quite  improbable  that  more  planets 
will  be  found. 

Currently  there  are  five  known  dwarf  planets: 
Pluto  that  was  earlier  a  planet,  the  first  asteroid 
Ceres  and  Haumea ,  Makemake  and  Eris  found 
in  2004-2005  orbiting  beyond  Pluto.  Similar  ob¬ 
jects  may  well  be  found  in  the  future.  The  prob¬ 
lem  is  to  check  whether  these  distant  bodies  sat¬ 
isfy  the  conditions  (2)  and  (3). 

The  solar  system  contains  a  vast  number 
of  different  small  bodies.  Traditionally,  they 
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have  been  divided  into  three  categories:  aster¬ 
oids,  comets  and  meteoroids.  Their  differences 
and  properties  are  discussed  in  more  detail  in 
Sects.  8.10-8.14. 

The  planets  from  Mercury  to  Saturn  are  bright 
and  well  visible  with  a  naked  eye.  Records  of 
them  are  found  even  in  the  most  ancient  writ¬ 
ten  documents.  Uranus  and  Neptune  can  be  seen 
with  a  pair  of  binoculars.  In  addition  to  the  bright 
planets,  only  the  brightest  comets  are  visible  with 
a  naked  eye. 

Gravitation  controls  the  motion  of  the  solar 
system  bodies.  The  planetary  orbits  around  the 
Sun  (Fig.  7.1)  are  almost  coplanar  ellipses  which 
deviate  only  slightly  from  circles.  Mercury,  the 
innermost  planet,  has  the  most  eccentric  orbit. 
The  orbital  planes  of  asteroids,  minor  bodies 
that  circle  the  Sun  mainly  between  the  orbits  of 
Mars  and  Jupiter,  are  often  more  tilted  than  the 
planes  of  the  planetary  orbits.  Asteroids  and  dis¬ 
tant  Trans-Neptunian  Objects  revolve  in  the  same 
direction  as  the  major  planets;  comets,  however, 
may  move  in  the  opposite  direction.  Cometary  or¬ 
bits  can  be  very  elongated,  even  hyperbolic.  Most 
of  the  satellites  circle  their  parent  planets  in  the 
same  direction  as  the  planet  moves  around  the 
Sun.  Only  the  motions  of  the  smallest  particles, 
gas  and  dust  are  affected  by  the  solar  wind,  radi¬ 
ation  pressure  and  magnetic  fields . 

The  planets  can  be  divided  into  inferior  and 
superior  planets  depending  on  their  orbits.  Mer¬ 
cury  and  Venus  are  inferior  planets  as  seen  from 
the  Earth,  and  planets  from  Mars  to  Neptune  are 
superior  planets. 

Depending  on  their  physical  properties  the 
planets  can  be  divided  into  different  groups 
(Fig.  7.2).  Mercury,  Venus,  Earth,  and  Mars  are 
called  terrestrial  (Earth-like)  planets;  they  have 
a  solid  surface,  are  of  almost  equal  size  (diam¬ 
eters  from  5000  to  12,000  km),  and  have  quite 
a  high  mean  density  (4000-5000  kgm-3;  the 
density  of  water  is  1000  kgm-3).  The  plan¬ 
ets  from  Jupiter  to  Neptune  are  called  Jovian 
(Jupiter-like)  or  giant  planets.  The  densities  of 
the  giant  planets  are  about  1000-2000  kgm-3, 
and  most  of  their  volume  is  liquid.  Diameters  are 
several  times  greater  than  those  of  the  terrestrial 
planets. 
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Fig.  7.1  (a)  Planetary  orbits  from  Mercury  to  Mars.  The 
dashed  line  represents  the  part  of  the  orbit  below  the  eclip¬ 
tic;  the  arrows  show  the  distances  travelled  by  the  planets 
during  one  month  (January  2000).  (b)  Planets  from  Jupiter 


to  Neptune  and  the  dwarf  planet  Pluto.  The  arrows  indi¬ 
cate  the  distances  travelled  by  the  planets  during  the  10 
year  interval  2000-2010 


Fig.  7.2  Major  planets  from  Mercury  to  Neptune.  Four 
innermost  planets  are  called  terrestrial  planets  and  four 
outermost  ones  are  giant  planets.  Three  dwarf  planets  are 


7.2  Planetary  Configurations 

To  the  naked  eye  planets  look  starlike  dots.  How¬ 
ever,  their  slow  motion  with  respect  to  the  stars 
reveals  that  they  are  bodies  of  our  solar  system. 


also  shown.  Relative  size  of  the  Sun  is  shown  at  left.  Plan¬ 
etary  distances  to  the  Sun  are  not  in  scale.  (The  Interna¬ 
tional  Astronomical  Union/Martin  Kornmesser) 


The  apparent  motions  of  the  planets  look  quite 
complicated,  partly  because  they  reflect  the  mo¬ 
tion  of  the  Earth  around  the  Sun  (Figs.  7.3  and 
7.4). 
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Fig.  7.3  (a)  Apparent  motion  of  Mars  during  the  2016 
opposition.  Usually  Mars  moves  in  the  prograde  direction 
(counterclockwise  with  respect  to  the  stars),  but  over  a 
month  before  and  after  the  opposition  the  motion  is  retro- 

Normally  the  planets  move  eastward  (< direct 
motion,  counterclockwise  as  seen  from  the  North¬ 
ern  hemisphere)  when  compared  with  the  stars. 
When  the  Earth  passes  a  superior  planet,  the  mo¬ 
tion  of  the  planet  reverses  to  the  opposite  or  ret¬ 
rograde  direction.  After  a  few  weeks  of  retro¬ 
grade  motion,  the  direction  is  changed  again,  and 
the  planet  continues  in  the  original  direction.  It  is 
quite  understandable  that  the  ancient  astronomers 
had  great  difficulties  in  explaining  and  modelling 
such  complicated  turns  and  loops.  Figure  7.5  ex¬ 
plains  some  basic  planetary  configurations. 

A  superior  planet  (planet  outside  the  orbit  of 
the  Earth)  is  said  to  be  in  opposition  when  it  is 
exactly  opposite  the  Sun,  i.e.  when  the  Earth  is 
between  the  planet  and  the  Sun.  When  the  planet 
is  behind  the  Sun,  it  is  in  conjunction .  In  prac¬ 


grade.  (b)  Relative  positions  of  the  Earth  and  Mars.  The 
projection  of  the  Earth-Mars  direction  on  the  infinitely 
distant  celestial  sphere  results  in  (a) 


tise,  the  planet  may  not  be  exactly  opposite  or 
behind  the  Sun  because  the  orbits  of  the  planet 
and  the  Earth  are  not  in  the  same  plane.  In  astro¬ 
nomical  almanacs  oppositions  and  conjunctions 
are  defined  in  terms  of  ecliptic  longitudes.  The 
longitudes  of  a  body  and  the  Sun  differ  by  1 80°  at 
the  moment  of  opposition;  in  conjunction  the  lon¬ 
gitudes  are  equal.  However,  the  right  ascension  is 
used  if  the  other  body  is  not  the  Sun.  Those  points 
where  the  apparent  motion  of  a  planet  changes  its 
direction  are  called  stationary  points.  Opposition 
occurs  in  the  middle  of  the  retrograde  loop. 

Inferior  planets  (Mercury  and  Venus)  are  never 
in  opposition.  The  configuration  when  the  plan¬ 
ets  is  between  the  Earth  and  the  Sun  is  called  in¬ 
ferior  conjunction.  The  conjunction  correspond¬ 
ing  to  that  of  a  superior  planet  is  called  an  upper 
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Fig.  7.4  Apparent  motions  of  the  Sun,  Mercury,  Venus 
and  Mars  in  1995  in  the  geocentric  frame  as  seen  from  the 
North  pole  of  the  ecliptic 


side  of  the  Sun  the  planet  is  seen.  The  planet  is 
an  “evening  star”  and  sets  after  the  Sun  when  it  is 
in  eastern  elongation;  in  western  elongation  the 
planet  is  seen  in  the  morning  sky  as  a  “morning 
star”. 

The  synodic  period  is  the  time  interval  be¬ 
tween  two  successive  events  (e.g.  oppositions). 
The  period  which  we  used  in  the  previous  chap¬ 
ters  is  the  sidereal  period ,  the  true  time  of  revolu¬ 
tion  around  the  Sun,  unique  for  each  object.  The 
synodic  period  depends  on  the  difference  of  the 
sidereal  periods  of  the  two  bodies. 

Let  the  sidereal  periods  of  two  planets  be  P\ 
and  P2  (assume  that  P\  <  P2).  Their  mean  an¬ 
gular  velocities  ( mean  motions)  are  In  /  P\  and 
In / P2.  After  one  synodic  period  P\  2,  the  inner 
planet  has  made  one  full  revolution  more  than 
the  outer  planet.  Thus  the  angles  travelled  by  the 
planets  are  related  by 
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Greatest 
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Fig.  7.5  Planetary  configurations.  The  angle  a  (Sun-ob- 
ject-Earth)  is  the  phase  angle  and  s  (Sun-Earth-object)  is 
the  elongation 


conjunction  or  superior  conjunction.  The  maxi¬ 
mum  (eastern  or  western)  elongation ,  i.e.  the  an¬ 
gular  distance  of  the  planet  from  the  Sun  is  28° 
for  Mercury  and  47°  for  Venus.  Elongations  are 
called  eastern  or  western,  depending  on  which 


2tx 

=  271  +  P\^  2  —  , 
Pi 


or 


-  •  V  /  •  JL  / 

Pi, 2  Pi  Pi 

Since  Mercury  and  Venus  move  inside  the  or¬ 
bit  of  the  Earth,  they  show  similar  phases  as  the 
Moon.  The  angle  Sun-planet-Earth  is  called  the 
phase  angle ,  often  denoted  by  the  Greek  letter  a. 
The  fraction  of  the  planetary  surface  seen  illumi¬ 
nated  depends  on  the  phase  angle.  For  Mercury 
and  Venus  the  phase  angle  can  have  any  value  be¬ 
tween  0°  and  180°.  This  means  that  we  can  see 
“full  Venus”  (when  it  is  behind  the  Sun),  “half 
Venus”,  and  so  on. 

The  phase  angle  range  of  the  superior  planets 
is  more  limited.  For  Mars  the  maximum  phase 
angle  is  41°,  for  Jupiter  11°,  and  for  Neptune 
only  2° . 


7.3  Orbit  of  the  Earth  and  Visibility 
of  the  Sun 

The  rotation  of  the  Earth  and  its  motion  around 
the  Sun  have  been  used  as  the  basis  of  time  reck¬ 
oning  since  prehistoric  times  (although  their  true 
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nature  was  not  known  then).  However,  these  mo¬ 
tions  are  not  free  from  perturbations.  Although 
our  modem  time  system  is  based  on  atomic 
clocks,  the  ordinary  time  we  use  in  our  clocks 
is  adjusted  to  the  rotation  of  the  Earth  by  leap 
seconds  (Sect.  2.14). 

The  sidereal  year  is  the  real  orbital  period 
of  the  Earth  around  the  Sun.  After  one  sidereal 
year,  the  Sun  is  seen  at  the  same  position  rela¬ 
tive  to  the  stars.  The  length  of  the  sidereal  year  is 
365.256363051  days  of  86,400  SI  seconds  at  the 
epoch  J2000.0  =  2000  January  1  12:00:00  TT. 

We  noted  earlier  that,  owing  to  precession,  the 
direction  of  the  vernal  equinox  moves  along  the 
ecliptic  at  about  50"  per  year.  This  means  that  the 
Sun  returns  to  the  vernal  equinox  before  one  com¬ 
plete  sidereal  year  has  elapsed.  This  time  interval, 
called  the  tropical  year ,  is  365.24218967  days. 

A  third  definition  of  the  year  is  based  on  the 
perihelion  passages  of  the  Earth.  Planetary  per¬ 
turbations  cause  a  gradual  change  in  the  direc¬ 
tion  of  the  Earth’s  perihelion.  The  time  inter¬ 
val  between  two  successie  perihelion  passages  is 
called  the  anomalistic  year ,  the  length  of  which 
is  365.259635864  days,  a  little  longer  than  the 
sidereal  year.  It  takes  about  21,000  years  for  the 
perihelion  to  revolve  360°  relative  to  the  vernal 
equinox. 

The  equator  of  the  Earth  is  tilted  about  23.4° 
with  respect  to  the  ecliptic.  Owing  to  perturba¬ 
tions,  this  angle  changes  with  time.  If  periodic 
terms  are  neglected,  the  obliquity  of  the  ecliptic  e 
can  be  calculated  as  in  Box  2.1.  The  obliquity 
varies  between  22.1°  and  24.5°  with  a  41,000 
year  periodicity.  At  present  the  tilt  is  decreasing. 
There  are  also  small  short  term  variations,  like  the 
nutation. 

The  declination  of  the  Sun  varies  between  —  e 
and  +s  during  the  year.  At  any  given  time,  the 
Sun  is  seen  at  zenith  from  one  point  on  the  surface 
of  the  Earth.  The  latitude  of  this  point  is  the  same 
as  the  declination  of  the  Sun.  At  the  latitudes  —  e 
(the  Tropic  of  Capricorn)  and  +£  (the  Tropic  of 
Cancer ),  the  Sun  is  seen  at  zenith  once  every 
year,  and  between  these  latitudes  twice  a  year. 
The  Sun  crosses  the  equator  at  vernal  and  autum¬ 
nal  equinox. 

In  the  Northern  hemisphere  the  Sun  will  not 
set  if  the  latitude  is  greater  than  90°  —  5,  where 
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8  is  the  declination  of  the  Sun.  The  southern¬ 
most  latitude  where  the  midnight  Sun  can  be  seen 
is  thus  90°  —  £  =  66.6°.  This  is  called  the  Arc¬ 
tic  Circle.  (The  same  holds  true  in  the  Southern 
hemisphere.)  The  Arctic  Circle  is  the  southern¬ 
most  place  where  the  Sun  is  (in  theory)  below 
the  horizon  during  the  whole  day  at  the  winter 
solstice.  The  sunless  time  lasts  longer  and  longer 
when  one  goes  north  (south  in  the  Southern  hemi¬ 
sphere).  At  the  poles,  day  and  night  last  half 
a  year  each.  In  practise,  refraction  and  location  of 
the  observing  site  will  have  a  large  influence  on 
the  visibility  of  the  midnight  Sun  and  the  number 
of  sunless  days.  Because  refraction  raises  objects 
seen  at  the  horizon,  the  midnight  Sun  can  be  seen 
a  little  further  south  than  at  the  Arctic  Circle.  For 
the  same  reason  the  Sun  can  be  seen  simultane¬ 
ously  at  both  poles  around  the  time  of  vernal  and 
autumnal  equinox.  (See  also  Sect.  2.6.) 

The  eccentricity  of  the  Earth’s  orbit  is  about 
0.0167.  The  distance  from  the  Sun  varies  between 
147-152  million  km.  The  flux  density  of  solar  ra¬ 
diation  varies  somewhat  at  different  parts  of  the 
Earth’s  orbit,  but  this  has  practically  no  effect  on 
the  seasons.  In  fact  the  Earth  is  at  perihelion  in  the 
beginning  of  January,  in  the  middle  of  the  north¬ 
ern  hemisphere’s  winter. 

The  seasons  are  due  to  the  obliquity  of  the 
ecliptic  which  affects  the  energy  received  from 
the  Sun  depends  in  three  different  ways.  First  the 
flux  per  unit  area  is  proportional  to  sin  a,  where 
a  is  the  altitude  of  the  Sun.  In  summer  the  alti¬ 
tude  can  have  greater  values  than  in  winter,  giv¬ 
ing  more  energy  per  unit  area.  Another  effect  is 
due  to  the  atmosphere:  when  the  Sun  is  near  the 
horizon,  the  radiation  must  penetrate  thick  atmo¬ 
spheric  layers.  This  means  large  extinction  and 
less  radiation  at  the  surface.  The  third  factor  is  the 
length  of  the  time  the  Sun  is  above  the  horizon. 
This  is  important  at  high  latitudes,  where  the  low 
altitude  of  the  Sun  is  compensated  by  the  long 
daylight  time  in  summer.  This  last  effect  is  ne¬ 
glected  in  many  textbooks  written  by  authors  not 
living  this  far  north.  These  effects  are  discussed 
in  detail  in  Example  7.2. 

There  are  also  long-term  variations  in  the  an¬ 
nual  Solar  flux.  Serbian  geophysicist  Milutin  Mi- 
lankovic  (1879-1958)  published  in  the  1930’s 
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and  1940’s  his  theory  of  ice  ages.  During  the  last 
2-3  million  years,  large  ice  ages  have  recurred 
approximately  every  100,000  years.  He  proposed 
that  variations  of  the  Earth’s  orbit  cause  long¬ 
term  periodic  climate  change,  now  known  as  Mi- 
lankovic  cycles.  Milankovic  claimed  that  the  cy¬ 
cles  in  eccentricity,  direction  of  the  perihelion, 
obliquity,  and  precession  result  in  100,000  year 
ice  age  cycle.  The  cycle  of  precession  is  26,000 
years,  direction  of  the  perihelion  relative  to  the 
equinoxes  is  22,000  years,  and  the  obliquity  of 
the  ecliptic  has  a  41,000  year  cycle.  Changes 
in  orbital  eccentricity  are  not  fully  periodic  but 
some  periods  above  100,000  years  can  be  found. 
The  eccentricity  varies  between  0.005-0.058  and 
is  currently  0.0167. 

The  annual  incoming  Solar  flux  varies  with 
these  orbital  changes  and  the  effect  is  largest  at 
high  latitudes.  If,  for  example,  the  eccentricity  is 
high,  and  the  Earth  is  near  the  aphelion  during  the 
hemisphere’s  winter,  then  winters  are  long  and 
cold  and  summers  are  short.  However,  the  the¬ 
ory  is  controversial,  orbital  forcing  on  the  climate 
change  is  not  well  understood,  and  probably  not 
enough  to  trigger  glaciation.  There  exist  also  pos¬ 
itive  feedback  loops,  like  the  effect  of  low  albedo 
of  snow  and  ice.  It  means  that  ice  reflects  more  ra¬ 
diation  back  into  space,  thus  cooling  the  climate. 
The  system  is  highly  chaotic  so  that  even  minor 
changes  in  the  primary  conditions  will  result  in 
large  differences  in  the  outcome.  There  are  also 
other  effects  causing  climate  change,  including 
emerging  gases  from  large  lava  flows  and  erup¬ 
tions  of  volcanos  and,  nowadays,  anthropogenic 
reasons. 

The  future  is  also  uncertain.  Some  theories 
predict  that  the  warm  period  will  continue  next 
50,000  years,  whereas  others  conclude  that  the 
climate  is  already  cooling.  Anthropogenic  rea¬ 
sons,  like  ever  increasing  fraction  of  green  house 
gases,  e.g.  carbon  dioxide,  will  change  the  short¬ 
term  predictions. 


7.4  The  Orbit  of  the  Moon 

The  Earth’s  satellite,  the  Moon ,  circles  the  Earth 
counterclockwise.  One  revolution,  the  sidereal 


month ,  takes  about  27.322  days.  In  practise, 
a  more  important  period  is  the  synodic  month , 
the  duration  of  the  Lunar  phases  (e.g.  from  full 
moon  to  full  moon).  In  the  course  of  one  sidereal 
month  the  Earth  has  travelled  almost  1/12  of  its 
orbit  around  the  Sun.  The  Moon  still  has  about 
1/12  of  its  orbit  to  go  before  the  Earth-Moon- 
Sun  configuration  is  again  the  same.  This  takes 
about  2  days,  so  the  phases  of  the  Moon  are  re¬ 
peated  every  29  days.  More  exactly,  the  length  of 
the  synodic  month  is  29.531  days. 

The  new  moon  is  that  instant  when  the  Moon 
is  in  conjunction  with  the  Sun.  Almanacs  define 
the  phases  of  the  Moon  in  terms  of  ecliptic  lon¬ 
gitudes;  the  longitudes  of  the  new  moon  and  the 
Sun  are  equal.  Usually  the  new  moon  is  slightly 
north  or  south  of  the  Sun  because  the  lunar  orbit 
is  tilted  5°  with  respect  to  the  ecliptic. 

About  2  days  after  the  new  moon,  the  wax¬ 
ing  crescent  moon  can  be  seen  in  the  western 
evening  sky.  About  1  week  after  the  new  moon, 
the  first  quarter  follows,  when  the  longitudes  of 
the  Moon  and  the  Sun  differ  by  90° .  The  right  half 
of  the  Moon  is  seen  lit  (left  half  when  seen  from 
the  Southern  hemisphere).  Th t  full  moon  appears 
a  fortnight  after  the  new  moon,  and  1  week  af¬ 
ter  this  the  last  quarter.  Finally  the  waning  cres¬ 
cent  moon  disappears  in  the  glory  of  the  morning 
sky. 

The  orbit  of  the  Moon  is  approximately  ellip¬ 
tic.  The  length  of  the  semimajor  axis  is 
384,400  km  and  the  eccentricity  0.055.  Owing 
to  perturbations  caused  mainly  by  the  Sun,  the 
orbital  elements  vary  with  time.  The  minimum 
distance  of  the  Moon  from  the  centre  of  the 
Earth  is  356,400  km,  and  the  maximum  distance 
406,700  km.  This  range  is  larger  than  the  one  cal¬ 
culated  from  the  semimajor  axis  and  the  eccen¬ 
tricity.  The  apparent  angular  diameter  is  in  the 
range  29.4/-33.5/. 

The  rotation  time  of  the  Moon  is  equal  to  the 
sidereal  month,  so  the  same  side  of  the  Moon  al¬ 
ways  faces  the  Earth.  Such  synchronous  rotation 
is  common  among  the  satellites  of  the  solar  sys¬ 
tem:  almost  all  large  moons  rotate  synchronously. 

The  orbital  speed  of  the  Moon  varies  accord¬ 
ing  to  Kepler’s  second  law.  The  rotation  period, 
however,  remains  constant.  This  means  that,  at 


148 


7  The  Solar  System 


Fig.  7.6  Librations  of  the  Moon  can  be  seen  in  this  pair  of  photographs  taken  when  the  Moon  was  close  to  the  perigee 
and  the  apogee,  respectively.  (Helsinki  University  Observatory) 


different  phases  of  the  lunar  orbit,  we  can  see 
slightly  different  parts  of  the  surface.  When  the 
Moon  is  close  to  its  perigee,  its  speed  is  greater 
than  average  (and  thus  greater  than  the  mean  ro¬ 
tation  rate),  and  we  can  see  more  of  the  right-hand 
edge  of  the  Moon’s  limb  (as  seen  from  the  North¬ 
ern  hemisphere).  Correspondingly,  at  the  apogee 
we  see  “behind”  the  left  edge.  Owing  to  this  li- 
bration ,  a  total  of  59  %  of  the  surface  area  can 
be  seen  from  the  Earth  (Fig.  7.6).  The  libration  is 
quite  easy  to  see  if  one  follows  some  detail  at  the 
edge  of  the  lunar  limb. 

There  are  also  two  other  factors  causing  libra¬ 
tion.  When  the  Moon  is  rising  from  the  East,  we 
can  see  a  little  behind  the  right  edge,  and  when 
the  Moon  is  setting,  behind  the  left  edge.  The 
third  effect  is  the  latitudinal  libration.  Since  the 
orbit  of  the  Moon  is  not  quite  in  the  equatorial 
plane,  we  can  see  behind  the  northern  or  southern 
pole  depending  on  the  position  of  the  Moon. 

The  orbital  plane  of  the  Moon  is  tilted  only 
about  5°  to  the  ecliptic.  Therefore  the  Moon  is 
always  close  to  the  ecliptic  just  like  the  Sun  and 


the  planets.  However,  the  orbital  plane  changes 
gradually  with  time,  owing  mainly  to  the  pertur¬ 
bations  caused  by  the  Earth  and  the  Sun.  These 
perturbations  cause  the  nodal  line  (the  intersec¬ 
tion  of  the  plane  of  the  ecliptic  and  the  orbital 
plane  of  the  Moon)  to  make  one  full  revolu¬ 
tion  in  18.6  years.  We  have  already  encountered 
the  same  period  in  the  nutation.  When  the  as¬ 
cending  node  of  the  lunar  orbit  is  close  to  the 
vernal  equinox,  the  Moon  can  be  23.4°  +5°  = 
28.4°  north  or  south  of  the  equator.  When  the  de¬ 
scending  node  is  close  to  the  vernal  equinox,  the 
zone  where  the  Moon  can  be  found  extends  only 
23.4°  —  5°  =  18.4°  north  or  south  of  the  equator. 

The  nodical  or  draconic  month  is  the  time  in 
which  the  Moon  moves  from  one  ascending  node 
back  to  the  next  one.  Because  the  line  of  nodes  is 
rotating,  the  nodical  month  is  3  hours  shorter  than 
the  sidereal  month,  i.e.  27.212  days.  The  orbital 
ellipse  itself  also  precesses  slowly.  The  orbital 
period  from  perigee  to  perigee,  the  anomalistic 
month ,  is  5.5  h  longer  than  the  sidereal  month,  or 
about  27.555  days. 
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Gravitational  differences  caused  by  the  Moon 
and  the  Sun  on  different  parts  of  the  Earth’s  sur¬ 
face  give  rise  to  the  tides.  Gravitation  is  greatest 
at  the  sub-lunar  point  and  smallest  at  the  opposite 
side  of  the  Earth.  At  these  points,  the  surface  of 
the  seas  is  highest  (high  tide,  flood).  About  6  h 
after  flood,  the  surface  is  lowest  (low  tide,  ebb). 
The  tide  generated  by  the  Sun  is  less  than  half  of 
the  lunar  tide.  When  the  Sun  and  the  Moon  are 
in  the  same  direction  with  respect  to  the  Earth 
(new  moon)  or  opposite  each  other  (full  moon), 
the  tidal  effect  reaches  its  maximum;  this  is  called 
spring  tide. 

The  sea  level  typically  varies  1  m,  but  in  some 
narrow  straits,  the  difference  can  be  as  great  as 
15  m.  Due  to  the  irregular  shape  of  the  oceans, 
the  true  pattern  of  the  oceanic  tide  is  very  com¬ 
plicated. 

The  solid  surface  of  the  Earth  also  suffers  tidal 
effects,  but  the  amplitude  is  much  smaller,  about 
30  cm. 

Tides  generate  friction,  which  dissipates  the 
rotational  and  orbital  kinetic  energy  of  the  Earth- 
Moon  system.  This  energy  loss  induces  some 
changes  in  the  system.  First,  the  rotation  of  the 
Earth  slows  down  until  the  Earth  also  rotates  syn¬ 
chronously,  i.e.  the  same  side  of  Earth  will  always 
face  the  Moon.  Secondly,  the  semimajor  axis  of 
the  orbit  of  the  Moon  increases,  and  the  Moon 
drifts  away  about  3  cm  per  year. 


7.5  Eclipses  and  Occultations 

An  eclipse  is  an  event  in  which  a  body  goes 
through  the  shadow  of  another  body.  The  most 
frequently  observed  eclipses  are  the  lunar  eclipses 
and  the  eclipses  of  the  large  satellites  of  Jupiter. 
An  occultation  takes  place  when  an  occulting 
body  goes  in  front  of  another  object;  typical 
examples  are  stellar  occultations  caused  by  the 
Moon.  Generally,  occultations  can  be  seen  only 
in  a  narrow  strip;  an  eclipse  is  visible  wherever 
the  body  is  above  the  horizon. 

Solar  and  lunar  eclipses  are  the  most  spec¬ 
tacular  events  in  the  sky.  A  solar  eclipse  occurs 
when  the  Moon  is  between  the  Earth  and  the  Sun 
(Fig.  7.7).  (According  to  the  definition,  a  solar 
eclipse  is  not  an  eclipse  but  an  occultation!)  If  the 
whole  disk  of  the  Sun  is  behind  the  Moon,  the 
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Fig.  7.7  (a)  A  total  solar  eclipse  can  be  seen  only  inside 
a  narrow  strip;  outside  the  zone  of  totality  the  eclipse  is 
partial,  (b)  An  eclipse  is  annular  if  the  Moon  is  at  apogee 
from  where  the  shadow  of  the  Moon  does  not  reach  the 
Earth,  (c)  A  lunar  eclipse  is  visible  everywhere  where  the 
Moon  is  above  the  horizon 


Fig.  7.8  The  total  eclipse  of  the  Sun  occurred  in  1990 
over  Finland.  (Photo  Math  Martikainen) 


eclipse  is  total  (Fig.  7.8);  otherwise,  it  is  partial. 
If  the  Moon  is  close  to  its  apogee,  the  apparent 
diameter  of  the  Moon  is  smaller  than  that  of  the 
Sun,  and  the  eclipse  is  annular. 

If  the  orbital  plane  of  the  Moon  coincided  with 
the  plane  of  the  ecliptic,  one  solar  and  one  lunar 
eclipse  would  occur  every  synodic  month.  How¬ 
ever,  the  plane  is  tilted  about  5°;  therefore,  at  full 
moon,  the  Moon  must  be  close  to  the  nodes  for 
an  eclipse  to  occur.  The  angular  distance  of  the 
Moon  from  the  node  must  be  smaller  than  4.6° 
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for  a  total  lunar  eclipse,  and  10.3°  for  a  total  so¬ 
lar  eclipse. 

Two  to  seven  eclipses  occur  annually.  Usually 
eclipses  take  place  in  a  set  of  1-3  eclipses,  sepa¬ 
rated  by  an  interval  of  173  days.  In  one  set  there 
can  be  just  one  solar  eclipse  or  a  succession  of 
solar,  lunar  and  another  solar  eclipse.  In  one  year, 
eclipses  belonging  to  2  or  3  such  sets  can  take 
place. 

The  Sun  and  the  (ascending  or  descending) 
node  of  the  lunar  orbit  are  in  the  same  direction 
once  every  346.62  days.  Nineteen  such  periods 
(=  6585.78  days  =  18  years  11  days)  are  very 
close  to  the  length  of  223  synodic  months.  This 
means  that  the  Sun-Moon  configuration  and  the 
eclipses  are  repeated  in  the  same  order  after  this 
period,  called  for  historical  reasons  the  Saros  pe¬ 
riod;  it  was  already  known  to  the  ancient  Babylo¬ 
nians. 

A  solar  eclipse  is  total  if  the  whole  solar  disk 
is  hidden  behind  the  Moon.  In  a  partial  eclipse 
only  a  part  of  the  Sun  is  covered.  If  the  Moon  is 
close  to  its  apogee  its  apparent  diameter  is  smaller 
than  the  diameter  of  the  Sun,  and  the  eclipse  is 
annular. 

During  a  solar  eclipse  the  shadow  of  the  Moon 
on  Earth’s  surface  is  always  less  than  270  km 
wide.  The  shadow  moves  at  least  34  km/min;  thus 
the  maximum  duration  of  a  total  eclipse  is  l\ 
minutes.  On  both  sides  of  the  track  of  the  total 
eclipse  a  partial  eclipse  can  be  seen. 

Also  lunar  eclipses  can  be  of  different  kinds. 
A  lunar  eclipse  is  total  if  the  Moon  is  entirely  in¬ 
side  the  umbral  shadow  of  the  Earth;  otherwise 
the  eclipse  is  partial.  If  the  Moon  does  not  hit 
the  umbral  shadow  but  receives  some  sunlight, 
the  eclipse  is  difficult  to  see  with  the  unaided  eye 
because  the  lunar  magnitude  remains  almost  un¬ 
changed. 

A  lunar  eclipse  can  be  seen  on  the  whole  hemi¬ 
sphere  where  the  Moon  is  above  the  horizon.  The 
maximum  duration  of  a  lunar  eclipse  is  3.8  h,  and 
the  duration  of  the  total  phase  is  always  shorter 
than  1.7  h.  During  the  total  phase  the  Moon  is 
coloured  deep  red  because  some  red  light  is  re¬ 
fracted  through  the  Earth’s  atmosphere. 

The  Moon  moves  eastwards,  and  stars  are  oc¬ 
culted  by  the  dark  edge  of  the  Moon  during 
the  first  quarter.  Therefore  occultation  is  easier 


Fig.  7.9  Transit  of  Venus  was  seen  in  June  6,  2004.  Venus 
is  seen  as  a  small  dark  disc  in  front  of  the  Sun.  (Photo  H. 
Karttunen) 

to  observe,  and  photometric  measurements  are 
possible  Observations  of  the  stellar  occultations 
caused  by  the  Moon  formerly  served  as  an  accu¬ 
rate  method  for  determining  the  lunar  orbit.  Be¬ 
cause  the  Moon  has  no  atmosphere,  a  star  disap¬ 
pears  abruptly  in  less  than  1/50  s.  If  a  fast  pho¬ 
tometer  is  used  for  recording  the  event,  the  typical 
diffraction  pattern  can  be  seen.  The  shape  of  the 
diffraction  was  used  to  determine  angular  diame¬ 
ters  of  stars  ans  separations  of  binary  stars.  In  the 
first  decades  of  radio  astronomy  the  occultations 
of  some  radio  sources  were  used  for  determining 
their  exact  positions. 

There  are  some  bright  stars  and  planets  inside 
the  11°  wide  zone  where  the  Moon  moves,  but  the 
occultation  of  a  bright,  naked-eye  object  is  quite 
rare. 

Occultations  are  also  caused  by  planets  and  as¬ 
teroids.  Accurate  predictions  are  complicated  be¬ 
cause  such  an  event  is  visible  only  in  a  very  nar¬ 
row  path.  The  Uranian  rings  were  found  during 
an  occultation  in  1977,  and  the  shapes  of  some  as¬ 
teroids  have  been  studied  during  some  favourable 
events,  timed  exactly  by  several  observers  located 
along  the  predicted  path. 

A  transit  is  an  event  in  which  Mercury  or 
Venus  moves  across  the  Solar  disk  as  seen  from 
the  Earth  (Fig.  7.9).  A  transit  can  occur  only  when 
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the  planet  is  close  to  its  orbital  node  at  the  time 
of  inferior  conjunction.  Transits  of  Mercury  oc¬ 
cur  about  13  times  per  century;  transits  of  Venus 
only  twice.  The  next  transits  of  Mercury  are: 
Nov  11,  2019;  Nov  13,  2032  and  Nov  7,  2039. 
The  next  transits  of  Venus  are:  Dec  11,  2117; 
Dec  8,  2125  and  Jun  11,  2247.  In  the  18th  cen¬ 
tury  the  two  transits  of  Venus  (1761  and  1769) 
were  used  for  determining  the  value  of  the  astro¬ 
nomical  unit. 


7.6  The  Structure  and  Surfaces  of 
Planets 

Since  the  1960’s  a  vast  amount  of  data  have  been 
collected  using  spacecraft,  either  during  a  flyby, 
orbiting  a  body,  or  directly  landing  on  the  surface. 
This  gives  a  great  advantage  compared  to  other 
astronomical  observations.  We  may  even  speak  of 
revolution:  the  solar  system  bodies  have  turned 
from  astronomical  objects  to  geophysical  ones. 
Many  methods  traditionally  used  in  various  sib¬ 
ling  branches  of  geophysics  can  now  be  applied 
to  planetary  studies. 

The  perturbations  in  the  orbit  of  a  satellite  or 
spacecraft  can  be  used  in  studying  the  internal 
structure  of  a  planet.  Any  deviation  from  spher¬ 
ical  symmetry  is  visible  in  the  external  gravita¬ 
tional  field.  The  shape  and  irregularities  of  the 
gravitation  field  generated  by  a  planet  reflect  its 
shape,  internal  structure  and  mass  distribution. 
Also  the  surface  gives  certain  indications  on  in¬ 
ternal  structure  and  processes. 

The  IAU  planet  definition  states  that  planets 
are  bodies  in  hydrostatic  equilibrium.  Gravity  of 
a  body  will  pull  its  material  inwards,  but  the  body 
resist  the  pull  if  the  strength  of  the  material  is 
greater  than  the  pressure  exerted  by  the  overly¬ 
ing  layers.  If  the  diameter  is  larger  than  about 
800-1000  km,  gravity  is  able  to  deform  rocky 
bodies  into  spherical  shape.  Smaller  bodies  than 
this  have  irregular  shapes.  On  the  other  hand,  e.g. 
icy  moons  of  Saturn  are  spherical  because  ice  is 
more  easily  deformed  than  rock. 

Hydrostatic  equilibrium  means  that  the  sur¬ 
face  of  the  body  approximately  follows  an  equi- 
potential  surface  of  gravity.  This  is  true  e.g.  on  the 
Earth,  where  the  sea  surface  very  closely  follows 


the  equipotential  surface  called  the  geoid.  Due  to 
internal  strength  of  rocks,  continents  can  deviate 
from  the  geoid  surface  by  a  few  kilometers  but 
compared  to  the  diameter  of  the  Earth  the  surface 
topography  is  negligible. 

A  rotating  planet  is  always  flattened.  The 
amount  of  flattening  depends  on  the  rotation  rate 
and  the  strength  of  the  material;  a  liquid  drop  is 
more  easily  deformed  than  a  rock.  The  shape  of 
a  rotating  body  in  hydrostatic  equilibrium  can  be 
derived  from  the  equations  of  motion.  If  the  ro¬ 
tation  rate  is  moderate,  the  equilibrium  shape  of 
a  liquid  body  is  an  ellipsoid  of  revolution,  rotat¬ 
ing  around  its  shortest  axis. 

If  Rq  and  Rp  are  the  equatorial  and  polar  radii, 
respectively,  the  shape  of  the  planet  can  be  ex¬ 
pressed  as 
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The  dynamical  flattening,  denoted  by  /  is  defined 
as 


(7.3) 


Because  Re>  Rp,  the  flattening  /  is  always  pos¬ 
itive. 

The  giant  planets  are  in  practise  close  to  hy¬ 
drostatic  equilibrium,  and  their  shape  is  deter¬ 
mined  by  the  rotation.  The  rotation  period  of  Sat¬ 
urn  is  only  10.5  h,  and  its  dynamical  flattening  is 
1/10  which  is  easily  visible. 

Asteroids  and  other  minor  bodies  are  so  small 
that  they  are  not  flattened  by  rotation.  However, 
there  is  an  upper  limit  for  a  rotation  rate  of  an 
asteroid  before  it  breaks  apart  due  to  centrifugal 
forces.  If  we  assume  that  the  body  is  held  together 
only  by  gravity,  we  can  approximate  the  maxi¬ 
mum  rotation  rate  by  setting  the  centrifugal  force 
equal  to  the  gravitational  force: 


G  Mm  mv2 
R2  -R- 


(7.4) 


where  m  is  a  small  test  mass  on  the  surface  at  a 
distance  of  R  from  the  center  of  the  body.  Substi¬ 
tuting  the  rotation  period  P , 
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we  get 

GM  _  4 tt2R 

~W  =  ~P 2“’ 

or 

I  R 3  ri  [3n 

P  =  2ttJ - =  2tt  / - =1  — .  (7.5) 

V  GM  y  471  Gp  y  Gp 

If  we  substitute  the  density  p  with  the  mean  den¬ 
sity  of  terrestrial  rocks,  i.e.  2700  kgm-3,  we  get 
for  the  minimum  rotation  period  P  ^2  hours. 

The  structure  of  the  terrestrial  planets 
(Fig.  7.10)  can  also  be  studied  with  seismic 
waves.  The  waves  formed  in  an  earthquake  are 
reflected  and  refracted  inside  a  planet  like  any 
other  wave  at  the  boundary  of  two  different  lay¬ 
ers.  The  waves  are  longitudinal  or  transversal 
(P  and  S  waves ,  respectively).  Both  can  propagate 
in  solid  materials  such  as  rock.  However,  only  the 
longitudinal  wave  can  penetrate  liquids.  One  can 
determine  whether  a  part  of  the  interior  material 
is  in  the  liquid  state  and  where  the  boundaries 
of  the  layers  are  by  studying  the  recordings  of 
seismometers  placed  on  the  surface  of  a  planet. 
Naturally  the  Earth  is  the  best-known  body,  but 
quakes  of  the  Moon,  Venus,  and  Mars  have  also 
been  observed. 

The  interior  temperatures  of  the  planets  are 
considerably  larger  than  the  surface  temperatures. 
For  example,  the  temperature  in  the  Earth’s  core 


Fig.  7.10  Internal  structure  and  relative  sizes  of  the  ter¬ 
restrial  planets.  The  percentage  shows  the  volume  of  the 
core  relative  to  the  total  volume  of  the  planet.  In  the  case 
of  the  Earth,  the  percentage  includes  both  the  outer  and 
the  inner  core 
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is  about  4500-5000  K,  and  in  the  core  of  Jupiter 
about  30,000  K. 

A  part  of  that  heat  is  the  remnant  of  the  re¬ 
leased  potential  energy  from  the  gravitational 
contraction  during  the  formation  of  planets.  De¬ 
cay  of  radioactive  isotopes  also  releases  heat. 
Soon  after  the  formation  of  planets  intense  me¬ 
teorite  bombardment  was  an  important  source  of 
heat.  Together  with  heat  from  short-lived  radioac¬ 
tive  isotopes  this  caused  melting  of  terrestrial 
planets.  The  planets  were  differentiated :  the  orig¬ 
inally  relatively  homogeneous  material  became 
segregated  into  layers  of  different  chemical  com¬ 
position.  The  heaviest  elements  sank  into  centre 
thus  forming  a  dense  core. 

The  terrestrial  planets  have  an  iron-nickel 
core.  Mercury  has  the  relatively  largest  core; 
Mars  the  smallest.  The  density  of  the  core  is 
around  10,000  kgm-3.  The  Fe-Ni  core  is  sur¬ 
rounded  by  a  mantle ,  composed  of  silicates  (com¬ 
pounds  of  silicon).  The  density  of  the  outermost 
layers  is  about  3000  kgm-3.  The  mean  density 
of  the  terrestrial  planets  is  3500-5500  kgm-3. 
A  thin  crust  is  the  outermost  layer.  The  crust 
and  the  upper  mantle  form  a  solid  lithosphere , 
and  the  partly  molten  layer  below  that  is  the  as- 
tenosphere.  Mean  densities  of  planets  are  3500- 
5500  kgm-3,  but  the  density  of  the  surface  ma¬ 
terial  is  less  than  300  kg  m-3. 

Rock  is  not  a  good  heat  conductor,  but  in  ter¬ 
restrial  planets  conduction  is  the  only  important 
method  to  transfer  from  the  interior  to  the  sur¬ 
face.  Heat  transfer  by  convection  depends  on  the 
viscosity  material  and  the  temperature  gradient. 
In  the  mantle  of  the  Earth  slow  vertical  convec¬ 
tive  flows  occur  below  a  few  hundred  kilome¬ 
tres.  These  flows  drive  e.g.  the  motion  of  the  tec¬ 
tonic  plates.  Continental  drift  gives  rise,  for  ex¬ 
ample,  to  mountain  formation.  The  Earth  is  the 
only  planet  where  plate  tectonics  is  active  today. 
On  other  terrestrial  planets  the  process  has  either 
ceased  long  ago  or  has  never  occurred. 

The  giant  planets  (Fig.  7.11)  do  not  have 
a  similar  isolating  outer  layer,  and  convection 
can  transfer  heat  all  the  way  to  the  surface. 
Thus  the  giant  planets  radiate  more  energy  than 
they  receive  from  the  Sun.  Saturn  radiates  about 
2.8  times  the  heat  it  gets  from  the  Sun,  more  than 
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Jupiter 


Fig.  7.11  Internal  structure  and  relative  sizes  of  the  gi¬ 
ant  planets.  Differences  in  size  and  distance  from  the  Sun 
cause  differences  in  the  chemical  composition  and  internal 
structure.  Due  to  smaller  size,  Uranus  and  Neptune  do  not 
have  any  layer  of  metallic  hydrogen.  The  Earth  is  shown 
in  scale 

any  other  planet.  This  heat  is  suspected  to  origi¬ 
nate  from  the  separation  of  hydrogen  and  helium, 
when  the  heavier  helium  is  gradually  sinking  to¬ 
ward  the  centre  of  the  planet.  In  Jupiter  the  heat 
is  mainly  remnant  heat  from  the  time  the  planet 
was  born. 

The  mean  densities  of  the  giant  planets  are 
quite  low;  the  density  of  Saturn,  for  example,  is 
only  700  kgm~3.  (If  Saturn  were  put  in  a  gigan¬ 
tic  bathtub,  it  would  float  on  the  water!)  Most  of 
the  volume  of  a  giant  planet  is  a  mixture  of  hy¬ 
drogen  and  helium.  In  the  centre,  there  is  possibly 
a  silicate  core,  the  mass  of  which  is  a  few  Earth 
masses.  The  core  is  surrounded  by  a  layer  of 
metallic  hydrogen.  Due  to  the  extreme  pressure, 
hydrogen  is  not  in  its  normal  molecular  form  H2, 
but  dissociated  into  atoms.  In  this  state,  hydrogen 
is  electrically  conducting.  The  magnetic  fields  of 
the  giant  planets  may  originate  in  the  layer  of 
metallic  hydrogen. 

Closer  to  the  surface,  the  pressure  is  lower  and 
hydrogen  is  in  molecular  form.  The  relative  thick¬ 
ness  of  the  layers  of  metallic  and  molecular  hy¬ 
drogen  vary  from  planet  to  planet.  Uranus  and 
Neptune  may  not  have  any  layer  of  metallic  hy¬ 
drogen  because  their  internal  pressure  is  too  low 
for  dissociation  of  the  hydrogen.  Instead,  a  layer 
of  “ices”  surround  the  core.  This  is  a  layer  of 
a  water-dominant  mixture  of  water,  methane  and 


ammonia.  Under  the  high  pressure  and  tempera¬ 
ture  the  mixture  is  partly  dissolved  into  its  com¬ 
ponents  and  it  behaves  more  like  a  molten  salt  and 
it  is  also  electrically  conductive  like  the  metallic 
hydrogen. 

On  top  of  everything  is  a  gaseous  atmosphere, 
only  a  few  hundred  kilometres  thick.  The  clouds 
at  the  top  of  the  atmosphere  form  the  visible  “sur¬ 
face”  of  the  giant  planets. 

Planetary  surfaces  are  modified  by  several 
geological  processes.  These  include  continental 
drift ,  volcanism ,  meteorite  impacts  and  climate. 
The  Earth  is  an  example  of  a  body  whose  surface 
has  been  renewed  many  times  during  past  aeons. 
The  age  of  the  surface  depends  on  the  processes 
and  thus  implies  the  geological  evolutionary  his¬ 
tory  of  the  planet. 

Volcanism  is  a  minor  factor  on  the  Earth  (at 
least  now),  but  the  surface  of  the  Jovian  moon  Io 
is  changing  rapidly  due  to  violent  volcanic  erup¬ 
tions  (Fig.  7.12).  Volcanism  on  Io  is  caused  by 
frictional  heating  by  tides.  Volcanoes  have  been 
observed  also  on  Mars  and  Venus,  but  not  on  the 
Moon. 

Lunar  craters  are  meteorite  impact  craters, 
common  on  almost  every  body  with  a  solid  sur¬ 
face.  Meteorites  are  bombarding  the  planets  con¬ 
tinuously,  but  the  rate  has  been  diminishing  since 
the  beginnings  of  the  solar  system.  The  number 
of  impact  craters  reflects  the  age  of  the  surface 
(Figs.  7.13  and  7.14). 

The  Jovian  moon  Callisto  is  an  example  of 
a  body  with  an  ancient  surface  which  is  not 
fully  inactive.  Lack  of  small  craters  indicates 
some  resurfacing  process  filling  and  degrading 
the  minor  surface  features.  The  Earth  is  an  ex¬ 
ample  of  a  body,  whose  atmosphere  both  pro¬ 
tects  the  surface  and  destroys  the  traces  of  im¬ 
pacts.  All  smaller  meteorites  are  burned  to  ashes 
in  the  atmosphere  (one  need  only  note  the  num¬ 
ber  of  shooting  stars),  and  some  larger  bodies  are 
bounced  back  to  outer  space.  The  traces  on  the 
surface  are  destroyed  very  quickly  by  erosion  in 
less  than  a  few  million  years.  Venus  is  an  even 
more  extreme  case  where  all  small  craters  are 
missing  due  to  a  thick  protective  atmosphere. 

Climate  has  the  greatest  influence  on  the  Earth 
and  Venus.  Both  planets  have  a  thick  atmosphere. 
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Fig.  7.1 2  An  example  of 
resurfacing.  Two  volcanic 
plumes  on  Jupiter’s  moon 
Io  observed  by  Galileo 
spacecraft  in  1997.  One 
plume  was  captured  on  the 
bright  limb  or  edge  of  the 
moon  ( inset  at  upper 
right),  erupting  over  a 
caldera  named  Pillan 
Patera.  The  plume  is  140 
kilometers  high.  The 
second  plume,  seen  near 
the  terminator,  is  called 
Prometheus.  The  shadow 
of  the  75  km  high  airborne 
plume  can  be  seen 
extending  to  the  right  of  the 
eruption  vent.  (NASA/JPL) 


On  Mars,  powerful  dust  storms  deform  the  land¬ 
scape,  too,  often  covering  the  planet  with  yellow¬ 
ish  dust  clouds. 


7.7  Atmospheres  and 
Magnetospheres 

All  major  planets  have  an  atmosphere;  the  atmo¬ 
sphere  of  Mercury,  however,  is  extremely  thin. 
The  giant  planets  are  surrounded  by  a  very  thick 
gaseous  layer  that  can  be  regarded  as  an  at¬ 
mosphere.  Saturn’s  moon  Titan  has  a  relatively 
thick  atmosphere  consisting  of  methane.  Also  the 
dwarf  planet  Pluto  has  a  thin  atmosphere  consist¬ 
ing  mainly  of  methane. 

The  composition,  thickness,  density  and  struc¬ 
ture  of  the  atmosphere  vary  from  planet  to  planet, 
but  some  common  features  can  be  found 
(Figs.  7.15  and  7.16).  Let  us  first  study  the  de¬ 
pendence  of  the  temperature  T,  pressure  P,  and 
density  p  on  the  height  h.  Let  us  consider  a  cylin¬ 
der  with  a  length  d h.  The  change  in  the  pres¬ 
sure  d  P  from  the  height  h  to  h  +  dh  is  pro¬ 
portional  to  the  mass  of  the  gas  in  the  cylin¬ 
der: 

(7.6) 


Fig.  7.1 3  Ages  of  the  surfaces  of  Mercury,  the  Earth,  the 
Moon  and  Mars.  The  curve  represents  the  fraction  of  the 
surface  which  existed  at  a  certain  time.  Most  of  the  surface 
of  the  Moon,  Mercury  and  Mars  are  more  than  3500  mil¬ 
lion  years  old,  whereas  the  surface  of  the  Earth  is  mostly 
younger  than  200  million  years 

where  g  is  the  acceleration  of  gravity.  Equation 
(7.6)  is  the  equation  of  hydrostatic  equilibrium. 
(It  is  discussed  in  detail  in  Chap.  11.) 

As  a  first  approximation,  we  may  assume  that 
g  does  not  depend  on  height.  In  the  case  of  the 
Earth,  the  error  is  only  about  3  %  if  g  is  con¬ 
sidered  constant  from  the  surface  to  a  height  of 
100  km. 


d P  =  —gp  dh, 
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Fig.  7.14  The  number  of  meteorite  impact  craters  is  an 
indicator  of  the  age  of  the  surface  and  the  shapes  of  the 
craters  give  information  on  the  strength  of  the  material. 
The  upper  row  shows  Mercury  (left)  and  the  Moon,  and  the 
second  row ,  the  Jovian  moons  Europa  (left),  Ganymede 
(centre)  and  Callisto.  The  pictures  of  the  Jovian  moons 
were  taken  by  the  Galileo  orbiter  with  a  resolution  of 
150  metres/pixel.  Europa  has  only  a  few  craters,  there 
are  areas  of  different  ages  on  the  surface  Ganymede  and 
the  surface  of  Callisto  is  the  oldest.  Note  the  grooves  and 


ridges  that  indicate  different  geological  processes.  IN  the 
bottom  there  are  two  volcanic  plumes  on  Jupiter’s  moon 
Io  observed  by  Galileo  spacecraft  in  1997.  One  plume  was 
captured  on  the  bright  limb  or  edge  of  the  moon  (inset  at 
upper  right),  erupting  over  a  caldera  named  Pillan  Patera. 
The  plume  is  140  kilometers  high.  The  second  plume,  seen 
near  the  terminator,  is  called  Prometheus.  The  shadow  of 
the  75  km  high  airborne  plume  can  be  seen  extending  to 
the  right  of  the  eruption  vent.  (NASA/JPL  and  DLR) 


The  equation  of  state  of  the  ideal  gas 

PV  =  NkT  (7.7) 


gives  me  expression 


P  = 


pkT 


pc 


where  N  is  the  number  of  atoms  or  molecules, 
k  is  the  Boltzmann  constant,  pi  is  the  mass  of  one 


atom  or  molecule  and 


piN 

"v”’ 


By  using  the  equation  of  hydrostatic  equilibrium 
(7.6)  and  the  equation  of  state  (7.8),  we  obtain 


d  P 
~P 


d  h. 
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Fig.  7.15  Relative 
abundances  of  the  most 
abundant  gases  in  the 
atmospheres  of  Venus, 
Earth,  and  Mars.  The 
number  at  the  bottom  of 
each  circle  denotes  the 
surface  pressure  in  atms 
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Fig.  7.16  (a)  Temperature  as  a  function  of  height  in  the 
atmospheres  of  Venus,  Earth,  and  Mars,  (b)  Temperature 
profiles  of  the  atmospheres  of  Jupiter  and  Saturn.  The 
zero  height  is  chosen  to  be  the  point  where  the  pressure 
is  100  mbar.  Numbers  along  the  curves  are  pressures  in 
millibars 


Integration  yields  P  as  a  function  of  height: 


P  =  I\)  exp 


=  P0  exp 


The  variable  H ,  which  has  the  dimension  of 
length,  is  called  the  scale  height : 


H  = 


kT 

Pg' 


(7.10) 


The  scale  height  defines  the  height  at  which  the 
pressure  has  decreased  by  a  factor  e.  H  is  a  func¬ 
tion  of  height,  but  here  we  may  assume  that  it  is 
constant.  With  this  approximation,  we  obtain 

h  P 

- =  In  — 

H  P0 


or,  using  (7.8), 


pT(h)  _  t-h/H 

PoTo 


(7.11) 


The  scale  height  is  an  important  parameter  in 
many  formulas  describing  the  structure  of  the  at¬ 
mosphere  (Table  7.1).  For  example,  if  the  change 
of  the  pressure  or  the  density  is  known  as  a  func¬ 
tion  of  height,  the  mean  molecular  weight  of  the 
atmosphere  can  be  computed.  The  scale  height  of 
the  Jovian  atmosphere  was  determined  in  1952 
when  Jupiter  occulted  a  star.  With  these  observa¬ 
tions,  the  scale  height  was  calculated  to  be  8  km, 
and  the  mean  molecular  weight  3-5  amu  (atomic 
mass  unit,  1/12  of  the  mass  of  12C).  Thus  the 
main  components  are  hydrogen  and  helium,  a  re¬ 
sult  later  confirmed  by  spacecraft  data. 

In  terrestrial  observations,  infrared  data  are 
limited  by  water  vapour  and  carbon  dioxide.  The 
scale  height  of  CO2  is  5  km,  which  means  that 
the  partial  pressure  is  already  halved  at  a  height 
of  3.5  km.  Thus  infrared  observations  can  be 
made  on  top  of  high  mountains  (like  Mauna  Kea 
in  Hawaii).  The  scale  height  of  water  vapour 
is  13  km,  but  the  relative  humidity  and  hence 
the  actual  water  content  is  very  site-  and  time- 
dependent. 

The  scale  height  and  the  temperature  of  the 
atmosphere  define  the  permanence  of  the  atmo¬ 
sphere.  If  the  speed  of  a  molecule  is  greater  than 


7.7  Atmospheres  and  Magnetospheres 


157 


Table  7.1  Scale  heights  of  some  gases  in  the  atmospheres  of  Venus,  Earth,  and  Mars 


Gas 

Molecular  weight  [amu] 

Earth  H  [km] 

Venus  H  [km] 

Mars  H  [km] 

h2 

2 

120 

360 

290 

o2 

32 

7 

23 

18 

H20 

18 

13 

40 

32 

C02 

44 

5 

16 

13 

n2 

28 

8 

26 

20 

Temperature  [K] 

275 

750 

260 

Acceleration  of  gravity  [m/s2] 

9.81 

8.61 

3.77 

the  escape  velocity,  the  molecule  will  escape  into 
space.  The  whole  atmosphere  could  disappear  in 
a  relatively  short  time. 

According  to  the  kinetic  gas  theory,  the  mean 
velocity  v  of  a  molecule  depends  both  on  the  ki¬ 
netic  temperature  7k  of  the  gas  and  the  mass  m  of 
the  molecule: 


v  = 


(7.12) 


If  the  mass  of  a  planet  is  M  and  its  radius  R ,  the 
escape  velocity  is 


2  GM 


Ve  = 


R 


(7.13) 


Even  if  the  mean  velocity  is  smaller  than  the 
escape  velocity,  the  atmosphere  can  evaporate 
into  space  if  there  is  enough  time,  since  some 
molecules  will  always  have  velocities  exceed¬ 
ing  ve.  Assuming  a  velocity  distribution,  one  can 
calculate  the  probability  for  v  >  vQ  Hence  it  is 
possible  to  estimate  what  fraction  of  the  atmo¬ 
sphere  will  disappear  in,  say,  109  years.  As  a  rule 
of  thumb,  it  can  be  said  that  at  least  half  of  the 
atmosphere  will  remain  over  1000  million  years 
if  the  mean  velocity  v  <  0.2ne. 

Giant  planets  move  far  from  the  Sun;  thus 
the  surface  temperature  is  low.  Also,  the  grav¬ 
itation  is  strong.  Thus  it  is  understandable  that 
e.g.  Jupiter  has  been  able  to  retain  more  hydro¬ 
gen  than  the  Earth. 

The  probability  that  a  molecule  close  to  the 
surface  will  escape  is  insignificantly  small.  The 
free  mean  path  of  a  molecule  is  very  small  when 
the  gas  density  is  high  (Fig.  7.17).  Thus  the  escap¬ 
ing  molecule  is  most  probably  leaving  from  the 


Fig.  7.17  Close  to  the  surface,  the  mean  free  path  of 
a  molecule  is  smaller  than  higher  in  the  atmosphere  where 
the  gas  density  is  smaller.  The  escaping  molecules  origi¬ 
nate  close  to  the  critical  layer 


uppermost  layers.  The  critical  layer  is  defined  as 
a  height  at  which  a  molecule,  moving  upward,  has 
a  probability  1  /e  of  hitting  another  molecule.  The 
part  of  the  atmosphere  above  the  critical  layer  is 
called  the  exosphere.  The  exosphere  of  the  Earth 
begins  at  a  height  of  500  km,  where  the  kinetic 
temperature  of  the  gas  is  1500-2000  K  and  the 
pressure  is  lower  than  in  the  best  terrestrial  vacu¬ 
ums. 

The  magnetosphere  is  the  “outer  boundary”  of 
a  planet.  Size  and  shape  depend  on  the  strength 
of  the  magnetic  field  of  the  planet  and  on  the 
solar  wind.  The  solar  wind  is  a  flux  of  charged 
particles,  mostly  electrons  and  protons,  outflow- 
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Fig.  7.18  Structure  of  the  magnetosphere  of  the  Earth.  (A.  Nurmi/Tiede  2000) 


in g  from  the  Sun.  The  speed  of  the  wind  at  the 
distance  of  the  Earth  is  about  500  km/s  and  the 
density  5-10  particles/cm3  but  both  values  can 
change  considerably  depending  on  the  solar  ac¬ 
tivity. 

On  the  solar  side  there  is  a  bow  shock 
(Fig.  7.18),  typically  at  a  distance  of  a  few  tens 
of  planetary  radii  (Table  7.2).  At  the  bow  shock, 
particles  of  the  solar  wind  first  hit  the  magne¬ 
tosphere.  The  magnetosphere  is  limited  by  the 
magnetopause ,  flattened  on  the  solar  side  and  ex¬ 
tended  to  a  long  tail  on  the  opposite  side.  Charged 
particles  inside  the  magnetopause  are  captured  by 
the  magnetic  field  and  some  particles  are  accel¬ 
erated  to  great  velocities  (Fig.  7.19).  If  the  ve¬ 
locities  are  interpreted  according  to  the  kinetic 
gas  theory,  these  velocities  even  correspond  to 
millions  of  kelvins.  However,  the  density,  and 
thus  the  total  energy,  is  very  small.  The  “hottest” 


places  found  are  around  Jupiter  and  Saturn,  where 
the  particle  velocities  correspond  even  tempera¬ 
tures  of  400  million  kelvins. 

The  region  of  space  containing  trapped  char¬ 
ged  particles,  the  radiation  belts  around  the  Earth, 
are  named  van  Allen’s  belts.  These  radiation 
zones  were  discovered  by  the  first  US  satellite, 
Explorer  1,  in  1958. 

The  number  of  charged  particles  increases  af¬ 
ter  strong  solar  bursts.  Some  of  these  particles 
“leak”  to  the  atmosphere,  resulting  in  auroras. 
Similar  effects  have  also  been  detected  in  Jupiter, 
Saturn  and  Uranus. 

The  solar  magnetic  field  arises  from  the  turbu¬ 
lent  motions  of  the  electrically  conductive  matter. 
The  energy  driving  the  convection  in  the  layer  is 
coming  from  the  nuclear  fusion  in  the  core.  This, 
however,  cannot  explain  planetary  magnetism. 
Neither  can  the  remanent  primordial  magnetic 
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Table  7.2  Planetary  magnetic  fields 


Dipole  moment  (Earth  =1) 

Field  strength  (gauss)a 

Polarity1 

Angle0 

Magneto-pause4 

Mercury 

0.0007 

0.003 

it 

14° 

1.5 

Venus 

<0.0004 

<0.00003 

— 

— 

— 

Earth 

1.0 

0.305 

t 

11° 

10 

Mars 

<0.0002 

<0.0003 

— 

— 

— 

Jupiter 

20,000. 

4.28 

10° 

80 

Saturn 

600. 

0.22 

<1° 

20 

Uranus 

50. 

0.23 

59° 

20 

Neptune 

25. 

0.14 

47° 

25 

aAt  equator  (1  gauss  equals  10  4  T) 

bff  same  as  the  Earth,  opposite 

c  Angle  between  magnetic  and  rotational  axes 

dAverage  magnetopause  distance  in  the  direction  of  the  Sun  in  planetary  radii 


Fig.  7.1 9  A  glow  of  hot  plasma  trapped  inside  the  Earth’s 
magnetosphere.  The  picture  was  taken  by  NASA’s  Im¬ 
ager  for  Magnetopause  to  Aurora  Global  Exploration  (IM¬ 
AGE)  spacecraft  on  August  11,  2000  at  18:00  UT.  The 
Sun  is  outside  the  picture  area  toward  the  top  right  corner. 
(NASA  and  the  IMAGE  science  team) 


field  explain  it  because  the  internal  temperature 
of  planets  is  well  above  the  Curie  point  (about 
850  K  for  magnetite).  If  the  temperature  is  above 
the  Curie  point,  ferromagnetic  materials  will  lose 
their  remanent  magnetism. 


The  planetary  dynamo  generating  the  mag¬ 
netic  field  requires  that  the  planet  is  rotating  and 
has  a  convective  layer  of  electrically  conductive 
material.  The  temperature  gradient  across  this 
layer  must  also  be  high  enough  to  maintain  con¬ 
vection.  Terrestrial  planets  have  a  liquid  Fe-Ni 
core,  or  a  liquid  layer  in  the  core,  Jupiter  and  Sat¬ 
urn  have  a  layer  of  liquid  metallic  hydrogen  and 
Uranus  and  Neptune  have  a  mixture  of  water,  am¬ 
monia  and  methane. 

The  strength  of  the  magnetic  field  varies  a  lot 
from  planet  to  planet.  It  can  be  characterised 
by  the  dipole  magnetic  moment.  The  magnetic 
moment  of  Jupiter  is  about  100  million  times 
that  of  Mercury.  The  magnetic  moment  of  the 
Earth  is  about  7.9  x  1025  gauss  cm3  that  can 
be  compared  to  the  typical  strong  electromag¬ 
netic  fields  achieved  in  the  laboratories,  about 
100,000  gauss  cm3.  Inducing  such  a  strong  field 
requires  currents  that  are  of  the  order  of  109  Am¬ 
peres.  When  divided  by  the  cube  of  planetary 
radii,  one  gets  an  estimate  of  the  field  strength 
on  the  equator. 

The  alignment  of  the  magnetic  field  with  re¬ 
spect  to  the  rotation  axis  of  a  planet  differs  from 
planet  to  planet  (Fig.  7.20).  Saturn’s  magnetic 
field  is  close  to  the  ideal  case  where  rotational 
axis  and  magnetic  axis  coincide.  Also  the  Earth 
and  Jupiter  show  reasonably  good  point  dipole 
field  with  a  tilt  of  about  10°.  However,  fields  of 
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Fig.  7.20  Planetary  magnetic  fields 


Uranus  and  Neptune  are  both  offset  from  the  cen¬ 
tre  of  the  planet  and  tilted  by  about  50°  from  the 
rotation  axis.  This  may  indicate  a  different  mech¬ 
anism  for  the  dynamo. 

The  magnetic  fields  of  Mercury  and  the  Earth 
have  an  opposite  polarity  than  the  fields  of 
other  planets.  It  is  known  that  the  polarity  of 
the  Earth’s  magnetic  field  has  reversed  several 
times  over  geologic  time  scales,  previously  about 
750,000  years  ago.  There  are  some  indications 
that  the  reversal  of  the  polarity  is  beginning  now 
because  the  field  strength  is  declining  about  one 
percent  per  decade,  magnetic  poles  are  moving 
more  rapidly  and  the  field  asymmetry  is  increas¬ 
ing.  The  whole  process  will  take  several  thousand 
years  during  which  the  Earth’s  surface  is  more 
open  to  the  cosmic  rays. 

The  Galileo  mission  also  revealed  that  the  Jo¬ 
vian  moon  Ganymede  has  a  magnetic  field.  The 
field  is  weak  and  too  small  to  have  a  magneto¬ 
tail  or  trapped  particles  around  the  moon.  Cal- 
listo,  which  is  of  the  same  size,  does  not  show 
any  magnetosphere.  Neither  does  our  Moon  have 
any  global  magnetic  field. 


7.8  Albedos 

The  planets  and  all  other  bodies  of  the  solar  sys¬ 
tem  only  reflect  the  radiation  of  the  Sun  (we  may 
neglect  here  the  thermal  and  radio  wave  radia¬ 
tion  and  concentrate  mainly  on  the  visual  wave¬ 
lengths).  The  brightness  of  a  body  depends  on  its 
distance  from  the  Sun  and  the  Earth,  and  on  the 
albedo  of  its  surface.  The  term  albedo  defines  the 
ability  of  a  body  to  reflect  light. 


Fig.  7.21  Symbols  used  in  the  photometric  formulas.  The 
angle  a  is  the  phase  angle 


If  the  luminosity  of  the  Sun  is  L©,  the  flux 
density  at  the  distance  r  is  (Fig.  7.21) 


^0 
4nr 2 


(7.14) 


If  the  radius  of  the  planet  is  R ,  the  area  of  its  cross 
section  is  ttR2,  and  the  total  flux  incident  on  the 
surface  of  the  planet  is 


Lin  =  n  R2 


4tv  r2 


LqR2 

4r2 


(7.15) 


Only  a  part  of  the  incident  flux  is  reflected  back. 
The  other  part  is  absorbed  and  converted  into 
heat  which  is  then  emitted  as  a  thermal  emission 
from  the  planet.  The  Bond  albedo  A  (or  spheri¬ 
cal  albedo)  is  defined  as  the  ratio  of  the  emergent 
flux  to  the  incident  flux  (0  <  A  <  1).  The  flux  re¬ 
flected  by  the  planet  is  thus 


L 


out 


ALqR2 
4  r2 


(7.16) 


The  planet  is  observed  at  a  distance  A.  If  radi¬ 
ation  is  reflected  isotropically,  the  observed  flux 
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density  should  be 


F=T^2-  (-1A7'> 

An  A2 

In  reality,  however,  radiation  is  reflected  aniso- 
tropically.  If  we  assume  that  the  reflecting  object 
is  a  homogeneous  sphere,  the  distribution  of  the 
reflected  radiation  depends  on  the  phase  angle  a 
only.  Thus  we  can  express  the  flux  density  ob¬ 
served  at  a  distance  A  as 


F  =  C0(a ) 


L 


out 


47T  A2 


(7.18) 


The  function  0  giving  the  phase  angle  depen¬ 
dence  is  called  the  phase  function.  It  is  nor¬ 
malised  so  that  0  (a  —  0°)  —  1 . 

Since  all  the  radiation  reflected  from  the  planet 
is  found  somewhere  on  the  surface  of  the  sphere, 
we  must  have 


C0(a) 


L 


out 


471  A2 


d  S  =  L 


out 


(7.19) 


Remembering  that  Lout  =  ALm ,  (7.18)  can  be 
written  in  the  form 


1 

A2Li 


(7.25) 


The  first  factor  is  intrinsic  for  each  object,  the  sec¬ 
ond  gives  the  phase  angle  dependence,  the  third 
the  distance  dependence  and  the  fourth,  the  inci¬ 
dent  radiation  power.  The  first  factor  is  often  de¬ 
noted  by 


CA 

An 


(7.26) 


When  we  substitute  here  the  expression  of  C 
(7.24),  and  solve  for  the  Bond  albedo,  we  get 


An  T  A 

A  = - =  n  T  —  =  n  Tq  =  pq.  (7.27) 

Here  p  =  nF  is  called  the  geometric  albedo 
and  q  is  the  previously  introduced  phase  integral. 
These  quantities  are  related  by 


or 


C 

471  A2 


0(a)dS=  1, 


(7.20) 


where  the  integration  is  extended  over  the  surface 
of  the  sphere  of  radius  A.  The  surface  element 
of  such  a  sphere  is  d S  =  A2  da  sina  d</>,  and  we 
have 


/»  pit 

0(a)dS  =  A2  /  0(a)  sin  a  da  d</> 

Js  Ja  =  oJd) 


0(a)dS  = 

'  S  Ja  =  0  J  <p  =  0 

rn 

=  A22n  /  0(a)  sin  a  da. 

Jo 

The  normalisation  constant  C  is 


(7.21) 


C  = 


An  A' 


fs  0  (a)  dS  Jq  0(a)  sin  a  da 


(7.22) 


The  quantity 


•  JT 


q  =  2  f  0(a)  sin  a  da  (7.23) 

Jo 


is  the  phase  integral.  In  terms  of  the  phase  inte¬ 
gral  the  normalisation  constant  is 


A  =  pq.  (7.28) 


The  geometric  albedo  seems  to  have  appeared 
as  an  arbitrary  factor  with  no  obvious  physical  in¬ 
terpretation.  We’ll  now  try  to  explain  this  quan¬ 
tity  using  a  Lambertian  surface.  A  Lambertian 
surface  is  defined  as  an  absolutely  white,  diffuse 
surface  which  reflects  all  radiation,  i.e.  its  Bond 
albedo  is  A  =  1.  Moreover,  its  surface  bright¬ 
ness  is  the  same  for  all  viewing  directions,  which 
means  that  the  phase  function  is 


0(a)  = 


cosa, 


if  0  <  a  <  7t/2, 
otherwise. 


(7.29) 


In  reality,  no  such  surface  exists  but  there  are 
some  materials  which  behave  almost  like  a  Lam¬ 
bertian  surface.  A  wall  with  a  mat  white  finish  is 
a  good  approximation;  although  it  doesn’t  reflect 
all  incident  light,  the  distribution  of  the  reflected 
light  is  about  right,  and  its  brightness  looks  the 
same  from  all  directions. 

For  a  Lambertian  surface  the  constant  C  is 


4 

C  =  ~.  (7.24) 

q 


2 

Jq  0(a)  sin  a  da 
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/;/2  cos  a  sin  a  da 


1/2 


=  4. 


(7.30) 


Thus  the  geometric  albedo  of  a  Lambertian  sur¬ 
face  is 


CA  4x1 

p  =  tx  r  =  —  = - =  i. 

4  4 


(7.31) 


At  the  phase  angle  zero  0(a  =  0°) 
the  reflected  flux  density  is 


=  1  and 


CA  1 

F  =  ~a  To^in- 

471  A2 


If  we  replace  the  object  with  a  Lambertian  surface 
of  the  same  size,  we  get 


4  1 

Fh~A^^-Lm' 


The  ratio  of  these  flux  densities  is 


F  CA 

—  =  —  =  jt  r  =  p. 
Fl  4 


(7.32) 


Now  we  have  found  a  physical  interpretation 
for  p:  the  geometric  albedo  is  the  ratio  of  the 
flux  densities  at  phase  angle  a  =  0°  reflected  by 
a  planet  and  a  Lambertian  surface  of  the  same 
cross  section. 

The  geometric  albedo  depends  on  the  re¬ 
flectance  of  the  surface  but  also  on  the  phase 
function  0 .  Many  rough  surfaces  reflect  most  of 
the  incident  radiation  directly  backward.  In  such 
a  case  the  geometric  albedo  p  is  greater  than  in 
the  case  of  an  isotropically  reflecting  surface.  On 
some  surfaces  p  >  1 ;  the  most  extreme  case  is  a 
mirror,  for  which  the  specular  reflection,  p  =  oo. 
The  geometric  albedo  of  solar  system  bodies  vary 
between  0.03-1.  The  geometric  albedo  of  the 
Moon  is  =  0.12  and  the  greatest  value,  p  =  1.0, 
has  been  measured  for  the  Saturnian  moon  Ence- 
ladus. 

It  turns  out  that  p  can  be  derived  from  the  ob¬ 
servations,  but  the  Bond  albedo  A  can  be  deter¬ 
mined  only  if  the  phase  integral  q  is  also  known. 
That  will  be  discussed  in  the  next  section. 


7.9  Photometry,  Polarimetry  and 
Spectroscopy 

Having  defined  the  phase  function  and  albedos 
we  are  ready  to  derive  a  formula  for  planetary 
magnitudes.  The  flux  density  of  the  reflected  light 


CA  1 
F  =  —  0  (a)  —zL{n. 

4tt  A2 

We  now  substitute  the  incident  flux 


L  in  — 


4  r2 


and  the  constant  factor  expressed  in  terms  of  the 
geometric  albedo 


CA  p 

—  =  r  =  - 

4tt  7 x 


Thus  we  get 


p  1  LqR 2 

F  =  L0(a)  c— 

jt  A2  4  r2 


(7.33) 


The  observed  solar  flux  density  at  a  distance  of 
a  =  1  au  from  the  Sun  is 


47T<22 


(7.34) 


The  ratio  of  these  is 


F  p0{a)R2a2 


(7.35) 


If  the  apparent  solar  magnitude  at  a  distance  of 
1  au  is  mQ  and  the  apparent  magnitude  of  the 
planet  m  we  have 

F 

m  -  mQ  =  -2.5  lg — 


2^2 


=  -2.51g 


p0(a)R  a 
A  2r2 


c  pR2  a4 
~2'5  g^AV0(a) 

~2'5lgP7l 2  _2-51gXT2 

-  2.5  lg  0(a) 
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R2  A  r 

=  — 2.5  lg  p  — T-  +  5  lg  — r- 
az  az 

—  2.5  \g  0(a).  (7.36) 

If  we  denote 

R2 

^(1>  0)  =  m©  —  2.5 lg  p—*,  (7.37) 

az 

then  the  magnitude  of  a  planet  can  be  expressed 
as 


m  =  V(l,0)  +  51g^  -  2.5 lg  0(a).  (7.38) 

a1 

The  first  term  V(1,0)  depends  only  on  the  size 
of  the  planet  and  its  reflection  properties.  So  it  is 
a  quantity  intrinsic  to  the  planet,  and  it  is  called 
the  absolute  magnitude  (not  to  be  confused  with 
the  absolute  magnitude  in  stellar  astronomy!). 
The  second  term  contains  the  distance  depen¬ 
dence  and  the  third  one  the  dependence  on  the 
phase  angle. 

If  the  phase  angle  is  zero,  and  we  set  r  =  A  = 
a ,  (7.38)  becomes  simply  m  =  V(1,0).  The  ab¬ 
solute  magnitude  can  be  interpreted  as  the  mag¬ 
nitude  of  a  body  if  it  is  at  a  distance  of  1  au  from 
the  Earth  and  the  Sun  at  a  phase  angle  a  =  0°. 
As  will  be  immediately  noticed,  this  is  physically 
impossible  because  the  observer  would  be  in  the 
very  centre  of  the  Sun.  Thus  V  (1 , 0)  can  never  be 
observed. 

By  using  (7.37)  and  (7.38)  at  a  =  0°,  the  ge¬ 
ometric  albedo  can  be  solved  for  in  terms  values 
all  obtainable  from  observations. 


2 

10_ 0-4(mo 


ra©) 


(7.39) 


where  mo  =  m(a  =  0°).  As  can  easily  be  seen, 
p  can  be  greater  than  unity  but  in  the  real  world,  it 
is  normally  well  below  that.  Typical  values  for  p 
are  in  the  range  0. 1-0.5. 

The  last  term  containing  the  phase  angle  de¬ 
pendence  in  (7.38)  is  the  most  problematic  one. 
For  many  objects  the  phase  function  is  not  known 
very  well.  This  means  that  from  the  observations, 
one  can  calculate  only 


V(l,  a)  =  V(l,  0)  -  2.5  lg  0(a),  (7.40) 


which  is  the  absolute  magnitude  at  phase  an¬ 
gle  a.  V (1,  a),  plotted  as  a  function  of  the  phase 
angle,  is  called  the  phase  curve  (Fig.  7.22).  The 
phase  curve  extrapolated  to  a  =  0°  gives  V  (1 , 0). 
The  shape  of  the  phase  curve  is  very  different  for 
objects  with  or  without  an  atmosphere. 

The  Bond  albedo  can  be  determined  only  if 
the  phase  function  0  is  known.  Superior  planets 
(and  other  bodies  orbiting  outside  the  orbit  of  the 
Earth)  can  be  observed  only  in  a  limited  phase  an¬ 
gle  range,  and  therefore  0  is  poorly  known,  ex¬ 
cept  for  those  bodies  that  have  been  observed  by 
spacecraft.  The  situation  is  somewhat  better  for 
the  inferior  planets.  Especially  in  popular  texts 
the  Bond  albedo  is  given  instead  of  p  (naturally 
without  mentioning  the  exact  names!).  A  good 
excuse  for  this  is  the  obvious  physical  meaning  of 
the  former,  and  also  the  fact  that  the  Bond  albedo 
is  normalised  to  [0,  1]. 

Opposition  Effect  If  an  object  has  an  atmo¬ 
sphere  it  reflects  light  more  or  less  isotropically 
to  all  directions.  The  flux  density  of  the  reflected 
light  is  then  proportional  to  the  area  of  the  visible 
illuminated  surface  (actually  to  the  projection  of 
this  area  on  a  plane  perpendicular  to  the  line  of 
sight).  Atmosphereless  bodies  reflect  light  more 
strongly  to  the  direction  of  the  incident  light. 
Hence  the  brightness  increases  rapidly  when  the 
phase  angle  approaches  zero.  When  the  phase  is 
larger  than  about  10°,  the  changes  are  smaller. 
This  rapid  brightening  close  to  the  opposition  is 
called  the  opposition  effect.  An  atmosphere  de¬ 
stroys  the  opposition  effect. 

The  full  explanation  is  still  in  dispute.  A  qual¬ 
itative  (but  only  partial)  explanation  is  that  close 
to  the  opposition,  no  shadows  are  visible.  When 
the  phase  angle  increases,  the  shadows  become 
visible  and  the  brightness  drops.  The  main  rea¬ 
son,  however,  is  the  coherent  backscatter  due  to 
the  wave  properties  of  the  light. 

Magnitudes  of  Asteroids  The  shape  of  the 
phase  curve  depends  on  the  geometric  albedo.  It 
is  possible  to  estimate  the  geometric  albedo  if  the 
phase  curve  is  known.  This  requires  at  least  a  few 
observations  at  different  phase  angles.  Most  crit¬ 
ical  is  the  range  0°-10°.  A  known  phase  curve 
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a)  Phase  Angle  (°)  b)  Phase  Angle  (°) 


Fig.  7.22  The  phase  curves  and  polarisation  of  differ¬ 
ent  types  of  asteroids.  The  asteroid  characteristics  are  dis¬ 
cussed  in  more  detail  in  Sect.  8.11.  (From  Muinonen  et 

can  be  used  to  determine  the  diameter  of  the 
body,  e.g.  the  size  of  an  asteroid.  Apparent  di¬ 
ameters  of  asteroids  are  so  small  that  for  ground 
based  observations  one  has  to  use  indirect  meth¬ 
ods,  like  polarimetric  or  radiometric  (thermal  ra¬ 
diation)  observations  (Fig.  7.22).  Beginning  from 
the  1990’s,  imaging  made  during  spacecraft  fly¬ 
bys  and  with  the  Hubble  Space  Telescope  have 
given  also  direct  measures  of  the  diameter  and 
shape  of  asteroids. 

When  the  phase  angle  is  greater  than  a  few  de¬ 
grees,  the  magnitude  of  an  asteroid  depends  al¬ 
most  linearly  on  the  phase  angle.  Earlier  this  lin¬ 
ear  part  was  extrapolated  to  a  =  0°  to  estimate 
the  opposition  magnitude  of  an  asteroid.  Due  to 
the  opposition  effect  the  actual  opposition  mag¬ 
nitude  can  be  considerably  brighter. 

In  1985  the  IAU  adopted  the  HG  system  for 
magnitudes  of  atmosphereless  bodies.  Formally, 


al.,  Asteroid  photometric  and  polarimetric  phase  effects, 
in  Bottke,  Binzel,  Cellino,  Paolizhi  (Eds.)  Asteroids  III, 
University  of  Arizona  Press,  Tucson) 

it  was  semi-empirical,  although  it  was  based  on 
photometric  theories  by  Lumme  and  Bowell.  In 
the  2012  meeting  this  was  replaced  by  a  new 
HG\G2  system.  Although  the  older  HG  system 
was  useful  in  many  cases,  it  was  not  satisfac¬ 
tory  if  the  opposition  effect  was  very  small  or  re¬ 
stricted  to  very  small  phase  angles. 

In  the  new  system  the  magnitude  at  phase  an¬ 
gle  a  is 

V(l,a) 

=  -2.51g[<2i0i(oO  +a2<p2(oi)  +  <2303(aO] 
=  H  -  2.51g[Gi0i(o')  +  G2<&i(oi) 

+  (1-G1-G2)<2>3(a)],  (7.41) 

where  the  values  of  the  basis  functions  @2 
and  03  are  found  by  spline  interpolations  from 
the  following  tables: 
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a  [°] 

01 

0.0 

1.0 

7.5 

0.75 

30.0 

0.33486016 

60.0 

0.13410560 

90.0 

0.05110476 

120.0 

0.02146569 

150.0 

0.00363970 

<£j(7.5°)  =  —1.90986 

<£j(150°)  =  —0.09133 

and  hence  H  is  just  the  absolute  magnitude  in  op¬ 
position.  The  constants  G\  and  G2  describe  the 
shape  of  the  phase  curve. 

Asteroid  data  has  earlier  been  published  in  the 
yearbook  Efemeridy  malyh  planet.  Currently  the 
best  source  is  the  web  pages  of  the  Minor  Planet 
Center:  http://www.cfa.harvard.edu/iau/services/ 
WebCSAccess.html. 


a  [°] 

02 

0.0 

1.0 

7.5 

0.925 

30.0 

0.62884169 

60.0 

0.31755495 

90.0 

0.12716367 

120.0 

0.02237390 

150.0 

0.00016506 

<Z>'(7.5°)  =  -0.57330 

<£'(150°)  =  -8.657  x  10“8 

Polarimetric  Observations  The  light  reflected 
by  the  bodies  of  the  solar  system  is  usually  po¬ 
larised,  at  least  to  some  degree.  The  amount  of 
polarisation  depends  on  the  reflecting  material 
and  also  on  the  geometry:  polarisation  is  a  func¬ 
tion  of  the  phase  angle.  The  degree  of  polarisa¬ 
tion  P  is  defined  as 


F±  -  Fy 
F±  +  Fl{  ’ 


(7.44) 


a  [°] 

03 

0.0 

1.0 

0.3 

0.83381185 

1.0 

0.57735424 

2.0 

0.42144772 

4.0 

0.23174230 

8.0 

0.10348178 

12.0 

0.06173347 

20.0 

0.01610701 

30.0 

0.0 

<Z>'(0°)  =  -0.10630 

<Z>'(30°)  =  0 

When  the  phase  angle  is  zero  all  the  functions 
have  the  value  1.  If  the  phase  angle  is  a  <  7.5°, 
01  ans  02  are  linear  functions:  <P\(a)  =  1  — 
a/30°,  02 (a)  =  1  —  o'/100°. 

Fitting  an  expression  in  terms  of  the  basis 
functions  to  the  observed  phase  curve  one  gets 
the  coefficients  at ,  and  then  further 

H  =  -2.5  lg(ai  +  a2  +  03), 

G\  =  a\/(a\  +  02  +  03),  (7.42) 

G2  =  a2/(ai  +  02  +  03 ), 

When  the  phase  angle  is  zero,  we  have 

V(l,  0)  =  H  —  2.51g[Gi  +  G2  +  1  -  Gi  -  G2] 
=  H,  (7.43) 


where  F±  is  the  flux  density  of  radiation,  per¬ 
pendicular  to  a  fixed  plane,  called  the  scattering 
plane,  and  F\\  is  the  flux  density  parallel  to  the 
plane.  In  solar  system  studies,  polarisation  is  usu¬ 
ally  referred  to  the  plane  defined  by  the  Earth,  the 
Sun,  and  the  object.  According  to  (7.44),  P  can 
be  positive  or  negative;  thus  the  terms  “positive” 
and  “negative”  polarisation  are  used. 

The  degree  of  polarisation  as  a  function  of  the 
phase  angle  depends  on  the  surface  structure  and 
the  atmosphere.  The  degree  of  polarisation  of  the 
light  reflected  by  the  surface  of  an  atmosphereless 
body  is  positive  when  the  phase  angle  is  greater 
than  about  20°.  Closer  to  opposition,  polarisa¬ 
tion  is  negative.  A  dependence  between  the  polar¬ 
isation  and  geometric  albedo  has  been  observed. 
This  gives  an  independent  method  for  determin¬ 
ing  the  albedo  and  the  size. 

When  light  is  reflected  from  an  atmosphere, 
the  degree  of  polarisation  as  a  function  of  the 
phase  angle  is  more  complicated.  For  some  phase 
angles  P  can  be  highly  negative.  Using  the  theory 
of  radiative  transfer,  one  can  compute  how  the  at¬ 
mosphere  affects  light  and  its  polarisation.  Com¬ 
paring  these  results  with  observations  one  can  ob¬ 
tain  information  about  the  contents  of  the  atmo¬ 
sphere.  For  example,  the  composition  of  Venus’ 
atmosphere  could  be  studied  by  polarisation  stud¬ 
ies  before  any  probes  were  sent  to  the  planet. 
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Wavelength  [nm] 


480  520  560  600  640  680  720 


Moon 


Jupiter 


Saturn 


Uranus 


Neptune 


Fig.  7  .23  Spectra  of  the  Moon  and  the  giant  planets.  Strong  absorption  bands  can  be  seen  in  the  spectra  of  Uranus  and 
Neptune.  (Lowell  Observatory  Bulletin  42  (1909)) 


Planetary  Spectroscopy  The  photometric  and 
polarimetric  observations  discussed  above  were 
monochromatic.  However,  the  studies  of  the  at¬ 
mosphere  of  Venus  also  used  spectral  informa¬ 
tion.  Broadband  UBV  photometry  or  polarime- 
try  is  the  simplest  example  of  spectrophotometry 
(spectropolarimetry).  The  term  spectrophotome¬ 
try  usually  means  observations  made  with  sev¬ 
eral  narrowband  filters.  Naturally,  solar  system 
objects  are  also  observed  by  means  of  “classical” 
spectroscopy. 

Spectrophotometry  and  polarimetry  give  in¬ 
formation  at  discrete  wavelengths  only.  In  prac¬ 
tise,  the  number  of  points  of  the  spectrum  (or 
the  number  of  filters  available)  is  often  limited 
to  20-30.  This  means  that  no  details  can  be  seen 
in  the  spectra.  On  the  other  hand,  in  ordinary 
spectroscopy,  the  limiting  magnitude  is  smaller, 
although  the  situation  is  rapidly  improving  with 
the  new  generation  detectors,  such  as  the  CCD 
camera. 

The  spectrum  observed  is  the  spectrum  of  the 
Sun.  Generally,  the  planetary  contribution  is  rel¬ 


atively  small,  and  these  differences  can  be  seen 
when  the  solar  spectrum  is  subtracted.  The  Ura¬ 
nian  spectrum  is  a  typical  example  (Fig.  7.23). 
There  are  strong  absorption  bands  in  the  near- 
infrared.  Laboratory  measurements  have  shown 
that  these  are  due  to  methane.  A  portion  of  the 
red  light  is  also  absorbed,  causing  the  greenish 
colour  of  the  planet.  The  general  techniques  of 
spectral  observations  are  discussed  in  the  context 
of  stellar  spectroscopy  in  Chap.  9. 


7.1 0  Thermal  Radiation  of  the 
Planets 

Thermal  radiation  of  the  solar  system  bodies  de¬ 
pends  on  the  albedo  and  the  distance  from  the 
Sun,  i.e.  on  the  amount  of  absorbed  radiation.  In¬ 
ternal  heat  is  important  in  Jupiter  and  Saturn,  but 
we  may  neglect  it  at  this  point. 

By  using  the  Stefan-Boltzmann  law,  the  flux 
on  the  surface  of  the  Sun  can  be  expressed  as 

L  =  4ttRqCtTq. 
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If  the  Bond  albedo  of  the  body  is  A,  the  fraction 
of  the  radiation  absorbed  by  the  planet  is  (1  — 
A).  This  is  later  emitted  as  heat.  If  the  body  is  at 
a  distance  r  from  the  Sun,  the  absorbed  flux  is 


RqgTqkR 


(1 


(7.45) 


There  are  good  reasons  to  assume  that  the  body 
is  in  thermal  equilibrium ,  i.e.  the  emitted  and 
the  absorbed  fluxes  are  equal.  If  not,  the  body 
will  warm  up  or  cool  down  until  equilibrium  is 
reached. 

Let  us  first  assume  that  the  body  is  rotating 
slowly.  The  dark  side  has  had  time  to  cool  down, 
and  the  thermal  radiation  is  emitted  mainly  from 
one  hemisphere.  The  flux  emitted  is 

Lem  =  2ttR2(tT4,  (7.46) 


where  T  is  the  temperature  of  the  body  and  2: r  R2 
is  the  area  of  one  hemisphere.  In  thermal  equilib¬ 
rium,  (7.48)  and  (7.49)  are  equal: 

d2  rrA 

Q,°(l  -  A)  =  2T4, 

y  A 


whence 


(^f(A/2 


(7.47) 


A  body  rotating  quickly  emits  an  approximately 
equal  flux  from  all  parts  of  its  surface.  The  emit¬ 
ted  flux  is  then 


Table  7.3  Theoretical  and  observed  temperatures  of 
some  planets 


Albedo 

Distance 
from 
the  Sun 
[AU] 

Theoretical 

temperature 

[K] 

Observed 

maximum 

temperature 

[K] 

(7.50) 

(7.51) 

Mercury 

0.06 

0.39 

525 

440 

700 

Venus 

0.76 

0.72 

270 

230 

750 

Earth 

0.36 

1.00 

290 

250 

310 

Mars 

0.16 

1.52 

260 

215 

290 

Jupiter 

0.73 

5.20 

110 

90 

130 

effect,  radiation  is  allowed  to  enter,  but  not  to  exit. 
The  same  effect  is  at  work  in  the  Earth’s  atmo¬ 
sphere.  Without  the  greenhouse  effect,  the  mean 
temperature  could  be  well  below  the  freezing 
point  and  the  whole  Earth  would  be  ice-covered. 
Particularly  strong  the  effect  is  on  Venus,  where 
the  surface  temperature  is  hundreds  of  degrees 
higher  than  the  theoretical  value. 

According  to  the  Wien  displacement  law 
(5.22)  Amax  =  b/T  the  radiation  maximum  of 
a  body  at  200  K  is  at  A  =  14  pm,  deep  in  in¬ 
frared.  When  the  thermal  radiation  in  the  infrared 
or  radio  range  is  measured  the  temperature  can  be 
found,  and  further  the  Bond  albedo  can  be  calcu¬ 
lated  from  (7.47)  or  (7.48).  If  also  the  phase  func¬ 
tion  is  known  the  geometric  albedo  and  hence  the 
diameter  can  be  evaluated. 
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Lem  =  4jzR2oT4 


and  the  temperature 


(¥f(A/2 


(7.48) 


The  theoretical  temperatures  obtained  above 
are  not  valid  for  most  of  the  major  planets.  The 
main  “culprits”  responsible  here  are  the  atmo¬ 
sphere  and  the  internal  heat.  Measured  and  the¬ 
oretical  temperatures  of  some  major  planets  are 
compared  in  Table  7.3.  Venus  is  an  extreme  ex¬ 
ample  of  the  disagreement  between  theoretical 
and  actual  figures.  The  reason  is  the  greenhouse 


Cosmogony  is  a  branch  of  astronomy  which  stud¬ 
ies  the  origin  of  the  solar  system.  The  first  steps 
of  the  planetary  formation  processes  are  closely 
connected  to  star  formation. 

Although  the  properties  and  details  of  the  bod¬ 
ies  of  our  solar  system  (see  next  chapter)  may 
look  wildly  different  there  are  some  distinct  fea¬ 
tures  which  have  to  be  explained  by  any  serious 
cosmogonical  theory.  These  include: 

-  planetary  orbits  are  almost  coplanar  and  also 
parallel  to  the  solar  equator; 

-  orbits  are  almost  circular; 

-  planets  orbit  the  Sun  counterclockwise,  which 
is  also  the  direction  of  solar  rotation; 
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Table  7.4  True  distances  of  the  planets  from  the  Sun  and 
distances  according  to  the  Titius-Bode  law  (7.49) 


Planet 

n 

Calculated 

distance 

[AU] 

True 

distance 

[AU] 

Mercury 

—oo 

0.4 

0.4 

Venus 

0 

0.7 

0.7 

Earth 

1 

1.0 

1.0 

Mars 

2 

1.6 

1.5 

Ceres 

3 

2.8 

2.8 

Jupiter 

4 

5.2 

5.2 

Saturn 

5 

10.0 

9.2 

Uranus 

6 

19.6 

19.2 

Neptune 

7 

38.8 

30.1 

Pluto 

8 

77.2 

39.5 

-  planets  also  rotate  around  their  axes  counter¬ 
clockwise  (excluding  Venus  and  Uranus); 

-  planets  have  99  %  of  the  angular  momentum 
of  the  solar  system  but  only  0.15  %  of  the  total 
mass; 

-  terrestrial  and  giant  planets  exhibit  physical 
and  chemical  differences; 

-  relative  abundances  of  ices  and  rocks  as  a  func¬ 
tion  of  the  distance  from  the  Sun. 

Sometimes  also  the  empirical  Titius-Bode  law 
is  included  (Table  7.4).  It  states  that 

a  =  0.4  +  0.3  x  2n , 
n  = —o o,0, 1,2, ...  (7.49) 

where  the  semimajor  axis  a  is  expressed  in  au. 

It  is  sometimes  mentioned  that  the  first  scien¬ 
tific  theory  was  the  vortex  theory  by  the  French 
philosopher  Rene  Descartes  in  1644;  however  it 
was  concerned  about  the  motion  of  the  solar  sys¬ 
tem  bodies  and  not  its  origin. 

The  first  modern  cosmogonical  theories  were 
introduced  in  the  18th  century.  One  of  the  first 
cosmogonists  was  Immanuel  Kant ,  who  in  1755 
presented  his  nebular  hypothesis.  According  to 
this  theory,  the  solar  system  condensed  from 
a  large  rotating  nebula.  Kant’s  nebular  hypoth¬ 
esis  is  surprisingly  close  to  the  basic  ideas  of 
modern  cosmogonical  models.  In  a  similar  vein, 
Pierre  Simon  de  Laplace  suggested  in  1796  that 
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the  planets  have  formed  from  gas  rings  ejected 
from  the  equator  of  the  collapsing  Sun. 

The  main  difficulty  of  the  nebular  hypothesis 
was  its  inability  to  explain  the  distribution  of  an¬ 
gular  momentum  in  the  solar  system.  Although 
the  planets  represent  less  than  1  %  of  the  total 
mass,  they  possess  98  %  of  the  angular  momen¬ 
tum.  There  appeared  to  be  no  way  of  achieving 
such  an  unequal  distribution.  A  second  objection 
to  the  nebular  hypothesis  was  that  it  provided  no 
mechanism  to  form  planets  from  the  postulated 
gas  rings. 

Already  in  1745,  Georges  Louis  Leclerc  de 
Buff  on  had  proposed  that  the  planets  were  formed 
from  a  vast  outflow  of  solar  material,  ejected 
upon  the  impact  of  a  large  comet.  Various  catas¬ 
trophe  theories  were  popular  in  the  19th  century 
and  in  the  first  decades  of  the  20th  century  when 
the  cometary  impact  was  replaced  by  a  close  en¬ 
counter  with  another  star.  The  theory  was  devel¬ 
oped,  e.g.  by  Forest  R.  Moulton  (1905)  and  James 
Jeans  (1917). 

Strong  tidal  forces  during  the  closest  approach 
would  tear  some  gas  out  of  the  Sun;  this  mate¬ 
rial  would  later  accrete  into  planets.  Such  a  close 
encounter  would  be  an  extremely  rare  event.  As¬ 
suming  a  typical  star  density  of  0.15  stars  per 
cubic  parsec  and  an  average  relative  velocity  of 
20  km/s,  only  a  few  encounters  would  have  taken 
place  in  the  whole  Galaxy  during  the  last  5109 
years.  The  solar  system  could  be  a  unique  speci¬ 
men.  This  is  clearly  against  modern  observations 
(Chap.  22). 

The  main  objection  to  the  collision  theory  is 
that  most  of  the  hot  material  torn  off  the  Sun 
would  be  thrown  out  to  space,  rather  than  remain¬ 
ing  in  orbit  around  the  Sun.  There  also  was  no 
obvious  way  how  the  material  could  form  a  plan¬ 
etary  system. 

In  the  face  of  the  dynamical  and  statistical  dif¬ 
ficulties  of  the  collision  theory,  the  nebular  hy¬ 
pothesis  was  revised  and  modified  in  the  1940’s. 
In  particular,  it  became  clear  that  magnetic  forces 
and  gas  outflow  could  efficiently  transfer  angular 
momentum  from  the  Sun  to  the  planetary  nebula. 
The  main  principles  of  planetary  formation  are 
now  thought  to  be  reasonably  well  understood. 

The  oldest  rocks  found  on  the  Earth  are  about 
3.7  x  109  years  old;  some  lunar  and  meteorite 
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Fig.  7.24  Hubble  Space 
Telescope  images  of  four 
protoplanetary  disks, 

“ proplyds ”,  around  young 
stars  in  the  Orion  nebula. 
The  disk  diameters  are  two 
to  eight  times  the  diameter 
of  our  solar  system.  There 
is  a  T  Tauri  star  in  the 
centre  of  each  disk.  (Mark 
McCaugh- 

rean/Max-Planck-Institute 
for  Astronomy,  C.  Robert 
O’Dell/Rice  University, 
and  NASA) 


samples  are  somewhat  older.  When  all  the  facts 
are  put  together,  it  can  be  estimated  that  the  Earth 
and  other  planets  were  formed  about  4.56  x  109 
years  ago.  On  the  other  hand,  the  age  of  the 
Galaxy  is  at  least  twice  as  high,  so  the  overall 
conditions  have  not  changed  significantly  during 
the  lifetime  of  the  solar  system.  Moreover,  there 
is  even  direct  evidence  nowadays,  such  as  other 
planetary  systems  and  protoplanetary  disks,  pro¬ 
plyds  (Fig.  7.24). 

The  Sun  and  practically  the  whole  solar  sys¬ 
tem  simultaneously  condensed  from  a  rotating 
collapsing  cloud  of  dust  and  gas,  the  density  of 
which  was  some  10,000  atoms  or  molecules  per 
cm3  and  the  temperature  10-50  K.  The  elements 
heavier  than  helium  were  formed  in  the  interiors 
of  stars  of  preceding  generations,  as  will  be  ex¬ 
plained  in  Sect.  12.8.  The  collapse  of  the  cloud 
was  initiated  perhaps  by  a  shock  wave  emanating 
from  a  nearby  supernova  explosion. 

The  original  mass  of  the  cloud  must  be  thou¬ 
sands  of  Solar  masses  to  exceed  the  Jeans  mass . 


When  the  cloud  contracts  the  Jeans  mass  de¬ 
creases.  Cloud  fragments  and  each  fragment  con¬ 
tract  independently  as  explained  in  later  chapters 
of  star  formation.  One  of  the  fragments  became 
the  Sun. 

When  the  fragment  continued  its  collapse,  par¬ 
ticles  inside  the  cloud  collided  with  each  other. 
Rotation  of  the  cloud  allowed  the  particles  to  sink 
toward  the  same  plane,  perpendicular  to  the  rota¬ 
tion  axis  of  the  cloud,  but  prevented  them  from 
moving  toward  the  axis.  This  explains  why  the 
planetary  orbits  are  in  the  same  plane. 

The  mass  of  the  proto-Sun  was  larger  than 
the  mass  of  the  modern  Sun.  The  flat  disk  in  the 
plane  of  the  ecliptic  contained  perhaps  1/10  of 
the  total  mass.  Moreover,  far  outside,  the  rem¬ 
nants  of  the  outer  edges  of  the  original  cloud  were 
still  moving  toward  the  centre.  The  Sun  was  los¬ 
ing  its  angular  momentum  to  the  surrounding  gas 
by  means  of  the  magnetic  field.  When  nuclear 
reactions  were  ignited,  a  strong  solar  wind  car¬ 
ried  away  more  angular  momentum  from  the  Sun. 
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The  final  result  was  the  modern,  slowly  rotating 
Sun. 

Gravitational  and  viscous  torques  transferred 
the  angular  momentum  outwards.  The  former 
means  a  density  wave  caused  by  the  instability  of 
the  disk,  transferring  bot  mass  and  angular  mo¬ 
mentum  outwards.  Collisions  between  dust  parti¬ 
cles  increased  the  velocities  of  outer  particles  and 
slowed  down  inner  particles.  Thus  most  particles 
moved  inwards  but  the  angular  momentum  out¬ 
wards  and  the  disk  spread  out. 

Later,  when  nuclear  reactions  started,  the 
strong  solar  wind  transferred  more  angular  mo¬ 
mentum.  At  this  T  Tauri  stage  the  protosun  lost 
mass  as  much  as  10-8  Mq/ci  in  the  form  of  solar 
wind. 

Collisions  of  the  disk  particles  continued.  Ini¬ 
tially  individual  particles  stick  together  because 
of  the  weak  intermolecular  van  der  Waals  forces. 
In  less  than  10,000  years  the  particle  size  in¬ 
creased  from  a  few  micrometres  to  millimetres. 
The  growth  rate  was  then  proportional  to  the 
cross  sections  of  the  particles. 

When  the  particles  became  bigger  the  growth 
rate  increased  considerably  and  became  propor¬ 
tional  to  the  fourth  power  of  the  particle  radius. 
The  reason  for  this  was  that  the  weak  gravitation 
of  bigger  particles  started  attract  gas  and  dust.  If 
the  mass  of  a  particle  is  M  and  radius  R  and  the 
relative  velocity  of  a  dust  particle  Vo  (Fig-  7.25) 
the  effective  cross  section  of  collisions  to  the  big¬ 
ger  particle  is  s2: 


s 


2 


2  GMR\ 

Vo  y 


(7.50) 


Since  M  oc  R3  we  have  s2  oc  R4. 

The  velocity  of  the  gas  was  about  0.5  % 
smaller  than  the  orbital  velocity,  and  thus  parti¬ 
cles  moved  faster  than  gas  and  swept  away  the 
gas  and  dust.  This  resulted  in  rapidly  growing 
planetesimals ,  with  diameters  from  a  few  metres 
to  kilometres. 

Since  big  particles  were  moving  faster  than 
the  gas  they  experienced  a  small  friction  slow¬ 
ing  their  velocity.  The  effect  was  strongest  on  me¬ 
tre  size  particles.  Thus  small  planetesimals  had  to 
grow  bigger  in  a  few  thousand  years  or  drift  down 
to  the  Sun. 


Fig.  7.25  If  a  particle 
passes  a  massive  object  at  a 
close  distance  it  will  hit  the 
larger  body  and  increase  its 
mass 


When  the  planetesimals  collided  (Fig.  7.26) 
they  grew  bigger  but  the  growth  rate  was  no  more 
proportional  to  the  fourth  power  of  the  radius  but 
slower.  When  the  planetesimals  reached  the  size 
of  planets  their  mutual  gravitation  became  in¬ 
creasingly  important.  Collisions  of  planetesimals 
and  protoplanets  shaped  the  solar  system  until  it 
to  some  extent  looked  like  the  current  system. 
The  formation  of  the  Moon,  the  slow  retrograde 
of  Venus  and  the  abnormal  orientation  of  the  ro¬ 
tation  axis  of  Uranus  were  caused  by  collisions  of 
objects  of  the  size  of  Mars. 

The  formation  of  Jupiter  and  Saturn  took 
about  103— 106  years,  terrestrial  planets  106-107 
years,  Uranus  and  Neptune  107-108  years.  The 
Nice  model  (Sect.  7.12)  suggests  that  originally 
Neptune  was  closer  to  the  Sun  than  Uranus. 
Resonances  caused  Saturn,  Uranus  and  Neptune 
to  drift  farther  from  the  Sun,  whence  Neptune 
moved  outside  Uranus.  Jupiter,  on  the  other  hand, 
moved  closer  to  the  Sun. 

The  strong  perturbations  by  Jupiter  prevented 
the  formation  of  a  large  planet  between  Mars  and 
Jupiter.  The  objects  in  this  asteroid  belt  are  either 
planetesimals  or  shattered  protoplanets. 

Depending  on  the  volatility  the  matter  of  the 
solar  system  can  be  divided  roughly  into  three 
categories:  Gases ,  mainly  hydrogen  and  helium, 
consisting  of  about  98.2  %  of  the  total  mass  of 
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Fig.  7.26  A  schematic 
plot  on  the  formation  of  the 
solar  system,  (a)  A  large 
rotating  cloud,  the  mass  of 
which  was  3-4  solar 
masses,  began  to  condense, 
(b)  The  innermost  part 
condensed  most  rapidly 
and  a  disk  of  gas  and  dust 
formed  around  the 
proto-sun.  (c)  Dust 
particles  in  the  disk 
collided  with  each  other 
forming  larger  particles 
and  sinking  rapidly  to 
a  single  plane,  (d)  Particles 
clumped  together  into 
planetesimals  which  were 
of  the  size  of  present 
asteroids,  (e)  These  clumps 
drifted  together,  forming 
planet- size  bodies  which 
began  (f)  to  collect  gas  and 
dust  from  the  surrounding 
cloud,  (g)  The  strong  solar 
wind  “blew”  away  extra 
gas  and  dust;  the  planet 
formation  was  finished 
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Distance  from  the  Sun  [AU] 
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Fig.  7.27  Temperature  distribution  in  the  solar  system 
during  planet  formation.  The  present  chemical  composi¬ 
tion  of  the  planets  reflects  this  temperature  distribution. 
The  approximate  condensing  temperatures  of  some  com¬ 
pounds  have  been  indicated 

the  solar  system  and  remaining  gaseous  until  very 
close  to  the  absolute  zero.  Ices ,  about  1.4  %,  melt¬ 
ing  around  160  K  at  the  pressure  of  the  initial 
nebula.  Rocks ,  about  0.4  %,  melting  over  temper¬ 
atures  exceeding  1000  K  (Fig.  7.27). 

Planets  from  Mercury  to  Mars  consist  mainly 
of  rocks.  When  they  were  born  the  temperature 
in  that  region  was  too  high  for  gases  and  ices 
to  remain  bound  to  planets.  In  this  region  over 
99  %  of  the  matter  remained  outside  the  plan¬ 
ets.  The  temperature  distribution  is  seen  in  the 
chemical  contents  of  the  planets.  At  the  distance 
of  Mercury  the  temperature  had  decreased  below 
1400  K,  which  meant  that  compounds  of  iron  and 
nickel  could  condense  from  the  nebula.  In  fact, 
they  form  about  60  %  of  the  mass  of  Mercury. 
When  we  move  outwards  other  elements  become 
more  abundant.  At  the  distance  of  the  Earth  the 
temperature  is  about  600  K  and  near  Mars  only 
450  K.  The  mantle  of  the  Earth  contains  about 
10  %  of  iron(II)oxide  FeO.  In  Mars  there  is  con¬ 
siderably  more  FeO,  but  in  Mercury  hardly  any¬ 
thing  at  all. 

Table  7.5  gives  the  mass  distribution  of  the 
solar  system  and  Table  7.6  the  minimum  mass 
needed  for  the  existing  planets.  This  takes  into  ac¬ 
count  the  different  composition  of  the  planets  and 
the  Sun.  In  reality,  the  mass  of  the  accretion  disk 
must  be  much  bigger,  since  not  all  of  the  mass  did 
not  end  up  in  planets. 

Using  the  minimum  mass  we  can  also  calcu¬ 
late  the  required  density  distribution  of  the  ac¬ 


Table  7.5  Mass  distribution  of  the  solar  system 


Part  of  the  (%)  total  mass 

Sun 

99.80 

Jupiter 

0.10 

Comets 

0.05 

Other  planets 

0.04 

Moons  and  rings 

0.00005 

Asteroids 

0.000002 

Dust 

0.0000001 

cretion  disk.  If  the  mass  of  the  planet  is  M  and 
it  has  accreted  its  material  in  the  distance  range 
([r0,  r i])  from  a  disk  whose  density  is  p(r ),  we 
get 

/r  2tt  nr\ 

p(r)dA  =  I  /  p(r)r  dr  dO 
Jo  j To 

rn 

=  2tt  I  p(r)rdr.  (7.51) 

Jr0 

The  density  profile  of  the  disk  seems  to  obey 
ar2  law  pretty  well  except  in  the  asteroid  belt, 
where  these  is  a  clear  mass  defect  (Fig.  7.28). 

At  the  distance  of  Jupiter  and  Saturn  the  tem¬ 
perature  was  already  so  low  that  icy  bodies  could 
form.  Some  satellites  of  Saturn  are  examples  of 
such  bodies.  From  the  surrounding  cloud  the  gi¬ 
ant  planets  collected  gas  that  could  stay  around 
the  planets  because  they  were  relatively  far  from 
the  Sun.  Jupiter  and  Saturn  contain  mostly  hy¬ 
drogen  and  helium.  In  Uranus  and  Neptune  the 
content  of  these  gases  is  smaller,  possibly  around 
twenty  percent. 

Continuous  collisions  of  meteoroids,  shrink¬ 
ing  of  the  planets  under  their  own  gravity,  and 
radioactive  decay  of  relatively  short  lived  nu¬ 
clei  produced  a  lot  of  heat.  Heating  caused  par¬ 
tial  melting  of  planets,  leading  to  differentiation : 
heavier  elements  sank  down  and  lighter  ones  rose 
towards  the  surface. 

The  bombardment  continued  for  about  half  a 
billion  years.  Its  effects  are  still  seen  on  most 
solid  bodies.  For  instance,  the  Lunar  maria  are 
remnants  of  that  era.  On  the  Earth  the  tectonic 
resurfacing  and  erosion  have  destroyed  most  me¬ 
teorite  craters. 


7.11  Origin  of  the  Solar  System 


173 


Table  7.6  Minimum  mass  of  the  primordial  nebula  sition  consistent  with  the  Sun.  The  Nice  model  will  change 

needed  for  the  planets.  The  factor  is  a  value  by  which  the  the  values  of  this  table  and  Fig.  7.28 

mass  of  the  planet  has  to  be  multiplied  to  make  the  compo- 


Distance  [au] 

Mass  Earth  =  1 

Factor 

Total  mass 

Cumulative  mass 

Mercury 

0.4 

0.055 

350 

19.3 

19 

Venus 

0.7 

0.815 

270 

220.1 

239 

Earth  +  Moon 

1.0 

1.012 

235 

237.8 

477 

Mars 

1.5 

0.107 

235 

25.1 

502 

Asteroids 

2.8 

0.002 

200 

0.4 

503 

Jupiter 

5.2 

317.89 

5 

1589.5 

2092 

Saturn 

9.6 

95.17 

8 

761.4 

2853 

Uranus 

19.2 

14.56 

15 

218.4 

3072 

Neptune 

30.1 

17.24 

20 

344.8 

3417 

Pluto 

40 

0.005 

70 

0.4 

3417 

Fig.  7.28  Surface  density 
[kg/m2]  of  the  accretion 
disc  as  a  function  of 
distance.  The  density 
follows  approximately  an 
r~ 2  law.  Especially  around 
the  asteroid  belt  there 
seems  to  be  a  mass  deficit, 
indicating  that  a 
considerable  amount  of 
matter  has  been  removed 
elsewhere.  The  vertical 
lines  separate  regions  from 
which  each  planet  has 
accreted  its  material 


distance  from  the  Sun  [AU] 


Due  to  the  perturbations  by  large  planets  the 
'‘leftover”  planetesimals  collided  to  planets  or 
were  thrown  out  to  the  outskirts  of  the  solar  sys¬ 
tem  or  even  out  to  the  interstellar  space.  What  re¬ 
mained  where  mainly  the  asteroids  currently  on 
stable  orbits.  Lots  of  low-density  objects,  comets, 
were  thrown  to  the  outer  regions  of  the  solar  sys¬ 
tem.  These  form  the  current  Oort  cloud .  The  total 
mass  of  the  Oort  cloud  may  be  even  40  M0  and 
it  may  contain  billions  of  comets. 

Also  the  small  bodies  beyond  the  orbit  of  Nep¬ 
tune  and  the  somewhat  more  distant  Kuiper  belt 
may  have  originated  nearer  to  the  Sun. 

Planetary  formation  ended  when  the  nuclear 
reactions  of  the  Sun  started  and  the  Sun  entered 
its  T  Tauri  stage  (Sect.  14.3).  The  strong  solar 
wind  caused  the  Sun  to  lose  mass  and  angular 
momentum.  The  mass  loss  was  about  10-6  M0 


a  year,  yet  altogether  maybe  less  than  0.1  M0. 
The  solar  wind  blew  away  the  gas  dust  still  in  the 
interplanetary  space,  and  thus  the  planets  could 
not  accrete  any  more  matter. 

The  solar  wind  or  radiation  pressure  has  no 
effect  on  millimetre-  and  centimetre-sized  parti¬ 
cles.  However,  they  will  drift  into  the  Sun  because 
of  the  Poynting-Robertson  effect ,  first  introduced 
by  John  P.  Poynting  in  1903.  Later  H.P.  Robertson 
derived  the  effect  by  using  the  theory  of  relativity. 
When  a  small  body  absorbs  and  emits  radiation,  it 
loses  its  orbital  angular  momentum  and  the  body 
spirals  to  the  Sun.  At  the  distance  of  the  asteroid 
belt,  this  process  takes  only  a  million  years  or  so. 
Therefore  the  meteors  wee  see  nowadays  must  be 
much  younger  than  the  solar  system.  A  relatively 
big  fraction  of  them  is  material  disrupted  from 
comets. 
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7.12  Nice  Models 

It  has  been  assumed  that  the  distances  of  the  plan¬ 
ets  have  not  changed  much  since  they  were  born. 
The  most  essential  feature  of  the  Nice  model  is 
that  the  giant  planets  were  bom  much  closer  to 
the  Sun.  The  model  is  based  on  a  large  number 
of  computer  simulations,  carried  out  in  the  Cote 
d’Azur  observatory  near  Nice. 

The  planets  drifted  to  their  current  orbits  due 
to  their  mutual  gravity  and  resonances.  At  the 
same  time  planetary  perturbations  made  the  left¬ 
over  material  either  to  collide  with  planets  or 
move  outwards  to  the  asteroid  belt,  Kuiper  belt 
and  Oort  cloud. 

The  original  Nice  model  could  not  properly 
explain  all  properties  of  the  solar  system,  like  the 
structure  of  the  Kuiper  belt.  These  problems  were 
fixed  in  the  newer  Nice  model  2. 

There  are  several  versions  of  the  models  with 
varying  details.  They  seem  to  approach  the  cor¬ 
rect  explanation,  it  may  still  be  too  early  to  dis¬ 
cuss  here  their  rather  complicated  details  before 
they  have  been  confirmed  more  convincingly. 


Box  7.1  (Tides)  Let  the  tide  generating  body, 
the  mass  of  which  is  M  to  be  at  point  Q  at  a  dis¬ 
tance  d  from  the  centre  of  the  Earth.  The  poten¬ 
tial  V  at  the  point  A  caused  by  the  body  Q  is 


V(A)  = 


GM 

s 


where  s  is  the  distance  of  the  point  A  from  the 
body  Q. 


Applying  the  cosine  law  in  the  triangle 
OAQ ,  the  distance  s  can  be  expressed  in  terms 
of  the  other  sides  and  the  angle  z  —  AO  Q 

s2  =  d2  +  r2  —  2 dr  cos  z, 


where  r  is  the  distance  of  the  point  A  from  the 
centre  of  the  Earth.  We  can  now  rewrite  (1) 


V(A)  = 


GM 

\! d2  +  r2  —  2 dr  cos  z 


When  the  denominator  is  expanded  into  a  Tay¬ 
lor  series 


(1+*H 


1  3  9 

^  1 - v  +  -x - 

2  8 


where 


v  = 


r 

-  2- cos z 
d 


and  ignoring  all  terms  higher  than  or  equal  to 
1  / d4  one  obtains 


GM  GM 

V(A)  =  — - — I - -r-r  cosz 

d  dz 

GMr 2  1 


+ 


d3 


-(3cos2z  —  l).  (3) 


The  gradient  of  the  potential  V (A)  gives 
a  force  vector  per  mass  unit.  The  first  term  of 
(3)  vanishes,  and  the  second  term  is  a  constant 
and  independent  of  r.  It  represents  the  central 
motion.  The  third  term  of  the  force  vector,  how¬ 
ever,  depends  on  r.  It  is  the  main  term  of  the 
tidal  force.  As  one  can  see,  it  depends  inversely 
on  the  third  power  of  the  distance  d.  The  tidal 
forces  are  diminished  very  rapidly  when  the 
distance  of  a  body  increases.  Therefore  the  tidal 
force  caused  by  the  Sun  is  less  than  half  of  that 
of  the  Moon  in  spite  of  much  greater  mass  of 
the  Sun. 

We  may  rewrite  the  third  term  of  (3)  as 


V2  =  2D 


where 

3  r 2 
D  =  -GM—r 

4  d3 

is  called  Doodson’s  tidal  constant.  It’s  value 
for  the  Moon  is  2.628  m2  s-2  and  for  the  Sun 
1.208  m2  s~2.  We  can  approximate  that  z  is  the 
zenith  angle  of  the  body.  The  zenith  angle  z  can 
be  expressed  in  terms  of  the  hour  angle  h  and 
declination  8  of  the  body  and  the  latitude  0  of 
the  observer 

cos  z  =  cos  h  cos  8  cos  0  +  sin  8  sin  0 . 


7.13  Examples 


175 


Inserting  this  into  (4)  we  obtain  after  a  lengthy 
algebraic  operation 

Vz  =  D  ^cos2  0  cos2  8  cos  2  h 
±  sin  20  cos  28  cos  h 

si,rS-I)) 

=  D(S  +  T  +  Z).  (5) 

Equation  (5)  is  the  traditional  basic  equation  of 
the  tidal  potential,  the  Laplace's  tidal  equation. 

In  (5)  one  can  directly  see  several  charac¬ 
teristics  of  tides.  The  term  S  causes  the  semi¬ 
diurnal  tide  because  it  depends  on  cos  2 h.  It  has 
two  daily  maxima  and  minima,  separated  by  12 
hours,  exactly  as  one  can  obtain  in  following 
the  ebb  and  flood.  It  reaches  its  maximum  at 
the  equator  and  is  zero  at  the  poles  (cos2  0). 

The  term  T  expresses  the  diurnal  tides 
(cos  A).  It  has  its  maximum  at  the  latitude  ±45° 
and  is  zero  at  the  equator  and  at  the  poles 
(sin  20) .  The  third  term  Z  is  independent  of  the 
rotation  of  the  Earth.  It  causes  the  long  period 
tides ,  the  period  of  which  is  half  the  orbital  pe¬ 
riod  of  the  body  (about  14  days  in  the  case  of 
the  Moon  and  6  months  for  the  Sun).  It  is  zero 
at  the  latitude  ±35.27°  and  has  its  maximum 
at  the  poles.  Moreover,  the  time  average  of  Z 
is  non-zero,  causing  a  permanent  deformation 
of  the  Earth.  This  is  called  the  permanent  tide. 
It  slightly  increases  the  flattening  of  the  Earth 
and  it  is  inseparable  from  the  flattening  due  to 
the  rotation. 

The  total  value  of  the  tidal  potential  can  be 
computed  simply  adding  the  potentials  caused 
by  the  Moon  and  the  Sun.  Due  to  the  tidal 
forces,  the  whole  body  of  the  Earth  is  de¬ 
formed.  The  vertical  motion  A r  of  the  crust  can 
be  computed  from 

v2 

A r  =  h — ^0.06V2  [m],  (6) 

8 

where  g  is  the  mean  free  fall  acceleration,  g  ~ 
9.81  ms-2  and  A  is  a  dimensionless  number, 
the  Love  number ,  h  ~  0.6,  which  describes  the 


elasticity  of  the  Earth.  In  the  picture  below, 
one  can  see  the  vertical  motion  of  the  crust  in 
Helsinki,  Finland  (0  =  60°,  X  =  25°)  in  Jan¬ 
uary  1995.  The  non-zero  value  of  the  temporal 
mean  can  already  be  seen  in  this  picture. 


The  tides  have  other  consequences,  too.  Be¬ 
cause  the  Earth  rotates  faster  than  the  Moon  or¬ 
bits  the  Earth,  the  tidal  bulge  does  not  lie  on 
the  Moon-Earth  line  but  is  slightly  ahead  (in 
the  direction  of  Earth’s  rotation),  see  below. 


Due  to  the  drag,  the  rotation  of  the  Earth 
slows  down  by  about  1-2  ms  per  century.  The 
same  reason  has  caused  the  Moon’s  period 
of  rotation  to  slow  down  to  its  orbital  period 
and  the  Moon  faces  the  same  side  towards  the 
Earth.  The  misaligned  bulge  pulls  the  Moon 
forward.  The  acceleration  causes  the  increase 
in  the  semimajor  axis  of  the  Moon,  about  3  cm 
per  year. 


7.13  Examples 

Example  7.1  (Sidereal  and  Synodic  Period)  The 
time  interval  between  two  successive  oppositions 
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of  Mars  is  779.9  d.  Calculate  the  semimajor  axis 
of  Mars’  orbit. 

The  synodic  period  is  779.9  d  =  2.14  years. 
We  obtain  from  (7.2) 

1  1  1 

—  = - =  0.53  =>  Pi  —  1.88  a. 

P2  1  2.14 

By  using  Kepler’s  third  law  ( m  Af©),  the  semi¬ 

major  axis  is  found  to  be 


where  P  =  1  d  =  24  h.  The  total  energy  is  thus 


w-f. 


ho 

-ho 

P 


Sq (sin  5  sin  0  +  cos  8  cos  0  cos  h) 


x 


2n 

SoP 


d  h 


71 


(ho  sin  8  sin  0  +  cos  8  cos  0  sin  ho) , 


where 


a  =  P2^  =  1.882/3  =  1.52  au. 

Example  7.2  (Solar  Energy  Flux  on  the  Earth) 
Calculate  the  diurnal  solar  energy  flux  per  unit 
area  at  the  distance  of  the  Earth. 

The  solar  flux  density  outside  the  Earth’s  at¬ 
mosphere  (the  solar  constant)  is  So  =  1370  W/m2. 
Consider  a  situation  at  latitude  0,  when  the  solar 
declination  is  8.  If  the  atmospheric  extinction  is 
neglected,  the  flux  density  on  the  surface  is 

S  =  So  sin  a, 


ho  =  arccos(—  tan  8  tan  0) . 

For  example  near  the  equator  (0  =  0°)  cos  ho  =  0 
and 

o  SoP 

W(0  =  0°)  =  —  cos  <5. 

71 

At  those  latitudes  where  the  Sun  will  not  set,  ho  = 
it  and 

Wdrc  =  SoP  sin 8  sin 0 . 

Near  the  poles,  the  Sun  is  always  circumpolar 
when  above  the  horizon,  and  so 

W(0  =  90°)  =  SqP  sin 5. 


where  a  is  the  elevation  of  the  Sun.  We  can  write 
sin  a  as  a  function  of  latitude,  declination,  and 
hour  angle  h : 

sin  a  =  sin  8  sin  0  +  cos  8  cos  0  cos  h . 

On  a  cloudless  day,  the  energy  is  received  be¬ 
tween  sunrise  and  sunset.  The  corresponding 
hour  angles  can  be  obtained  from  the  equation 
above,  when  a  =  0: 


Interestingly  enough,  during  the  summer  when 
the  declination  of  the  Sun  is  large,  the  polar  ar¬ 
eas  receive  more  energy  than  the  areas  close  to 
the  equator.  This  is  true  when 

W(4>  =  90°)  >  W(0  =  0°) 

4^  SoP  sin5  >  SoP  cos8/tv 
4^  tan 8  >  1/tt 
4^  8  >11.1°. 


cos  ho  =  —  tan  8  tan  0 . 

In  the  course  of  one  day,  the  energy  received  on 
a  unit  area  is 

/ho 

Sdt. 

-ho 

The  hour  angle  h  is  expressed  in  radians,  so  the 
time  t  is 


The  declination  of  the  Sun  is  greater  than  this 
about  two  months  every  summer. 

However,  atmospheric  extinction  diminishes 
these  values,  and  the  loss  is  at  its  greatest  at  the 
poles,  where  the  elevation  of  the  Sun  is  always 
relatively  small.  Radiation  must  penetrate  thick 
layers  of  the  atmosphere  and  the  path  length  is 
comparable  to  1/sin  a.  If  it  is  assumed  that  the 
fraction  k  of  the  flux  density  reaches  the  surface 
when  the  Sun  is  at  zenith,  the  flux  density  when 
the  Sun  is  at  the  elevation  a  is 


h 
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The  total  energy  received  during  one  day  is  thus 


This  cannot  be  solved  in  a  closed  form  and  nu¬ 
merical  methods  must  be  used. 

The  figure  on  next  page  shows  the  daily  re¬ 
ceived  energy  W  [kW  h/m2]  during  a  year  at  lat¬ 
itudes  0  =  0°,  60°,  and  90°  without  extinction, 
and  when  k  =  0.8,  which  is  close  to  the  real  value. 


Example  7.3  (Magnitude  of  a  Planet)  The  ap¬ 
parent  magnitude  of  Mars  during  the  1975  op¬ 
position  was  m\  =  —1.6  and  the  distance  to  the 
Sun,  r\  =  1.55  au.  During  the  1982  opposition, 
the  distance  was  r2  =  1.64  au.  Calculate  the  ap¬ 
parent  magnitude  in  the  1982  opposition. 

At  opposition,  the  distance  of  Mars  from  the 
Earth  is  A  =  r  —  1 .  The  observed  flux  density  de¬ 
pends  on  the  distances  to  the  Earth  and  the  Sun, 


F  oc 


1 

r2  A2 


Using  the  magnitude  formula  (4.9)  we  obtain 


m\  —  m2  =  —2.5  lg 


r\(ri 

r}(r\ 


l)2 

l)2 


7*2  (/*2 

=>  m2  =  m\  +  51g — — 

n(n 


1) 

1) 


—  “1.6  +  5  lg 


1.64x0.64 
1.55  x  0.55 


«-l.l. 


The  same  result  is  obtained  if  (7.38)  is  separately 
written  for  both  oppositions. 


Example  7.4  (The  Brightness  of  Venus)  Find 
the  instant  when  Venus  is  brightest  if  the  bright¬ 
ness  is  proportional  to  the  projected  size  of  the 
illuminated  surface.  The  orbits  are  assumed  to  be 
circular. 

The  size  of  the  illuminated  surface  is  the  area 
of  the  semicircle  ACE db  half  the  area  of  the  el¬ 
lipse  ABCD.  The  semiaxes  of  the  ellipse  are 
R  and  R  cos  a.  If  the  radius  of  the  planet  is  R , 
the  illuminated  area  is 

R2  1  7r  2 

jt - 1 — ttR  x  Rcosa  =  —R  (1  +  coso0, 

2'  2'  2' 


where  a  is  the  phase  angle.  The  flux  density  is 
inversely  proportional  to  the  square  of  the  dis¬ 
tance  A.  Thus 


F  oc 


1  +  COS  O' 

A2 


A 


The  cosine  formula  yields 

=  r2  +  A2  —  2  A r  cos  oc. 


When  cos  a  is  solved  and  inserted  in  the  flux  den¬ 
sity  we  obtain 


F  oc 


2 A r  +  r2  +  A2  -  M| 
2r  A3 


The  minimum  of  the  equation  yields  the  distance 
where  Venus  is  brightest: 


dF 

9A 


4rA  +  3r2-3M2  +  A2 
2r  A4 


=>►  A  =  —2  r  zb 


Jr2  +  3  Af|. 


If  r  =  0.723  au  and  R ®  =  1  au,  the  distance  is 
A  =  0.43  au  and  the  corresponding  phase  angle 
is  a  =  118°. 
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Thus  Venus  is  brightest  shortly  after  the  largest  If  T  =  273  K,  we  the  distance  is  r  =  1 .4  au;  when 
eastern  elongation  and  before  the  largest  western  T  =  373  K,  we  get  r  =  0.8  au. 
elongation.  From  the  sine  formula  we  obtain 


sin  s  sin  a 

r  M®  ' 

The  corresponding  elongation  is  s  =  40°,  and 


1  +  cos  Of 
2 


x  100  %  =  27  % 


of  the  surface  is  seen  lit. 


7.14  Exercises 

Exercise  7.1  What  is  the  greatest  possible  elon¬ 
gation  of  Mercury,  Venus  and  Mars?  How  long 
before  sunrise  or  after  sunset  is  the  planet  visible? 
Assume  that  the  declination  of  the  planet  and  the 
Sun  is  8  =  0°. 


Example  7.5  (Magnitude  of  an  Asteroid)  The 
parameters  of  the  asteroid  44  Nysa  are  H  = 
6.929,  G\  =  0.050,  G2  =  0.67  and  the  semima¬ 
jor  axis  of  the  orbit  a  =  2.42  au.  Find  V(l,  1°). 
What  is  the  apparent  magnitude  at  phase  angles 
0°  and  1°? 

Values  of  the  basis  functions  are  <£i(l°)  = 
0.9667,  02(1°)  =  0.9900,  <Z>3(1°)  =  0.577.  Us¬ 
ing  these  we  get  the  absolute  magnitude  at  the 
phase  angle  1 0 : 

V(l,  1°)  =  6.929  -  2.51g[0.050  x  0.9667 
+  0.67  x  0.990 

+  (1  -0.050- 0.67)  x  0.577] 

=  7.076.  (7.52) 


Exercise  7.2  (a)  What  is  the  greatest  possible 
geocentric  latitude  of  Venus,  i.e.  how  far  from  the 
Sun  can  the  planet  be  at  the  inferior  conjunction? 
Assume  the  orbits  are  circular. 

(b)  When  is  the  situation  possible?  The  longi¬ 
tude  of  the  ascending  node  of  Venus  is  77°. 

Exercise  7.3  (a)  Find  the  daily  retrograde  appar¬ 
ent  motion  of  an  exterior  planet  at  its  opposition. 
Assume  that  the  planet  and  the  Earth  have  circu¬ 
lar  orbits. 

(b)  Pluto  was  found  in  1930  from  two  plates, 
exposed  6  days  apart  during  the  opposition  of  the 
planet.  On  those  plates  one  degree  corresponded 
to  3  cm.  How  much  (in  cm)  had  Pluto  moved  be¬ 
tween  the  exposures?  How  much  does  a  typical 
main  belt  asteroid  move  in  the  same  time? 


Near  the  opposition  we  can  approximate  A  = 
r  —  1  =  1 .42  au.  The  apparent  magnitude  at  the 
opposition  is  then 

m  =  6.929  +  5  lg(  1 .42  x  2.42)  -  2.5  lg  0(0°) 

=  6.929  +  51g3.36  =  9.561.  (7.53) 

When  the  phase  angle  is  1°  the  absolute  magni¬ 
tude  is  0.147  greater  than  at  the  opposition.  The 
apparent  magnitudes  differ  by  the  same  amount; 
hence  m (a  =  1°)  =  9.71. 


Example  7.6  Find  the  distance  of  a  comet  from 
the  Sun  when  its  temperature  reaches  0  °C  and 
100  °C.  Assume  the  Bond  albedo  of  the  comet 
is  0.05. 

Solve  r  from  (7.47): 


r  = 


1  -  A 


Exercise  7.4  A  planet  is  observed  at  the  oppo¬ 
sition  or  inferior  conjunction.  Due  to  the  finite 
speed  of  light  the  apparent  direction  of  the  planet 
differs  from  the  true  place.  Find  this  difference  as 
a  function  of  the  radius  of  the  orbit.  You  can  as¬ 
sume  the  orbits  are  circular.  Which  planet  has  the 
largest  deviation? 

Exercise  7.5  The  angular  diameter  of  the  Moon 
is  0.5°.  The  full  moon  has  an  apparent  magnitude 
of  —12.5  and  the  Sun  —26.7.  Find  the  geomet¬ 
ric  and  Bond  albedos  of  the  Moon,  assuming  that 
the  reflected  light  is  isotropic  (into  a  solid  angle 
In  sterad). 

Exercise  7.6  The  eccentricity  of  the  orbit  of 
Mercury  is  0.206.  How  much  does  the  apparent 
magnitude  of  the  Sun  vary  as  seen  from  Mercury? 
How  does  the  surface  brightness  of  the  Sun  vary? 
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Exercise  7.7  An  asteroid  with  a  diameter  of  What  do  you  think  about  the  chances  of  finding 
100  m  approaches  the  Earth  at  a  velocity  of  the  asteroid  well  in  advance  the  crash? 

30  kms-1.  Find  the  apparent  magnitude  of  the 

asteroid  (a)  one  week,  (b)  one  day  before  the  col-  Exercise  7.8  Find  the  centripetal  acceleration  at 
lision.  Assume  that  the  phase  angle  is  a  =  0°  and  the  poles  and  on  the  equator  of  the  Earth, 
the  geometric  albedo  of  the  asteroid  is  p  =  0.1. 


Objects  of  the  Solar  System 
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Our  solar  system  contains  eights  planets  orbit¬ 
ing  the  Sun,  dwarf  planets,  asteroids,  comets  and 
meteors,  as  well  satellites  orbiting  planets  and 
smaller  ring  particles.  In  this  chapter  the  proper¬ 
ties  of  these  different  object  are  discussed. 


8.1  Mercury 

Mercury  is  the  innermost  planet  of  the  solar  sys¬ 
tem.  Its  diameter  is  4800  km.  Mercury  is  always 
found  in  the  vicinity  of  the  Sun;  its  maximum 
elongation  is  only  28°.  Observations  are  diffi¬ 
cult  because  Mercury  is  always  seen  in  a  bright 
sky  and  close  to  the  horizon.  Mercury  has  phases 
like  the  Moon.  When  it  is  closest  to  the  Earth 
in  the  inferior  conjunction,  the  dark  side  of  the 
planet  is  toward  us.  And  when  the  whole  illumi¬ 
nated  hemisphere  is  towards  the  Earth  Mercury 
is  behind  the  Sun  and  farthest  from  us.  A  couple 
of  times  in  a  century  Mercury  transits  the  solar 
disk  (Sect.  7.5).  Observations  during  transits  have 
shown  that  Mercury  does  not  have  an  appreciable 
atmosphere. 

The  first  maps  of  Mercury  were  drawn  at  the 
end  of  the  19th  century  but  the  reality  of  the  de¬ 
tails  was  not  confirmed.  As  late  as  in  the  begin¬ 
ning  of  the  1960’s,  it  was  believed  that  Mercury 
always  turns  the  same  side  toward  the  Sun.  How¬ 
ever,  measurements  of  the  thermal  radio  emission 
showed  that  the  temperature  of  the  night  side  is 
too  high,  about  100  K,  instead  of  almost  absolute 
zero.  Finally,  the  rotation  period  was  established 
by  radar. 


Fig.  8.1  Length  of  day  in  Mercury.  The  positions  of  Mer¬ 
cury  during  the  first  revolution  are  shown  outside  the  el¬ 
lipse.  Upon  returning  to  the  aphelion,  the  planet  has  turned 
540°  (1  ^  revolutions).  After  two  full  cycles  the  planet  has 
rotated  three  times  around  its  axis  and  the  same  side  points 
toward  the  Sun.  The  length  of  the  day  is  176  d,  longer  than 
on  any  other  planet 

One  revolution  around  the  Sun  takes  88  days. 
The  rotation  period  is  two-thirds  of  this,  59  days. 
This  means  that  every  second  time  the  planet  is 
in,  say,  perihelion,  the  same  hemisphere  faces  the 
Sun  (Fig.  8.1).  This  kind  of  spin-orbit  coupling 
can  result  from  tidal  forces  exerted  by  a  central 
body  on  an  object  moving  in  a  fairly  eccentric  or¬ 
bit. 

Re-examination  of  old  observations  revealed 
why  Mercury  had  been  presumed  to  rotate  syn¬ 
chronously.  Owing  to  its  geometry,  Mercury  is 
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Fig.  8.2  Mercury  imaged  by  the  Messenger  probe.  The 
centre  of  the  upper  mosaic  image  is  at  the  intersection 
of  the  equator  and  meridian  of  the  planet.  The  ray  crater 
Debussy  is  near  the  lower  edge.  The  mosaic  image  on 
the  opposing  page  shows  the  opposite  hemisphere.  The 
circled  area  is  the  over  1500  km  wide  Caloris  basin. 
Below  it  a  more  easily  distinguished  225  km  wide  cir¬ 
cular  depression  is  the  Mozart  basin.  (NASA/Johns  Hop- 

easiest  to  observe  in  spring  and  autumn.  In  six 
months,  Mercury  orbits  twice  around  the  Sun,  ro¬ 
tating  exactly  three  times  around  its  own  axis. 
Consequently,  during  observations,  the  same  side 
was  always  facing  the  Sun!  The  details  visible  on 
the  surface  are  very  obscure  and  the  few  excep¬ 
tional  observations  were  interpreted  as  observa¬ 
tional  errors. 

Since  the  rotation  period  of  Mercury  is  r*  = 
58.6  d  ans  the  orbital  period  P  =  87.97  d,  (2.43) 
shows  that  the  length  of  day  is  r  =  176  d,  or  two 
Mercury’s  years.  The  rotation  axis  is  almost  per¬ 
pendicular  to  the  orbital  plane. 

The  mean  distance  from  the  Sun  is  0.39  au. 
The  eccentricity  of  the  orbit  is  0.21,  which  means 


kins  University  Applied  Physics  Laboratory/Camegie  In¬ 
stitution  of  Washington).  Below :  A  detail  of  the  surface 
photographed  by  Mariner  10  during  it’s  first  encounter 
with  Mercury  in  1974.  The  scarp  is  about  350  kilome¬ 
tres  long  and  transects  two  craters  35  and  55  kilometres 
in  diameter.  It  is  up  to  2  km  high  in  some  places  and  it  ap¬ 
pears  to  be  a  fault  produced  by  compression  of  the  crust. 
(NASA/JPL/Northwestern  University) 

that  the  distance  varies  between  0.31  and  0.47  au. 
Because  of  the  high  eccentricity,  the  surface  tem¬ 
perature  of  the  subsolar  point  varies  substantially: 
at  the  perihelion,  the  temperature  is  about  700  K; 
at  the  aphelion,  it  is  100  K  lower.  Temperature 
variations  on  Mercury  are  the  most  extreme  in  the 
solar  system  because  in  the  night  side  the  temper¬ 
ature  drops  below  100  K. 

The  precession  of  the  perihelion  of  Mercury 
is  more  than  0.15°  per  century.  When  the  New¬ 
tonian  perturbations  are  subtracted,  there  remains 
an  excess  of  43".  This  is  fully  explained  by  the 
general  theory  of  relativity.  The  explanation  of 
the  perihelion  precession  was  one  of  the  first  tests 
of  the  general  theory  of  relativity. 
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Fig.  8.2  (Continued) 
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The  first  spacecraft  studying  Mercury  was  the 
US  Mariner  10  that  passed  Mercury  three  times 
in  1974  and  1975.  The  orbital  period  of  Mariner 
10  around  the  Sun  was  exactly  twice  the  period 
of  Mercury.  The  two-thirds-factor  meant  that  the 
same  side  of  the  planet  was  illuminated  during 
every  fly-by!  The  other  side  remained  unknown. 
It  was  only  in  2004  that  the  Messenger  (MErcury 
Surface,  Space  ENvironment,  GEochemistry,  and 
Ranging)  mapped  the  whole  surface  of  Mercury, 
first  on  three  fly-bys  in  2008-2009  and  from  an 
orbit  around  Mercury  in  2011-2015.  The  next 
probe  will  be  the  BepiColombo  built  by  the  Eu¬ 
ropean  Space  Agency  ESA.  The  launch  is  sched¬ 
uled  for  2017,  and  after  several  fly-bys  it  should 
settle  on  an  orbit  around  Mercury  in  2024. 

Since  Mercury  has  no  satellites,  precise  values 
of  its  mass  and  density  could  be  calculated  only 
after  Mariner’s  flight. 

The  Mariner  10  data  revealed  a  moon-like 
landscape.  Mercury’s  surface  is  covered  by  im¬ 
pact  craters  (Fig.  8.2),  indicating  that  the  surface 
is  old  and  undisturbed  by  continental  drift  or  vol¬ 
canic  eruptions.  There  are  some  signs  of  volcanic 
activity,  but  they  are  very  old,  possibly  over  a  bil¬ 
lion  years. 

There  are  also  some  circular  formations  re¬ 
sembling  lunar  maria,  formed  by  impacts  of 
larger  object  and  filled  by  lava  seeping  from  the 
interior  of  the  planet.  The  largest  circular  area  is 
the  1500  km  wide  Caloris  Basin.  The  shock  wave 
produced  by  the  Caloris  impact  was  focused  to 
the  antipodal  point,  breaking  the  crust  into  com¬ 
plex  blocks  in  a  large  area,  the  diameter  of  which 
is  hundreds  of  kilometers.  This  region  is  named 
the  Weird  Terrain. 

There  are  also  faults  that  were  possibly  pro¬ 
duced  by  compression  of  the  crust.  The  volume 
change  probably  was  due  to  the  cooling  of  the 
planet. 

Mercury’s  relatively  small  size  and  proxim¬ 
ity  to  the  Sun,  resulting  in  low  gravity  and  high 
temperature,  are  the  reasons  for  its  lack  of  atmo¬ 
sphere.  There  is  a  layer  made  up  of  atoms  blasted 
off  the  surface  by  the  solar  wind.  The  tenuous  “at¬ 
mosphere”  is  composed  mainly  of  oxygen,  nitro¬ 
gen,  and  helium.  The  atoms  quickly  escape  into 
space  and  are  constantly  replenished. 


Due  to  the  absence  of  an  atmosphere,  the  tem¬ 
perature  on  Mercury  drops  very  rapidly  after  sun¬ 
set.  The  rotational  axis  is  almost  perpendicular 
to  the  orbital  plane;  therefore  it  is  possible  that, 
close  to  the  poles,  there  are  areas  where  the  tem¬ 
perature  is  permanently  below  the  freezing  point. 
Radar  echos  from  the  surface  of  Mercury  show 
several  anomalously  reflective  and  highly  depo¬ 
larised  features  at  the  north  and  south  poles. 
Some  of  these  areas  can  be  addressed  to  the 
craters,  the  bottoms  of  which  are  permanently  in 
shadow.  One  candidate  of  the  radar-bright  fea¬ 
tures  is  water  ice  that  has  survived  in  the  perma¬ 
nent  shadow. 

Existence  of  ice  was  confirmed  by  the  Mes¬ 
senger  probe.  At  the  bottoms  of  some  craters  the 
temperature  never  exceeds  100  K.  The  ice  is  cov¬ 
ered  by  regolith  that  prevents  the  ice  from  subli¬ 
mating  and  evaporating  to  space.  It  has  been  esti¬ 
mated  that  the  total  amount  of  ice  could  be  even 
1/1000  of  the  ice  in  the  Antarctic. 

It  has  been  said  that  Mercury  looks  like  the 
Moon  from  the  outside  but  is  terrestrial  from  the 
inside.  According  to  theoretical  models,  the  in¬ 
ternal  structure  is  similar  to  that  of  the  Earth  but 
the  core  is  substantially  larger.  The  density  of  the 
planet  is  about  the  same  as  that  of  the  Earth,  in¬ 
dicating  that  the  size  of  the  Fe-Ni  core  is  roughly 
about  75  %  of  the  planet’s  radius.  The  thickness 
of  the  mantle  is  only  500-700  km  and  that  of  the 
crust  100-300  km. 

Due  to  the  vicinity  of  the  Sun,  the  temperature 
of  the  primeval  nebula  at  the  distance  of  Mercury 
was  quite  high  during  planetary  formation.  Thus 
the  relative  abundances  of  the  volatile  elements 
are  smaller  than  on  any  other  terrestrial  planet. 

The  Sun  creates  strong  tides  on  Mercury,  with 
main  periods  of  44  and  88  days.  Measuring  the 
tidal  variation  is  difficult  due  to  its  slowness,  but 
it  has  bee  calculated  that  the  vertical  motion  near 
the  equator  could  be  a  couple  of  meters.  (The 
crust  of  the  Earth  moves  about  30  cm  due  to  the 
lunar  tides.) 

Mercury  has  a  weak  magnetic  field,  about  1  % 
as  strong  as  that  of  the  Earth.  The  presence  of 
the  magnetic  field  is  unexpected  because  Mer¬ 
cury  is  much  smaller  than  the  Earth  and  it  rotates 
slowly.  According  to  the  dynamo  theory,  a  mag- 
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Fig.  8.3  The  phases  of  Venus  were  discovered  by  Galileo  Galilei  in  1610.  This  drawing  illustrates  how  the  apparent 
size  of  Venus  changes  with  phase.  The  planet  is  far  behind  the  Sun  when  the  illuminated  side  faces  the  Earth 


netic  field  is  generated  by  flows  in  a  liquid,  elec¬ 
trically  conducting  core.  The  magnetic  field  can¬ 
not  be  a  remnant  from  ancient  times,  since  the 
internal  temperature  of  the  planet  must  have  ex¬ 
ceeded  the  critical  Curie  point.  Therefore,  it  must 
be  assumed  that  a  part  of  the  core  is  molten. 
Possibly  the  deformations  caused  by  tides  and 
the  friction  releasing  heat  keep  the  core  molten 
and  maintain  mass  flows  generating  the  magnetic 
field. 


8.2  Venus 

Venus  is  the  brightest  object  in  the  sky,  after  the 
Sun  and  the  Moon.  Like  Mercury,  Venus  can  be 
seen  only  in  the  morning  or  in  the  evening  sky. 
Sometimes  it  is  possible  to  see  Venus  even  if  the 
Sun  is  above  the  horizon,  if  its  exact  position 
is  known.  In  antiquity,  Venus  was  thought  to  be 
two  different  planets,  Hesperos  and  Phosphorus , 
evening  star  and  morning  star. 

The  maximum  elongation  of  Venus  is  about 
47°.  Venus  is  a  remarkable  object  when  shining 
in  the  dark  sky  at  its  brightest,  35  days  before  or 
after  the  inferior  conjunction,  when  one-third  of 
the  surface  is  seen  lit  (Fig.  8.3). 

At  the  inferior  conjunction,  the  Earth- Venus 
distance  is  only  42  million  km.  The  diameter 
of  Venus  is  about  12,000  km,  which  means  that 
the  apparent  diameter  can  be  as  large  as  one  arc 
minute.  Under  favourable  conditions  it  is  even 
possible  to  see  the  shape  of  the  crescent  Venus 
with  binoculars.  At  the  superior  conjunction,  the 
apparent  diameter  is  only  10  arc  seconds. 

Venus  is  covered  by  clouds.  Its  surface  is 
nowhere  visible;  only  featureless  yellowish  cloud 
tops  can  be  seen  (Fig.  8.4).  Therefore,  even  the 
rotation  period  was  long  unknown. 


The  chemical  composition  of  the  Venusian 
atmosphere  was  known  prior  to  the  space  age. 
Spectroscopic  observations  revealed  CO2,  but  no 
oxygen.  About  98  %  of  the  atmosphere  is  car¬ 
bon  dioxide.  Some  clues  about  the  cloud  compo¬ 
sition  were  obtained  from  polarimetric  observa¬ 
tions.  The  famous  French  planetary  astronomer 
Bernard  Lyot  made  polarimetric  observations  in 
the  1920’s,  but  not  until  decades  later  was  it  re¬ 
alised  that  his  observations  could  be  explained  by 
assuming  that  light  was  scattered  by  liquid  spher¬ 
ical  particles  whose  index  of  refraction  is  1.44. 
This  is  significantly  higher  than  the  index  of  re¬ 
fraction  of  water,  1.33.  Moreover,  water  is  not 
liquid  at  that  temperature.  A  good  candidate  was 
sulphuric  acid  H2SO4.  Later,  spacecraft  con¬ 
firmed  this  interpretation. 

In  1962,  radar  measurements  revealed  that  the 
rotation  period  is  243  days  in  a  retrograde  direc¬ 
tion,  i.e.  opposite  to  the  rotation  of  other  planets. 
The  axis  of  rotation  is  almost  perpendicular  to  the 
orbital  plane;  the  inclination  is  177°.  The  reason 
for  the  strange  rotation  is  not  known. 

The  temperature  at  the  cloud  tops  is  about 
250  K.  Because  the  Bond  albedo  is  as  high 
as  75  %,  the  surface  temperature  was  believed 
to  be  moderate,  even  suitable  for  life.  Opinions 
changed  dramatically  when  thermal  radio  emis¬ 
sion  was  measured  at  the  end  of  the  1950’s.  This 
emission  originates  on  the  surface  of  the  planet 
and  can  penetrate  the  clouds.  The  surface  tem¬ 
perature  turned  out  to  be  750  K,  well  above  the 
melting  point  of  lead.  The  reason  for  this  is  the 
greenhouse  effect.  The  outgoing  infrared  radia¬ 
tion  is  blocked  by  atmospheric  carbon  dioxide. 
The  pressure  of  the  atmosphere  at  the  surface  is 
90  atm. 
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Fig.  8.4  Left:  Venus  in  visible  light  imaged  by  the  Galileo 
orbiter  in  February  1990.  The  cloud  features  are  caused 
by  winds  that  blow  from  east  to  west  at  about  100  m/s. 
Right:  The  northern  hemisphere  of  Venus  in  a  computer- 

Mariner  2  (1962)  was  the  first  spacecraft  to 
encounter  the  planet.  Five  years  later,  the  Soviet 
Venera  4  sent  the  first  data  from  below  the  clouds, 
and  the  first  pictures  of  the  surface  were  sent  by 
Venera  9  and  10  in  1975.  The  first  radar  map  was 
completed  in  1980,  after  18  months  of  mapping 
by  the  US  Pioneer  Venus  1.  The  best  and  the 
most  complete  maps  (about  98  %  of  the  planet’s 
surface)  were  made  using  the  synthetic  aperture 
radar  observations  of  the  Magellan  spacecraft  in 
1990-1994.  The  resolution  of  the  maps  is  as  high 
as  100  m  and  the  elevation  was  measured  with 
a  resolution  of  30  metres.  The  Venus  Express 
launched  by  ESA  studied  Venus  from  an  orbit 
during  2006-2014. 

Radar  mappings  revealed  canyons,  mountains, 
craters,  volcanoes  and  other  volcanic  formations 
(Fig.  8.5).  The  surface  of  Venus  is  covered  by 
about  20  %  of  lowland  plains,  70  %  of  gently 
rolling  uplands  and  lava  flows,  and  10  %  of  high¬ 
lands. 

There  are  only  few  major  highland  areas. 
The  largest  continent,  Aphrodite  Terra ,  is  close 
to  the  equator  of  Venus;  its  size  is  similar  to 
South  America.  Another  large  continent  at  the 
latitude  70°N  is  called  Ishtar  Terra ,  where  the 


generated  picture  of  the  radar  observations.  The  north  pole 
is  at  the  centre  of  the  image  of  the  Magellan  synthetic 
aperture  radar  mosaic.  (NASA/JPL) 


highest  mountain  on  Venus,  the  12  km  high 
Maxwell  Montes  is  situated.  (IAU  has  decided 
that  the  Venusian  nomenclature  has  to  be  femi¬ 
nine.  Maxwell  Montes,  after  the  famous  physicist 
James  Clerk  Maxwell,  is  an  exception.) 

On  Venus  there  are  volcanic  features  all  over 
the  surface.  There  is  no  evidence  of  massive  tec¬ 
tonic  movement  although  local  deformations  can 
exist.  Small  volcanoes  are  evenly  distributed  but 
big  ones  are  concentrated  mainly  on  highlands. 
The  origin  of  the  latter  may  resemble  that  of  the 
Hawaiian  volcanoes:  the  volcanoes  are  located 
above  a  “hot  spot”  of  the  mantle,  in  a  place  where 
currents  of  the  mantle  move  hot  magma  towards 
the  surface. 

Venus  has  more  volcanoes  than  any  other 
planet  in  the  solar  system.  Over  1500  major  vol¬ 
canoes  or  volcanic  features  are  known,  and  there 
may  even  be  one  million  smaller  ones.  Most  are 
shield  volcanoes,  but  there  are  also  many  com¬ 
plex  features.  None  are  known  to  be  active  at 
present,  although  large  variations  of  sulphur  diox¬ 
ide  in  the  atmosphere  may  indicate  that  some  vol¬ 
canoes  are  active. 

Almost  all  volcanism  on  Venus  seems  to  in¬ 
volve  fluid  lava  flows  without  any  explosive  erup- 
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Fig.  8.5  Surface  features  of  Venus.  (Top  left):  A  Magel¬ 
lan  image  of  a  50  km  peak-ring  crater  Barton  at  27.4°N 
and  337. 5°E.  ( Top  right):  A  Magellan  radar  image  of  a  re¬ 
gion  300  km  across,  located  in  a  vast  plain  to  the  south 
of  Aphrodite  Terra.  The  large  circular  structure  near  the 
centre  of  the  image  is  a  corona,  approximately  200  km  in 


diameter.  North  of  the  corona  is  a  35  km  flat- topped  vol¬ 
canic  construct  known  as  a  pancake  dome.  Complex  frac¬ 
ture  patterns  like  in  the  upper  right  of  the  image  are  often 
observed  in  association  with  coronas  and  various  volcanic 
features.  (NASA/JPL).  (Bottom):  The  surface  of  Venus 
photographed  by  the  Venera  14  lander  in  March  1982 


tions.  Due  to  the  high  air  pressure,  Venusian  lavas 
need  a  much  higher  gas  content  than  the  Earth 
lavas  to  erupt  explosively.  The  main  gas  driv¬ 
ing  lava  explosions  on  the  Earth  is  water  vapour, 
which  does  not  exist  on  Venus. 

Flat-topped  volcanic  constructs  known  as  pan¬ 
cake  domes  are  probably  formed  by  the  eruption 
of  an  extremely  viscous  lava.  A  corona  is  a  cir¬ 
cular  trench  surrounding  an  elevated  plain,  the 
diameter  of  which  can  be  as  big  as  several  hun¬ 
dreds  of  kilometres.  They  are  possibly  examples 
of  local  hot  spots,  mantle  upwellings  that  have 
expanded  and  formed  bulges.  When  the  flow  has 
stopped,  the  bulge  has  sunk  and  formed  a  set  of 
ring  mountains. 

In  other  places  fluid  lava  flows  have  produced 
long,  sinuous  channels  extending  for  hundreds  of 
kilometres. 

Venus  has  no  large  scale  tectonic  activity.  The 
reason  may  be  the  thinner  and  weaker  crust  and 


lack  of  water;  therefore  there  are  no  large  subduc- 
tion  zones  between  continents.  Instead,  the  crust 
can  consist  of  local  zones  only.  Since  the  inter¬ 
nal  heat  cannot  flow  out  like  on  the  Earth,  the 
crust  will  warm  up  until  at  a  critical  temperature 
it  will  collapse  and  be  renewed  in  hundred  mil¬ 
lion  years.  The  last  time  this  happened  was  about 
500  million  years  ago. 

Most  of  the  Venusian  impact  craters  are  un¬ 
deformed.  This  indicates  that  the  Venusian  sur¬ 
face  must  be  young  because  erosion,  volcan- 
ism  and  tectonic  forces  should  affect  the  craters, 
too.  Resurfacing  processes  may  frequently  cover 
the  old  craters,  and  all  craters  visible  are  there¬ 
fore  young,  presumably  less  than  500  million 
years.  There  are  no  impact  crates  smaller  than 
a  few  kilometers  because  smaller  meteoroids  are 
burned  in  the  thick  atmosphere. 

The  Earth  and  Venus  are  almost  equal  in  size, 
and  their  interiors  are  assumed  to  be  similar. 
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Venus  has  an  iron  core  about  3000  km  in  radius 
and  a  molten  rocky  mantle  covering  the  majority 
of  the  planet.  Probably  due  to  its  slow  rotation, 
however,  Venus  has  no  magnetic  field.  The  anal¬ 
yses  made  by  the  Venera  landers  have  shown  that 
the  surface  material  of  Venus  is  similar  to  terres¬ 
trial  granite  and  basalt. 

About  1  %  of  the  incident  light  reaches  the 
surface  of  Venus;  this  light  is  deep  red  after  trav¬ 
elling  through  clouds  and  the  thick  atmosphere. 
Most  of  the  incident  light,  about  75  %,  is  reflected 
back  from  the  upper  layers  of  clouds.  The  ab¬ 
sorbed  light  is  emitted  back  in  infrared.  The  car¬ 
bon  dioxide  atmosphere  very  effectively  prevents 
the  infrared  radiation  from  escaping,  and  the  tem¬ 
perature  had  not  reached  the  equilibrium  until  at 
750  K. 

In  spite  of  the  thick  layer  of  clouds,  the  visibil¬ 
ity  on  the  surface  is  several  kilometres,  and  even 
in  the  clouds,  a  few  hundred  metres.  Therefore, 
the  clouds  are  relatively  transparent.  The  densest 
clouds  are  at  a  height  of  50  km,  but  their  thick¬ 
ness  is  only  2-3  km.  Above  this,  there  are  haze¬ 
like  layers  which  form  the  visible  “surface”  of  the 
planet.  The  uppermost  clouds  move  rapidly;  they 
rotate  around  the  planet  in  about  4  days,  pushed 
by  strong  winds  powered  by  the  Sun.  The  flow 
patterns  of  the  clouds  are  best  visible  in  ultravio¬ 
let  light.  The  sulphuric  acid  droplets  do  not  rain 
on  the  Venusian  surface  but  they  evaporate  in  the 
lower  atmosphere  before  reaching  the  surface. 

Venus’  atmosphere  is  very  dry:  the  amount  of 
water  vapour  present  is  only  1/1,000,000  of  that 
in  the  Earth’s  atmosphere.  One  possible  explana¬ 
tion  is  that,  due  to  solar  UV  radiation,  the  water 
has  dissociated  to  hydrogen  and  oxygen  in  the  up¬ 
per  layers  of  the  atmosphere,  the  former  escaping 
into  interplanetary  space. 

Venus  has  no  satellites. 


8.3  The  Earth  and  the  Moon 

The  third  planet  from  the  Sun,  the  Earth ,  and  its 
satellite,  the  Moon ,  form  almost  a  double  planet. 
The  relative  size  of  the  Moon  is  larger  than  that 
of  any  other  satellite,  excluding  the  moon  of  the 
dwarf  planet  Pluto.  Usually  satellites  are  much 
smaller  than  their  parent  planets. 
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Fig.  8.6  Internal  structure  of  the  Earth.  The  speed  of 
the  seismic  waves,  density,  pressure,  and  temperature  are 
shown  as  a  function  of  depth.  The  crust,  lithosphere,  lower 
mantle  and  inner  core  are  solid,  the  outer  mantle  is  molten 
and  the  asthenosphere  is  partly  molten 

The  Earth  is  a  unique  body,  since  a  consid¬ 
erable  amount  of  free  water  is  found  on  its  sur¬ 
face.  This  is  possible  only  because  the  temper¬ 
ature  is  above  the  freezing  point  and  below  the 
boiling  point  of  water  and  the  atmosphere  is  thick 
enough.  The  Earth  is  also  the  only  planet  where 
life  is  known  to  exist.  (Whether  it  is  intelligent  or 
not  is  yet  to  be  resolved. . . ).  The  moderate  tem¬ 
perature  and  the  water  are  essential  for  terrestrial 
life,  although  some  life  forms  can  be  found  in  ex¬ 
treme  conditions. 

The  diameter  of  the  Earth  is  12,000  km.  At  the 
centre,  there  is  an  iron-nickel  core  where  the  tem¬ 
perature  is  5000  K,  the  pressure  3  x  1011  N  m-2 
and  the  density  12,000  kgm-3  (Fig.  8.6). 

The  core  is  divided  into  two  layers,  inner  and 
outer  core.  The  inner  core,  below  5150  km  com¬ 
prises  only  of  1.7  %  of  the  mass  of  the  Earth.  It  is 
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solid  because  of  high  pressure.  The  nonexistence 
of  the  seismic  transverse  S  waves  below  a  depth 
of  2890  km  indicates  that  the  outer  core  is  molten. 
However,  the  speed  of  the  longitudinal  P  waves 
change  rapidly  at  a  depth  of  5 150  km  showing  an 
obvious  phase  transition.  It  has  been  discovered 
that  the  solid  inner  core  rotates  with  respect  to 
the  outer  core  and  mantle. 

The  outer  core  comprises  about  31  %  of  the 
mass  of  the  Earth.  It  is  a  hot,  electrically  conduct¬ 
ing  layer  of  liquid  Fe-Ni  where  the  convective 
motions  take  place.  There  are  strong  currents  in 
the  conductive  layer  that  are  responsible  for  the 
magnetic  field. 

Between  the  outer  core  and  the  lower  mantle 
there  is  a  200  km  thick  transition  layer.  Although 
this  D"  layer  is  often  included  as  a  part  of  the 
lower  mantle,  seismic  discontinuities  suggest  that 
it  might  differ  chemically  from  the  lower  mantle. 

A  silicate  mantle  extends  from  2890  km  up¬ 
ward  up  to  a  depth  of  few  tens  of  kilometres.  The 
part  below  650  km  is  often  identified  as  the  lower 
mantle.  It  contains  about  49  %  of  the  mass  and 
is  composed  mainly  of  silicon,  magnesium,  and 
oxygen  but  some  iron,  calcium,  and  aluminium 
may  also  exist.  The  major  minerals  are  olivine 
(Mg,Fe)2Si04  and  pyroxene  (Mg,Fe)Si03.  Un¬ 
der  pressure  the  material  behaves  like  a  viscous 
liquid  or  an  amorphous  medium,  resulting  in  slow 
vertical  flows. 

Between  the  lower  and  upper  mantle  there  is 
a  250  km  thick  transition  region  or  mesosphere.  It 
is  the  source  of  basaltic  magmas  and  is  rich  in  cal¬ 
cium  and  aluminium.  The  upper  mantle,  between 
some  tens  of  kilometres  down  to  400  km  contains 
about  10  %  of  the  mass.  Part  of  the  upper  man¬ 
tle,  called  the  astheno sphere,  might  be  partially 
molten. 

A  thin  crust  floats  on  the  mantle.  The  thick¬ 
ness  of  the  crust  is  only  10-65  km;  it  is  thick¬ 
est  below  high  mountain  ranges  such  as  the 
Himalayas  and  thinnest  below  the  mid-ocean 
basins.  The  seismic  discontinuity  showing  the 
border  between  the  crust  and  mantle  was  discov¬ 
ered  in  1909  by  the  Croatian  scientist  Andrija 
Mohorovicic ,  and  it  is  now  known  as  the  Moho 
discontinuity. 

The  basaltic  oceanic  crust  is  very  young, 
mostly  less  than  100  million  years  and  nowhere 


more  than  200  Ma.  It  is  made  through  tectonic 
activity  at  the  mid-ocean  ridges ,  where  new  ma¬ 
terial  is  spreading  out. 

The  continental  crust  is  mainly  composed  of 
crystalline  rocks  that  are  dominated  by  quartz 
(SiCU)  and  feldspars  (metal-poor  silicates).  Be¬ 
cause  the  continental  crust  is  lighter  than  the 
oceanic  crust  (average  densities  are  about 
2700  kg  m~3  and  3000  kg  m-3,  respectively),  the 
continents  are  floating  on  top  of  other  layers,  and 
currently  they  are  neither  created  nor  destroyed. 

The  lithosphere  is  the  rigid  outer  part  of  the 
Earth  (crust  and  the  topmost  part  of  the  upper 
mantle).  Below  that  is  the  partially  molten  as- 
thenosphere  where  the  damping  of  seismic  waves 
is  stronger  than  in  the  rigid  lithosphere. 

The  lithosphere  is  not  a  single  rigid  and  seam¬ 
less  layer;  instead  it  is  divided  into  more  than  20 
individual  plates.  The  plate  tectonics  (“continen¬ 
tal  drift”)  is  powered  by  the  motion  of  the  ma¬ 
terial  in  the  mantle  (Fig.  8.7).  New  material  is 
flowing  up  at  the  mid-ocean  ridges,  pushing  the 
tectonic  plates  apart.  New  oceanic  crust  is  gen¬ 
erated  at  the  rate  of  17  km3  per  year.  The  Earth 
is  the  only  planet  that  shows  any  large-scale  tec¬ 
tonic  activity.  The  history  of  the  motion  can  be 
studied  by  using  e.g.  the  paleomagnetic  data  of 
magnetic  orientation  of  crystallised  rocks. 

At  the  end  of  the  Precambrian  era,  about  700 
million  years  ago,  more  than  half  of  the  conti¬ 
nents  were  together  forming  the  continent  known 
as  Gondwana,  containing  Africa,  South  Amer¬ 
ica,  Australia  and  Antarctica.  About  350  mil¬ 
lion  years  ago  Gondwana  was  on  the  South  Pole 
but  it  moved  toward  the  equator  before  the  final 
breakup.  Mutual  collisions  formed  new  moun¬ 
tains  and  finally  in  the  beginning  of  the  Mesozoic 
era,  about  200  million  years  ago,  all  the  conti¬ 
nents  were  joined  into  one  supercontinent,  Pan¬ 
gaea. 

Quite  soon  the  flow  pattern  in  the  mantle 
changed  and  the  Pangaea  broke  up.  The  Atlantic 
Ocean  is  still  growing  and  new  material  is  flow¬ 
ing  up  at  the  mid-Atlantic  ridge.  North  America 
is  drifting  away  from  Europe  at  the  rate  of  a  few 
centimetres  per  year  (your  fingernails  are  grow¬ 
ing  at  the  same  speed).  At  the  same  time,  parts 
of  the  Pacific  oceanic  plate  are  disappearing  be¬ 
low  other  plates.  When  an  oceanic  crust  is  pushed 
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Fig.  8.7  The  tectonic  plates.  The  dots  on  the  map  indicate 
the  location  of  earthquakes  with  magnitudes  greater  than  5 
in  the  years  1980-1989.  Arrows  show  the  velocities  ob¬ 


served  with  permanent  GPS  (Global  Positioning  System) 
tracking  stations.  The  velocity  scale  is  shown  at  lower  left 


below  a  continental  crust,  a  zone  of  active  volca¬ 
noes  is  created.  The  earthquakes  in  the  subdue  - 
tion  zones  can  even  originate  600  km  below  the 
surface.  In  the  mid-ocean  ridges,  the  depth  is  only 
tens  of  kilometres. 

Mountains  are  formed  when  two  plates  col¬ 
lide.  The  push  of  the  African  plate  toward  the 
Eurasian  plate  formed  the  Alps  about  45  million 
years  ago.  The  collision  of  the  Indian  plate  cre¬ 
ated  the  Himalayas  some  40  million  years  ago, 
and  they  are  still  growing.  The  plates  can  also 
move  sideways.  Earthquakes  in  such  a  fault  usu¬ 
ally  occur  in  depths  less  than  about  100  kilome¬ 
tres.  An  example  is  the  San  Andreas  fault  in  Cal¬ 
ifornia. 

No  other  planet  has  such  large  scale  tectonic 
activity.  The  interiors  of  Mars  and  Mercury  are 
already  too  cold  to  maintain  convection  in  their 
mantles.  The  structure  of  the  crust  of  Venus  is  too 
weak  to  move  large  regions.  Also,  the  free  water 
of  the  Earth  may  be  important  in  keeping  up  the 
plate  tectonics. 

The  climate  is  an  efficient  factor  reshaping  the 
surface.  The  mountains  created  by  collisions  of 
tectonic  plates  are  eroded  already  in  a  few  hun¬ 
dred  million  years  by  temperature  changes,  rain, 
wind  and  ice.  Impact  crater  disappear  even  faster, 


and  many  of  the  currently  known  craters  have 
been  found  by  local  gravitational  anomalies  in¬ 
stead  of  geographical  features. 

Most  of  the  Earth’s  surface  is  covered  with  wa¬ 
ter.  The  Earth  is  the  only  planet  that  currently  has 
free  running  water.  The  water  condensed  from  the 
water  vapour  released  in  volcanic  eruptions.  De¬ 
pending  on  glaciation,  the  sea  level  ascends  or 
descends  by  over  100  metres  with  respect  to  the 
solid  ground.  During  the  last  ice  age  the  sea  level 
was  150  m  below  the  current.  Due  to  the  melting 
of  glaciers  the  sea  level  is  rising  by  one  millimetre 
per  year,  but  the  rate  is  increasing  by  the  green¬ 
house  effect. 

Compared  with  the  geological  time  scale  the 
increase  in  the  greenhouse  effect  caused  by  hu¬ 
man  activity  is  a  very  young  phenomenon.  Ice 
ages  have  come  and  gone  in  periods  of  a  time 
scales  of  100,000  years.  The  current  warming  of 
the  climate  is  a  temporary  phenomenon,  which, 
however,  may  have  some  effect  on  the  beginning 
of  the  next  ice  age.  Although  the  human  activi¬ 
ties  have  little  effect  from  the  geological  point  of 
view,  they  may  have  disastrous  effects  on  the  cur¬ 
rent  biosphere. 

The  primordial  atmosphere  of  the  Earth  was 
very  different  from  the  modern  one;  there  was,  for 
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Fig.  8.8  Hurricane  Elena 
in  the  Gulf  of  Mexico  seen 
from  the  space  shuttle 
Discovery  in  September  1, 
1985.  Wind  speeds  may 
exceed  170  km/h.  Compare 
this  to  the  Great  Red  Spot 
of  Jupiter  in  Fig.  8.18. 

(NO  A  A) 


example,  no  oxygen.  Earlier  it  was  assumed  that 
the  original  atmosphere  was  reducing,  but  more 
recent  studies  seem  to  indicate  that  it  was  neutral. 
When  organic  chemical  processes  started  in  the 
oceans  more  than  2  x  109  years  ago  and  reached 
the  level  of  photosynthesis,  the  amount  of  oxy¬ 
gen  rapidly  increased  (and  was  poison  to  the 
first  forms  of  life!).  The  original  carbon  dioxide 
is  now  mainly  concentrated  in  carbonate  rocks, 
such  as  limestone,  and  the  methane  was  disso¬ 
ciated  by  solar  UV  radiation.  The  current  atmo¬ 
sphere  has  very  little  resemblance  to  the  early  at¬ 
mosphere  a  couple  of  billion  years  ago. 

The  Earth’s  main  atmospheric  constituents  are 
nitrogen  (77  %  by  volume)  and  oxygen  (21  %). 
Other  gases,  such  as  argon,  carbon  dioxide,  and 
water  vapour  are  present  in  minor  amounts.  The 
chemical  composition  is  unchanged  in  the  lower 
part  of  the  atmosphere,  called  the  troposphere. 
Most  of  the  climatic  phenomena  occur  in  the  tro¬ 
posphere,  which  reaches  up  to  8-10  km  (Fig.  8.8). 
The  height  of  the  layer  is  variable,  being  lowest  at 
the  poles,  and  highest  at  the  equator,  where  it  can 
extend  up  to  18  km. 

The  layer  above  the  troposphere  is  the  strato¬ 
sphere,  extending  up  to  60  km.  The  boundary 


between  the  troposphere  and  the  stratosphere 
is  called  the  tropopause.  In  the  troposphere, 
the  temperature  decreases  5-7  K/km,  but  in  the 
stratosphere  it  begins  to  rise,  due  to  the  absorp¬ 
tion  of  solar  radiation  by  carbon  dioxide,  wa¬ 
ter  vapour  and  ozone.  The  ozone  layer,  which 
shields  the  Earth  from  the  solar  UV  radiation,  is 
at  a  height  of  20-25  km. 

A  total  of  99  %  of  air  is  in  the  troposphere 
and  stratosphere.  The  stratopause  at  a  height  of 
50-60  km  separates  the  stratosphere  from  the 
mesosphere. 

The  mesosphere  extends  up  to  85  km.  In  this 
layer,  the  temperature  decreases  again,  reaching 
the  minimum  of  about  —90  °C  at  the  height  of 
80-90  km  in  the  mesopause.  Chemicals  in  the 
mesosphere  are  mostly  in  an  excited  state,  as  they 
absorb  energy  from  the  Sun. 

Above  the  mesopause  is  the  thermosphere  that 
extends  up  to  500  kilometres.  The  temperatures 
increases  with  altitude  and  can  be  above  1200  °C 
at  the  height  of  500  km.  The  gas  is  in  the  form  of 
a  fully  ionised  plasma.  Therefore,  the  layer  above 
the  mesopause  is  also  called  the  ionosphere. 

The  density  of  air  below  a  height  of  150  km 
is  high  enough  to  cause  colliding  meteoroids  to 
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Fig.  8.9  A  map  of  the  Lunar  surface,  composed  of  im¬ 
ages  taken  by  the  Clementine  space  probe  in  1994.  Note 
the  large  areas  of  maria  in  the  Lunar  near  side,  at  the  centre 

burn  into  ashes  due  to  friction.  It  also  plays  an 
important  role  in  radio  communications,  since  ra¬ 
dio  waves  are  reflected  by  the  ionosphere.  Auro¬ 
ras  are  phenomena  of  the  upper  part  of  the  iono¬ 
sphere. 

The  thermosphere  goes  over  into  the  exo¬ 
sphere  at  about  500  km.  There  the  air  pressure  is 
much  lower  than  in  the  best  laboratory  vacuums. 

The  magnetic  field  of  the  Earth  is  generated  by 
flows  in  its  core.  The  field  is  almost  a  dipole  but 
there  are  considerable  local  and  temporal  varia¬ 
tions.  The  mean  field  strength  close  to  the  equa¬ 
tor  is  3. 1  x  1(T5  Tesla  (0.3 1  Gauss).  The  dipole  is 
tilted  11°  with  respect  to  the  Earth’s  axis,  but  the 
direction  gradually  changes  with  time.  Currently 
the  field  is  decreasing  about  30  nT  a  year  and 
moving  counterclockwise;  however,  the  angle  be¬ 
tween  the  magnetic  axis  and  the  rotation  axis  re¬ 
mains  nearly  constant.  Moreover,  the  magnetic 
north  and  south  poles  have  exchanged  places  sev¬ 
eral  times  during  the  past,  once  every  100,000- 
1,000,000  years.  The  last  time  this  happened  was 
about  780,000  years  ago. 

The  Moon  Our  nearest  neighbour  in  space  is 
the  Moon  (Fig.  8.9).  Its  motion  was  described  in 
Sect.  7.4.  Dark  and  light  areas  are  visible  even 


of  the  figure,  as  compared  to  the  almost  complete  absence 
of  the  maria  at  the  Lunar  far  side.  (US  Naval  Observatory) 


with  the  naked  eye.  For  historical  reasons,  the  for¬ 
mer  are  called  seas  or  maria  (from  Latin,  mare , 
sea,  pi.  maria).  The  lighter  areas  are  uplands  but 
the  maria  have  nothing  in  common  with  terres¬ 
trial  seas,  since  there  is  no  water  on  the  Moon. 
Numerous  craters,  all  meteorite  impacts,  can  be 
seen,  even  with  binoculars  or  a  small  telescope. 
The  lack  of  atmosphere,  volcanism,  and  tectonic 
activity  help  to  preserve  these  formations. 

The  Moon  is  the  best-known  body  after  the 
Earth.  The  first  man  landed  on  the  Moon  in 
1969  during  the  Apollo  11  flight.  A  total  of  over 
2000  samples,  weighing  382  kg,  were  collected 
during  the  six  Apollo  flights  (Fig.  8.10).  More¬ 
over,  the  unmanned  Soviet  Luna  spacecraft  col¬ 
lected  and  returned  about  310  grams  of  Lunar 
soil.  Instruments  placed  on  the  Moon  by  the 
Apollo  astronauts  operated  as  long  as  eight  years. 
These  included  seismometers,  which  detected 
moonquakes  and  meteorite  impacts,  and  passive 
laser  reflectors  which  made  exact  Earth-Moon 
distance  measurements  possible.  The  reflectors 
are  still  used  for  Lunar  laser  ranging  (LLR)  mea¬ 
surements. 

Seismometric  and  gravimetric  measurements 
have  supplied  basic  information  on  the  internal 
structure  of  the  Moon.  Moonquakes  take  place 
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Fig.  8.10  Apollo  17 
astronaut  Harrison  Schmitt 
on  the  Moon  in  1972. 
(NASA) 


at  a  depth  of  800-1000  km,  considerably  deeper 
than  earthquakes,  and  they  are  also  much  weaker 
than  on  the  Earth.  Most  of  the  quakes  occur  at 
the  boundary  of  the  solid  mantle,  the  lithosphere , 
and  the  asthenosphere  (Fig.  8.11).  The  transver¬ 
sal  S  waves  cannot  penetrate  the  asthenosphere, 
indicating  that  it  is  at  least  partially  molten.  Tidal 
forces  may  generate  at  least  some  of  the  moon- 
quakes  because  most  of  them  occur  close  to 
perigee  or  apogee. 

Lunar  orbiters  have  observed  local  mass  con¬ 
centrations,  mascons ,  beneath  the  maria.  These 
are  large  basaltic  blocks,  formed  after  the  huge 
impacts  which  produced  the  maria.  The  craters 
were  filled  by  lava  flows  during  the  next  billion 
years  or  so  in  several  phases.  This  can  be  seen, 
e.g.  in  the  area  of  Mare  Imbrium.  Large  maria 
were  formed  about  4  x  109  years  ago  when  mete¬ 
orite  bombardment  was  much  heavier  than  today. 
The  last  3  x  109  years  have  been  quite  peaceful, 
without  any  major  events. 

The  centre  of  mass  is  not  at  the  geometric  cen¬ 
tre  of  the  Moon  but  about  2.5  km  away  due  to 
the  20-30  km  thick  basaltic  plates  below  the  large 
maria.  Moreover,  the  thickness  of  the  crust  varies, 
being  the  thickest  at  the  far  side  of  the  Moon, 


Fig.  8.11  Structure  of  the  Moon.  The  height  differences 
of  the  surface  are  strongly  exaggerated 


about  100  km.  On  the  near  side  the  thickness  of 
the  crust  is  about  60  km. 

The  mean  density  of  the  Moon  is  3400  kg  m-3 , 
which  is  comparable  to  that  of  basaltic  lavas  on 
the  Earth.  The  Moon  is  covered  with  a  layer  of 
soil  with  scattered  rocks,  regolith.  It  consists  of 
the  debris  blasted  out  by  meteorite  impacts.  The 
original  surface  is  nowhere  visible.  The  thickness 
of  the  regolith  is  estimated  to  be  at  least  tens  of 
metres.  A  special  type  of  rock,  breccia ,  which 
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is  a  fragment  of  different  rocks  compacted  and 
welded  together  by  meteor  impacts,  is  found  ev¬ 
erywhere  on  the  Moon. 

The  maria  are  mostly  composed  of  dark  ba¬ 
salts,  which  form  from  rapid  cooling  of  massive 
lava  flows.  The  highlands  are  largely  composed 
of  anorthosite ,  an  igneous  rock  that  forms  when 
lava  cools  more  slowly  than  in  the  case  of  basalts. 
This  implies  that  the  rocks  of  the  maria  and  high¬ 
lands  cooled  at  different  rates  from  the  molten 
state  and  were  formed  under  different  conditions. 

Data  returned  by  the  Lunar  Prospector  and 
Clementine  spacecraft  indicated  that  water  ice  is 
present  at  both  the  north  and  south  lunar  poles. 
Data  indicates  that  there  may  be  nearly  pure  wa¬ 
ter  ice  buried  beneath  the  dry  regolith.  The  ice  is 
concentrated  at  the  bottoms  of  deep  valleys  and 
craters  that  are  in  a  permanent  shadow  where  the 
temperature  is  below  100  K. 

The  Moon  has  no  global  magnetic  field.  Some 
of  the  rocks  have  remanent  magnetism  indicat¬ 
ing  a  possible  global  magnetic  field  early  in 
the  Moon’s  history.  Without  the  atmosphere  and 
magnetic  field,  the  solar  wind  can  reach  the 
Moon’s  surface  directly.  The  ions  from  the  solar 
wind  have  embedded  in  the  regolith.  Thus  sam¬ 
ples  returned  by  the  Apollo  missions  proved  valu¬ 
able  in  studies  of  the  solar  wind. 

The  origin  of  the  Moon  is  still  uncertain;  it  has, 
however,  not  been  torn  off  from  the  Earth  at  the 
Pacific  Ocean,  as  is  sometimes  believed.  The  Pa¬ 
cific  is  less  than  200  million  years  old  and  formed 
as  a  result  of  continental  drift.  Also,  the  chemical 
composition  of  the  lunar  soil  is  different  from  that 
of  terrestrial  material. 

Recently  it  was  suggested  that  the  Moon  was 
formed  in  the  early  stages  of  the  formation  of  the 
Earth,  when  a  lot  of  protoplanet  embryos  were 
orbiting  the  Sun.  An  off-axis  collision  of  a  Mars- 
size  body  resulted  in  ejection  of  a  large  amount 
of  debris,  a  part  of  which  then  accreted  to  form 
the  Moon.  Differences  in  chemical  compositions 
of  the  modern  Earth  and  the  Moon  can  be  ex¬ 
plained  with  the  theory,  as  well  as  the  orientation 
and  evolution  of  the  Moon’s  orbit  and  the  Earth’s 
relatively  fast  spin  rate. 


Box  8.1  (Atmospheric  Phenomena)  The  best- 
known  atmospheric  phenomenon  is  the  rain¬ 
bow ,  which  is  due  to  the  refraction  of  light 
from  water  droplets.  The  radius  of  the  arc  of 
the  rainbow  is  about  41°  and  the  width,  1.7°. 
The  centre  of  the  arc  is  opposite  the  Sun  (or 
any  other  source  of  light).  When  the  light  is  re¬ 
fracted  inside  a  water  droplet,  it  is  divided  into 
a  spectrum,  where  the  red  colour  is  at  the  outer 
edge  and  blue,  at  the  inner  edge.  Light  can  be 
reflected  twice  inside  the  droplet,  resulting  in 
a  secondary  rainbow  outside  the  primary  one. 
The  colours  of  the  secondary  rainbow  are  in  re¬ 
versed  order  and  its  radius  is  52°.  A  rainbow 
caused  by  the  Moon  is  usually  very  weak  and 
colourless,  since  the  human  eye  is  incapable  of 
resolving  colours  of  a  dim  object. 

A  halo  results  when  the  solar  or  lunar  light 
is  reflected  from  atmospheric  ice  crystals.  The 
most  common  halo  is  a  22°  arc  or  circle  around 
the  Sun  or  the  Moon.  Usually  the  halo  is  white, 
but  occasionally  even  bright  colours  can  be 
seen.  Another  common  form  is  the  side  lobes 
which  are  at  the  same  height  as  the  Sun  but 
at  a  distance  of  22°  from  it.  All  other  forms 
of  halo  are  less  common.  The  best  “weather” 
for  halos  is  when  there  are  cirrostratus  or  cirrus 
clouds  or  an  icy  fog  in  the  sky. 

Noctilucent  clouds  are  thin  formations  of 
cloud,  at  a  height  of  approximately  80  km.  The 
clouds  contain  particles,  which  are  less  than 
one  micron  in  diameter,  and  become  visible 
only  when  the  Sun  (which  is  below  the  hori¬ 
zon)  illuminates  the  clouds.  Most  favourable 
conditions  are  at  the  northern  latitudes  during 
the  summer  nights  when  the  Sun  is  only  a  few 
degrees  below  the  horizon. 

The  night  sky  is  never  absolutely  dark.  One 
reason  (in  addition  to  light  pollution)  is  the  air- 
glow  or  light  emitted  by  excited  atmospheric 
molecules.  Most  of  the  radiation  is  in  the  in¬ 
frared  domain,  but  e.g.  the  forbidden  line  of 
oxygen  at  558  nm,  has  also  been  detected. 

The  same  greenish  oxygen  line  is  clearly 
seen  in  auroras ,  which  are  formed  at  a  height 
of  80-300  km.  Auroras  can  be  seen  mainly 
from  relatively  high  northern  or  southern  lati- 
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Top :  Halos  are  particularly  common  in  Antarctic  (photo  Marko  Riikonen). 
Bottom :  Auroras  (photo  Pekka  Parviainen) 


tudes  because  the  Earth’s  magnetic  field  forces 
charged  particles,  coming  from  the  Sun,  close 
toward  the  magnetic  poles.  Alaska  and  north¬ 
ern  Scandinavia  are  the  best  places  to  observe 
auroras.  Occasionally,  auroras  can  be  seen  as 
far  south  as  40°.  They  are  usually  greenish  or 
yellow-green,  but  red  auroras  have  been  ob¬ 
served,  too.  They  most  commonly  appear  as 
arcs,  which  are  often  dim  and  motionless,  or  as 
belts,  which  are  more  active  and  may  contain 
rapidly  varying  vertical  rays. 


Meteors  (also  called  shooting  stars  although 
they  have  nothing  to  do  with  stars)  are  small 
grains  of  sand,  a  few  micrograms  or  grams  in 
weight,  which  hit  the  Earth’s  atmosphere.  Due 
to  friction,  the  body  heats  up  and  starts  to  glow 
at  a  height  of  100  km.  Some  20-40  km  lower, 
the  whole  grain  has  burnt  to  ashes.  The  dura¬ 
tion  of  a  typical  meteor  is  less  than  a  second. 
The  brightest  meteors  are  called  bolides  (mag¬ 
nitude  smaller  than  about  —2).  Even  larger  par- 
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Fig.  8.12  Two  pictures  of  Mars,  taken  by  the  Mars 
Global  Surveyor  in  June  and  July  2001.  The  view  from 
June  {left)  shows  the  Tharsis  volcanic  region,  Valles 
Marineris  and  the  late  winter  south  polar  cap.  The  view 


tides  may  survive  down  to  the  Earth.  Meteors 
are  further  discussed  in  Sect.  8.13. 


8.4  Mars 

Mars  is  the  outermost  of  the  terrestrial  planets. 
Its  diameter  is  only  half  of  that  of  the  Earth.  Seen 
through  a  telescope,  Mars  seems  to  be  a  reddish 
disk  with  dark  spots  and  white  polar  caps.  The  po¬ 
lar  caps  wax  and  wane  with  the  Martian  seasons, 
indicating  that  they  are  composed  of  ice.  Darker 
areas  were  suspected  to  be  vegetation.  At  the  end 
of  the  19th  century,  an  Italian  astronomer,  Gio¬ 
vanni  Schiaparelli  claimed  that  there  are  canals 
on  Mars. 

In  the  United  States,  the  famous  planetary 
astronomer  Percival  Lowell  studied  the  canals 
and  even  published  books  on  the  topic.  Martians 
were  also  very  popular  in  science  fiction  litera¬ 
ture.  Now  the  canals  are  known  to  be  nonexis¬ 
tent,  an  optical  illusion  when  the  obscure  details 
at  the  limit  of  visibility  seem  to  form  straight 
lines,  canals.  Finally,  the  first  clear  pictures  by 
Mariner  4  in  1965  buried  even  the  most  optimistic 
hopes  concerning  life  on  Mars.  Later  spacecraft 
revealed  more  details  of  the  planet. 


from  July  shows  the  same  regions,  but  most  of  the  details 
are  hidden  by  dust  storms  and  haze.  (NASA/JPL/Malin 
Space  Science  Systems) 


Mars  is  a  superior  planet,  which  means  that  it 
is  most  easily  observable  when  it  is  closest  to  the 
Earth,  i.e.  during  opposition,  when  the  planet  is 
above  the  horizon  all  night  long. 

The  rotation  axis  of  Mars  is  tilted  25°  to  the 
ecliptic,  about  the  same  amount  as  the  Earth’s 
axis.  A  Martian  day  is  only  half  an  hour  longer 
than  a  terrestrial  day.  Mars’  orbit  is  significantly 
elliptical,  resulting  in  temperature  variations  of 
about  30  °C  at  the  subsolar  point  between  the 
aphelion  and  perihelion.  This  has  a  major  influ¬ 
ence  on  the  climate.  Huge  dust  storms  are  oc¬ 
casionally  seen  on  Mars  (Fig.  8.12).  Usually  the 
storms  begin  when  Mars  is  at  the  perihelion. 
Heating  of  the  surface  gives  rise  to  large  temper¬ 
ature  differences  that  in  turn  cause  strong  winds. 
The  wind-driven  dust  absorbs  more  heat  and  fi¬ 
nally  the  whole  planet  is  covered  by  a  dust  storm 
where  the  wind  speeds  exceed  100  m/s. 

The  atmosphere  of  Mars  is  mainly  composed 
of  carbon  dioxide  (95  %).  It  contains  only  2  %  ni¬ 
trogen  and  0. 1-0.4  %  oxygen.  The  atmosphere 
is  very  dry:  if  all  the  moisture  were  condensed 
on  the  surface,  the  water  layer  would  be  thin¬ 
ner  than  0.1  mm.  Even  the  minor  amount  of  wa- 
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Fig.  8.13  A  topographic  shade  map  of  Mars  made  from 
the  Mars  Global  Surveyor  data.  The  most  prominent  fea¬ 
tures  are  the  large  shield  volcanoes  in  the  northern  hemi- 

ter  vapour  is  sufficient  to  occasionally  form  some 
thin  clouds  or  haze. 

The  air  pressure  is  only  5-8  mbar.  A  part  of 
the  atmosphere  has  escaped  but  it  is  probable  that 
Mars  never  had  a  thick  atmosphere.  The  primor¬ 
dial  atmosphere  of  Mars  was,  however,  somewhat 
similar  to  that  of  the  Earth.  Almost  all  of  its  car¬ 
bon  dioxide  was  used  up  to  form  carbonate  rocks. 
Because  there  are  no  plate  tectonics  on  Mars,  the 
carbon  dioxide  was  not  recycled  back  into  the  at¬ 
mosphere  as  on  the  Earth.  Therefore,  the  green¬ 
house  effect  on  Mars  is  significantly  smaller  than 
on  the  Earth. 

Craters  were  already  found  in  the  first  pictures. 
The  southern  hemisphere  is  especially  marked  by 
craters,  indicating  that  the  original  surface  is  still 
visible  there.  The  largest  impacts,  Hellas  and  Ar- 
gyre  are  about  2000  km  in  diameter.  On  the  other 
hand,  the  northern  hemisphere  has  an  abundance 
of  large  lava  basins  and  volcanoes  (Fig.  8.15). 
The  surface  is  younger  than  in  the  southern  hemi¬ 
sphere.  The  largest  volcano,  Olympus  Mons ,  pro¬ 
trudes  more  than  20  km  above  the  surround¬ 
ing  terrain.  The  diameter  at  the  bottom  is  about 
600  km. 

There  are  no  active  volcanoes  on  Mars.  The 
mare-like  plains  on  Mars  are  of  the  same  age  as 


sphere  and  the  Valles  Marineris  canyon  system  that  is 
more  than  3000  km  long  and  up  to  8  km  deep.  (MOL A 
Science  Team/NASA) 

the  Lunar  maria,  about  3  x  109  years  old.  Volcan- 
ism  in  the  highland  and  mare-like  plains  stopped 
at  that  time,  but  the  giant  shield  volcanoes  are 
much  younger,  possibly  1-2  x  109  years.  The 
youngest  lava  flows  on  Olympus  Mons  are  pos¬ 
sibly  less  than  100  million  years  old.  Mars  shows 
no  sign  of  plate  tectonics.  It  has  no  mountain 
chains,  nor  any  global  patterns  of  volcanism. 

There  are  also  several  canyons,  the  largest  of 
which  is  Valles  Marineris  (Fig.  8.13).  Its  length 
is  5000  km,  width  200  km,  and  depth  about 
6  km.  Compared  with  Valles  Marineris,  the  Grand 
Canyon  is  merely  a  scratch  on  the  surface. 

Ancient  riverbeds  (Fig.  8.15),  too  small  to  be 
seen  from  the  Earth,  were  also  discovered  by 
spacecraft.  Rivers  were  probably  formed  soon  af¬ 
ter  the  formation  of  Mars  itself,  when  there  was 
a  great  deal  of  water  and  the  atmospheric  pres¬ 
sure  and  temperature  were  higher.  At  present,  the 
temperature  and  air  pressure  on  Mars  are  too  low 
for  free  water  to  exist,  although  there  have  been 
speculations  on  warm  weather  cycles  in  the  more 
recent  history  of  the  planet.  The  mean  temper¬ 
ature  is  now  below  —50  °C  and,  on  a  warm 
summer  day,  the  temperature  can  rise  close  to 
zero  near  the  equator.  Most  of  the  water  is  con¬ 
tained  in  kilometres  deep  permafrost  below  the 
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Fig.  8.14  The  panorama  taken  by  the  Opportunity  rover  shows  structures  resembling  terrestrial  sediments  that  could 
have  been  caused  by  water.  (NASA/JPL/Caltech) 


Fig.  8.15  Volcanoes,  impact  craters  and  rivers,  (a)  Mars 
Global  Surveyor  wide-angle  view  of  Olympus  Mons  in 
April  1998.  (b)  Small  impact  craters  and  sand  dunes  with 
a  resolution  of  1 .5  m  per  pixel.  The  picture  covers  a  1 .5  km 
wide  portion  of  Isidis  Planitia.  (c)  Three  major  valley  sys¬ 
tems  east  of  the  Hellas  plains.  These  valleys  have  probably 

surface  and  in  the  polar  caps.  The  theory  was  con¬ 
firmed  in  2002,  when  the  Mars  Odyssey  space¬ 
craft  detected  a  large  supply  of  subsurface  wa¬ 
ter  ice  of  a  wide  area  near  the  south  pole.  The 
ice  is  mixed  into  the  soil  a  meter  below  the  sur¬ 
face.  Two  rovers,  Spirit  and  Opportunity,  oper¬ 
ating  on  the  Mars  discovered  in  2004  minerals 
like  hematite  and  goethite  which  proved  the  ex¬ 
istence  of  liquid  water  on  the  surface  of  Mars. 
The  period  when  the  liquid  water  existed  is  un¬ 
known. 

The  polar  caps  are  composed  both  of  water 
and  carbon  dioxide  ice.  The  northern  cap  is  al¬ 


been  formed  by  large  outbursts  of  liquid  water  but  the  age 
of  the  valleys  is  unknown.  The  valleys  are  all  roughly  1  km 
deep  and  10-40  km  wide.  The  picture  covers  an  area  ap¬ 
proximately  800  km  across.  (Mars  Global  Surveyor,  2000) 
(NASA/JPL/Malin  Space  Science  Systems) 


most  season-independent,  extending  down  to  lati¬ 
tude  70°.  On  the  other  hand,  the  southern  cap, 
which  reaches  to  the  latitude  —60°  in  the  south¬ 
ern  winter,  disappears  almost  totally  during  the 
summer.  The  southern  cap  consists  mostly  of 
CO2  ice.  The  permanent  parts  are  of  ordinary  wa¬ 
ter  ice,  since  the  temperature,  —73  °C,  is  too  high 
for  CO2  ice.  The  water  ice  layers  can  be  hundreds 
of  metres  thick. 

The  dark  areas  are  not  vegetation,  but  loose 
dust,  moved  around  by  strong  winds.  These  winds 
raise  the  dust  high  into  the  atmosphere,  colouring 
the  Martian  sky  red.  The  Mars  landers  have  re- 
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Fig.  8.16  Phobos  (left) 
and  Deimos,  the  two 
moons  of  Mars.  They  can 
be  captured  asteroids. 
(NASA) 


vealed  a  reddish  regolithic  surface,  scattered  with 
boulders  (Fig.  8.14).  The  red  colour  is  caused 
mainly  by  iron  oxide,  rust;  already  in  the  1950’s, 
the  existence  of  limonite  (2  Fe03  3  H2O)  was  de¬ 
duced  from  polarisation  measurements.  The  on¬ 
site  analysis  showed  that  the  soil  consists  of 
13  %  iron  and  21  %  silicon.  The  abundance  of 
sulphur  was  found  to  be  ten  times  that  found  on 
the  Earth. 

The  interior  of  Mars  is  not  well  known. 
Mars  has  probably  a  dense  core  approximately 
1500  km  in  radius,  a  molten  rocky  mantle  which 
is  denser  than  the  Earth’s  mantle  and  a  thin  crust. 
The  crust  is  80  km  thick  in  the  southern  hemi¬ 
sphere  but  only  about  35  km  thick  in  the  northern 
one.  The  low  mean  density  compared  with  other 
terrestrial  planets  may  indicate  that  in  addition  to 
iron  the  core  contains  a  relatively  large  fraction 
of  sulphur. 

The  Mars  Global  Surveyor  confirmed  in  1997 
a  weak  magnetic  field.  It  is  probably  a  remnant  of 
an  earlier  global  field  that  has  since  disappeared. 
This  has  important  implications  for  the  structure 
of  Mars’  interior.  There  are  no  electric  currents 
creating  a  magnetic  field  and  therefore  the  core 
may  be  (at  least  partially)  solid. 

Three  biological  experiments  of  the  Viking 
landers  in  1976  searched  for  signs  of  life.  No  or¬ 
ganic  compounds  were  found — however,  the  bi¬ 
ological  tests  did  give  some  unexpected  results. 
A  closer  look  at  the  results  indicated  no  life,  but 
some  uncommon  chemical  reactions. 


Mars  has  two  moons,  Phobos  and  Deimos 
(Fig.  8.16).  The  size  of  Phobos  is  roughly  27  km  x 
21  km  x  19  km,  and  the  orbital  period  around 
Mars  is  only  7  h  39  min.  In  the  Martian  sky,  Pho¬ 
bos  rises  in  the  west  and  sets  in  the  east.  Deimos 
is  smaller.  Its  diameter  is  15  km  x  12  km  x  1 1  km. 
The  orbital  period  of  Deimos  is  slightly  over  30 
hours,  or  just  six  hours  longer  than  the  rotation 
period  of  Mars.  The  synodic  period  of  Deimos  is 
nearly  5.5  days,  and  so  it  will  stay  over  2.5  days 
above  the  Martian  horizon. 

There  are  craters  on  both  moons.  Polarimetric 
and  photometric  results  show  that  they  are  com¬ 
posed  of  material  resembling  carbonaceous  chon- 
drite  meteorites. 


8.5  Jupiter 

The  realm  of  terrestrial  planets  ends  at  the  aster¬ 
oid  belt.  Outside  this,  the  relative  abundance  of 
volatile  elements  is  higher  and  the  original  com¬ 
position  of  the  solar  nebula  is  still  preserved  in 
the  giant  planets.  The  first  and  largest  is  Jupiter. 
Its  mass  is  2.5  times  the  total  mass  of  all  other 
planets,  almost  1/1000  of  the  solar  mass.  The 
bulk  of  Jupiter  is  mainly  hydrogen  and  helium. 
The  relative  abundance  of  these  elements  are  ap¬ 
proximately  the  same  as  in  the  Sun,  and  the  den¬ 
sity  is  of  the  same  order  of  magnitude,  namely 
1330  kgm-3. 

During  oppositions,  the  angular  diameter  of 
Jupiter  is  as  large  as  50".  The  dark  belts  and 
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Fig.  8.1 7  A  composed 
image  of  Jupiter  taken  by 
the  Cassini  spacecraft  in 
December  2000.  The 
resolution  is  1 14  km/pixel. 
The  dark  dot  is  the  shadow 
of  the  moon  Europa. 
(NASA/JPL/University  of 
Arizona) 


lighter  zones  are  visible  even  with  a  small  tele¬ 
scope.  These  are  cloud  formations,  parallel  to  the 
equator  (Fig.  8.17).  The  most  famous  detail  is  the 
Great  Red  Spot ,  a  huge  cyclone,  rotating  coun¬ 
terclockwise  once  every  six  days.  The  spot  was 
discovered  by  Giovanni  Cassini  in  1655;  it  has 
survived  for  centuries,  but  its  true  age  is  unknown 
(Fig.  8.18). 

The  rotation  of  Jupiter  is  rapid;  one  revolu¬ 
tion  takes  9  h  55  min  29.7  s.  This  is  the  period 
determined  from  the  variation  of  the  magnetic 
held,  and  it  reflects  the  speed  of  Jupiter’s  inte¬ 
riors  where  the  magnetic  held  is  born.  As  might 
be  expected,  Jupiter  does  not  behave  like  a  rigid 
body.  The  rotation  period  of  the  clouds  is  about 
hve  minutes  longer  in  the  polar  region  than  at  the 
equator.  Due  to  its  rapid  rotation,  Jupiter  is  non- 
spherical;  battening  is  as  large  as  1/15. 

There  is  possibly  an  iron-nickel  core  in  the 
centre  of  Jupiter.  The  mass  of  the  core  is  proba¬ 
bly  equal  to  a  few  tens  of  Earth  masses.  The  core 
is  surrounded  by  a  layer  of  metallic  liquid  hy¬ 
drogen,  where  the  temperature  is  over  10,000  K 


and  the  pressure,  three  million  atm.  Owing  to  this 
huge  pressure,  the  hydrogen  is  dissociated  into 
single  atoms,  a  state  unknown  in  ordinary  labora¬ 
tory  environments.  In  this  exotic  state,  hydrogen 
has  many  features  typical  of  metals.  This  layer 
is  electrically  conductive,  giving  rise  to  a  strong 
magnetic  held.  Closer  to  the  surface  where  the 
pressure  is  lower,  the  hydrogen  is  present  as  nor¬ 
mal  molecular  hydrogen,  H2.  At  the  top  there  is 
a  1000  km  thick  atmosphere. 

The  atmospheric  state  and  composition  of 
Jupiter  has  been  accurately  measured  by  the 
spacecraft.  In  situ  observations  were  obtained  in 
1995,  when  the  probe  of  the  Galileo  spacecraft 
was  dropped  into  Jupiter’s  atmosphere.  It  sur¬ 
vived  nearly  an  hour  before  crushing  under  the 
pressure,  collecting  the  hrst  direct  measurements 
of  Jupiter’s  atmosphere. 

Belts  and  zones  are  stable  cloud  formations. 
Their  width  and  colour  may  vary  with  time,  but 
the  semi-regular  pattern  can  be  seen  up  to  the  lat¬ 
itude  50°.  The  colour  of  the  polar  areas  is  close  to 
that  of  the  belts.  The  belts  are  reddish  or  brown- 
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Fig.  8.18  Jupiter’s  Great 
Red  Spot  and  its 
surroundings  with  several 
smaller  ovals  as  seen  by 
Voyager  1  in  1979.  Cloud 
details  of  160  kilometres 
are  visible.  (NASA) 


ish,  and  the  motion  of  the  gas  inside  a  belt  is 
downward.  The  gas  flows  upward  in  the  white 
zones.  The  clouds  in  the  zones  are  slightly  higher 
and  have  a  lower  temperature  than  those  in  the 
belts.  Strong  winds  or  jet  streams  blow  along  the 
zones  and  belts.  The  speed  of  the  wind  reaches 
150  m/s  at  some  places  in  the  upper  atmosphere. 
According  to  the  measurements  of  the  Galileo 
probe,  the  wind  speeds  in  the  lower  cloud  lay¬ 
ers  can  reach  up  to  500  m/s.  This  indicates  that 
the  winds  in  deeper  atmospheric  layers  are  driven 
by  the  outflowing  flux  of  the  internal  heat,  not  the 
solar  heating. 

The  colour  of  the  Great  Red  Spot  (GRS)  re¬ 
sembles  the  colour  of  the  belts  (Fig.  8.18).  Some¬ 
times  it  is  almost  colourless,  but  shows  no  signs 
of  decrepitude.  The  GRS  is  14,000  km  wide  and 
30,000-40,000  km  long.  Some  smaller  red  and 
white  spots  can  also  be  observed  on  Jupiter,  but 
their  lifetime  is  generally  much  less  than  a  few 
years. 

The  ratio  of  helium  to  hydrogen  in  the  deep 
atmosphere  is  about  the  same  as  in  the  Sun.  The 


results  of  the  Galileo  spacecraft  gave  consider¬ 
ably  higher  abundance  than  previous  estimates. 
It  means  that  there  are  no  significant  differenti¬ 
ation  of  helium,  i.e.  helium  is  not  sinking  to  the 
interior  of  the  planet  as  was  expected  according 
to  the  earlier  results.  Other  compounds  found  in 
the  atmosphere  include  methane,  ethane  and  am¬ 
monia.  The  temperature  in  the  cloud  tops  is  about 
130  K. 

Jupiter  radiates  twice  the  amount  of  heat  that 
it  receives  from  the  Sun.  This  heat  is  a  remnant 
of  the  energy  released  in  the  gravitational  con¬ 
traction  during  the  formation  of  the  planet.  Thus 
Jupiter  is  still  gradually  cooling.  The  internal  heat 
is  transferred  outward  by  convection;  this  gives 
rise  to  flows  in  the  metallic  hydrogen,  causing  the 
strong  magnetic  field. 

The  ring  of  Jupiter  (Fig.  8.19)  was  discovered 
in  1979.  The  innermost  toroid-shaped  halo  is  be¬ 
tween  92,000-122,500  km  from  Jupiter’s  centre. 
It  consists  of  dust  falling  from  the  main  ring  to¬ 
ward  the  planet.  The  main  ring  extends  from  the 
halo  boundary  out  to  about  128,940  km,  just  in- 
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Fig.  8.19  Mosaic  of  Jupiter’s  ring  system  taken  by  the 
Galileo  spacecraft  when  the  spacecraft  was  in  Jupiter’s 
shadow  looking  back  toward  the  Sun.  Jupiter’s  ring  sys¬ 
tem  is  composed  of  three  parts:  a  thin  outermost  ring,  a  flat 

side  the  orbit  of  the  moon  Adrastea.  The  ring 
particles  are  small,  a  few  microns  only,  and  they 
scatter  light  forward  much  more  effectively  than 
backward.  Therefore,  they  were  not  discovered 
prior  the  Voyager  flyby.  A  ring  consisting  of  such 
small  particles  cannot  be  stable,  and  new  material 
must  enter  the  ring  continuously.  Possible  sources 
are  the  small  moons  Adrastea  and  Metis. 

The  two  faint  outermost  rings  are  fairly  uni¬ 
form  in  nature.  The  inner  of  them  extends  from 
the  orbit  of  Adrastea  out  to  the  orbit  of  Amalthea 
at  181,000  km.  The  fainter  outermost  ring  ex¬ 
tends  out  to  Thebe’s  orbit  at  221,000  km. 

Jupiter’s  rings  and  moons  exist  within  an  in¬ 
tense  radiation  belt  of  Jupiter’s  magnetic  field. 
The  magnetosphere  extends  3-7  million  kilome¬ 
tres  toward  the  Sun,  depending  on  the  strength 
of  the  solar  wind.  In  the  opposite  direction  it 
stretches  to  a  distance  of  at  least  750  million  kilo¬ 
metres,  behind  Saturn’s  orbit. 

Jupiter  is  an  intense  radio  source.  Its  radio 
emission  can  be  divided  into  three  components, 
namely  thermal  millimetre  and  centimetre  radia¬ 
tion,  nonthermal  decimetric  radiation  and  burstal- 
decametric  radiation.  The  nonthermal  emission  is 
most  interesting;  it  is  partly  synchrotron  radia¬ 
tion,  generated  by  relativistic  electrons  in  the  Jo¬ 
vian  magnetosphere.  Its  intensity  varies  in  phase 
with  Jupiter’s  rotation;  thus  the  radio  emission 
can  be  used  for  determining  the  exact  rotation 


main  ring,  and  an  innermost  doughnut- shaped  halo.  These 
rings  are  made  up  of  dust- sized  particles  that  originate 
from  Io,  or  are  blasted  off  from  the  nearby  inner  satellites 
by  small  impacts.  (NASA/University  of  Arizona) 

rate.  The  decametric  bursts  are  related  to  the  po¬ 
sition  of  the  innermost  large  moon,  Io,  and  are 
possibly  generated  by  the  electric  current  of  mil¬ 
lions  of  Amperes  observed  between  Jupiter  and 
the  plasma  torus  at  the  orbit  of  Io.  Also  auroras 
are  common  in  Jupiter  (Fig.  8.20). 

In  the  beginning  of  year  2016  there  were 
67  known  moons  of  Jupiter,  but  probably  more 
small  satellites  will  be  found  in  the  future.  The 
four  largest,  Io ,  Europa ,  Ganymede  and  Callisto 
are  called  the  Galilean  satellites  (Fig.  8.21),  in 
honour  of  Galileo  Galilei,  who  discovered  them 
in  1610.  The  Galilean  satellites  can  already  be 
seen  with  ordinary  binoculars.  They  are  the  size 
of  the  Moon  or  even  planet  Mercury.  The  other 
moons  are  small,  most  of  them  only  a  few  kilo¬ 
metres  in  diameter. 

Owing  to  tidal  forces,  the  orbits  of  Io,  Eu¬ 
ropa  and  Ganymede  have  been  locked  into  a  res¬ 
onance,  so  that  their  longitudes  X  strictly  satisfy 
the  equation 

T-Io  3A.Europa  T  2A-Ganyme(ie  —  180  .  (8.1) 

Hence  the  moons  can  never  be  in  the  same  direc¬ 
tion  when  seen  from  Jupiter. 

Io  is  the  innermost  Galilean  satellite.  It  is  a  lit¬ 
tle  larger  than  the  Moon.  Its  surface  is  spotted  by 
numerous  calderas,  volcanoes  without  a  moun¬ 
tain.  The  molten  material  is  ejected  up  to  a  height 
of  250  km,  and  a  part  of  the  gas  gets  into  Io’s  or- 
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Fig.  8.20  Left:  NASA  Hubble  Space  Telescope  close-up 
view  of  an  aurora  on  Jupiter.  The  image  shows  the  main 
oval  of  the  aurora,  centred  over  the  magnetic  north  pole, 
and  diffuse  emissions  inside  the  polar  cap.  (NASA,  John 
Clarke/University  of  Michigan)  Right:  The  image  taken 

bit.  The  volcanic  activity  on  Io  is  much  stronger 
than  on  the  Earth.  There  is  a  100  m  bulk  of 
the  permanent  tide  raised  by  Jupiter.  Due  to 
the  orbital  perturbations  caused  by  Europa  and 
Ganymede  the  orbit  of  Io  is  slightly  elliptical  and 
therefore  the  orbital  speed  varies.  The  tidal  bulk 
is  forced  to  move  with  respect  to  the  surface.  This 
generates  friction,  which  is  transformed  to  heat. 
This  heat  keeps  the  sulphur  compounds  molten 
beneath  the  colourful  surface  of  Io.  No  traces  of 
impact  craters  are  visible.  The  whole  surface  is 
new,  being  renewed  continuously  by  eruptions. 
There  is  no  water  on  Io. 

Europa  is  the  smallest  of  the  Galilean  satel¬ 
lites,  a  little  smaller  than  the  Moon.  The  surface 
is  ice-covered  and  the  geometric  albedo  is  as  high 
as  0.6.  The  surface  is  smooth  with  only  a  few  fea¬ 
tures  more  than  a  hundred  metres  high.  Most  of 
the  markings  seem  to  be  albedo  features  with  very 
low  relief.  Only  a  few  impact  craters  have  been 
found  indicating  that  the  surface  is  young.  The 
surface  is  renewed  by  fresh  water,  trickling  from 
the  internal  ocean.  Galileo  spacecraft  has  found 
a  very  weak  magnetic  field.  The  field  varies  pe¬ 
riodically  as  it  passes  through  Jupiter’s  magnetic 
field.  This  shows  that  there  is  a  conducting  mate¬ 
rial  beneath  Europa’ s  surface,  most  likely  a  salty 
ocean  that  could  even  be  100  km  deep.  At  the  cen¬ 
tre,  there  is  a  solid  silicate  core. 


on  January  2001  by  NASA’s  Cassini  spacecraft  shows  the 
bubble  of  charged  particles  trapped  in  the  magnetosphere. 
The  magnetic  field  and  the  torus  of  the  ionised  mate¬ 
rial  from  the  volcanoes  of  Io  are  drawn  over  the  image. 
(NASA/JPL/Johns  Hopkins  University) 

Ganymede  is  the  largest  moon  in  the  solar  sys¬ 
tem.  Its  diameter  is  5300  km;  it  is  larger  than 
the  planet  Mercury.  The  density  of  craters  on  the 
surface  varies,  indicating  that  there  are  areas  of 
different  ages.  Ganymede’s  surface  is  partly  very 
old,  highly  cratered  dark  regions,  and  somewhat 
younger  but  still  ancient  lighter  regions  marked 
with  an  extensive  array  of  grooves  and  ridges. 
They  have  a  tectonic  origin,  but  the  details  of  the 
formations  are  unknown.  About  50  %  of  the  mass 
of  the  moon  is  water  or  ice,  the  other  half  being 
silicates  (rocks).  Contrary  to  Callisto,  Ganymede 
is  differentiated:  a  small  iron  or  iron/sulphur  core 
surrounded  by  a  rocky  silicate  mantle  with  an 
icy  (or  liquid  water)  shell  on  top.  Ganymede  has 
a  weak  magnetic  field. 

Callisto  is  the  outermost  of  the  large  moons. 
It  is  dark;  its  geometric  albedo  is  less  than  0.2. 
Callisto  seems  to  be  undifferentiated,  with  only 
a  slight  increase  of  rock  toward  the  centre.  About 
40  %  of  Callisto  is  ice  and  60  %  rock/iron.  The 
ancient  surface  is  peppered  by  meteorite  craters; 
no  signs  of  tectonic  activity  are  visible.  How¬ 
ever,  there  have  been  some  later  processes,  be¬ 
cause  small  craters  have  mostly  been  obliterated 
and  ancient  craters  have  collapsed. 

The  currently  known  moons  can  be  divided 
into  four  different  groups:  small  moons  inside  the 
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Fig.  8.21  (Top)  The  Galilean  satellites  of  Jupiter.  From 
left  to  right :  Io,  Europa,  Ganymede,  and  Callisto 
(NASA/DLR).  (Right  page  top )  Surface  details  of  Io  and 


Europa.  (Right  page  bottom )  Surface  details  of  Ganymede 
and  Callisto.  (NASA/Brown  University,  NASA/JPL) 
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Fig.  8.22  Saturn  and  its  rings.  Three  satellites  (Tethys, 
Dione,  and  Rhea)  are  seen  to  the  left  of  Saturn,  and  the 
shadows  of  Mimas  and  Tethys  are  visible  on  Saturn’s 
cloud  tops.  (NASA/JPL) 

orbits  of  the  Galilean  satellites,  the  Galilean  satel¬ 
lites,  and  two  groups  of  irregular  moons  outside 
the  orbit  of  the  Galilean  satellites.  The  moons  of 
the  inner  distant  group  move  on  direct  orbits,  and 
the  moons  of  the  most  distant  group  on  retrograde 
orbits.  It  is  possible  that  many  of  these  are  small 
asteroids  captured  by  Jupiter. 


8.6  Saturn 

Saturn  (Fig.  8.22)  is  the  second  largest  planet.  Its 
diameter  is  about  120,000  km,  ten  times  the  diam¬ 
eter  of  the  Earth,  and  the  mass,  95  Earth  masses. 
The  density  is  only  700  kgm-3,  less  than  the 
density  of  water.  The  rotation  axis  is  tilted  about 
27°  with  respect  to  the  orbital  plane,  so  every  15 
years,  the  northern  or  the  southern  pole  is  well 
observable. 

The  rotation  period  is  10  h  39.4  min,  deter¬ 
mined  from  the  periodic  variation  of  the  magnetic 


Fig.  8.23  A  schematic  drawing  of  the  structure  of  the 
Saturnian  rings.  The  brightest  A  and  B  rings  can  be  seen 
even  with  a  small  telescope;  the  other  ones  are  very  faint 

field  by  the  Voyager  spacecraft  in  198 1 .  However, 
Cassini  spacecraft  observed  in  2004  the  period 
of  10  h  45  min.  The  reason  for  the  change  is  un¬ 
known.  Due  to  the  rapid  rotation,  Saturn  is  flat¬ 
tened;  the  flattening  is  1/10,  which  can  be  easily 
seen  even  with  a  small  telescope. 

The  internal  structure  of  Saturn  resembles  that 
of  Jupiter.  Due  to  its  smaller  size,  the  metallic  hy¬ 
drogen  layer  is  not  so  thick  as  on  Jupiter.  The 
thermal  radiation  of  Saturn  is  2.8  times  that  of 
the  incoming  solar  flux.  The  heat  excess  origi¬ 
nates  from  the  differentiation  of  helium.  The  he¬ 
lium  atoms  are  gradually  sinking  inward  and  the 
released  potential  energy  is  radiated  out  as  a  ther¬ 
mal  radiation.  The  abundance  of  helium  in  Sat¬ 
urn’s  atmosphere  is  only  about  half  of  that  on 
Jupiter. 

The  winds,  or  jet  streams,  are  similar  to  those 
of  Jupiter  but  Saturn’s  appearance  is  less  colour¬ 
ful.  Viewed  from  the  Earth,  Saturn  is  a  yellowish 
disk  without  any  conspicuous  details.  The  clouds 
have  fewer  features  than  those  on  Jupiter,  be¬ 
cause  a  haze,  composed  of  hydrogen,  ammonium 
and  methane  floats  above  the  cloud  tops.  Further¬ 
more,  Saturn  is  farther  from  the  Sun  than  Jupiter 
and  thus  has  a  different  energy  budget. 

The  temperature  at  the  cloud  tops  is  about 
94  K.  Close  to  the  equator  the  wind  speeds  ex¬ 
ceed  400  m/s  and  the  zone  in  which  the  direc¬ 
tion  of  the  wind  remains  the  same  extends  40° 
from  the  equator.  Such  high  speeds  cannot  be  ex¬ 
plained  with  external  solar  heat,  but  the  reason  for 
the  winds  is  the  internal  flux  of  heat. 

Saturn’s  most  remarkable  feature  is  a  thin  ring 
system  (Figs.  8.23,  8.24),  lying  in  the  planet’s 
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equatorial  plane.  The  Saturnian  rings  can  be  seen 
even  with  a  small  telescope.  The  rings  were  dis¬ 
covered  by  Galileo  Galilei  in  1610;  only  45  years 
later  did  Christian  Huygens  establish  that  the  for¬ 
mation  observed  was  actually  a  ring,  and  not 
two  oddly  behaving  bulbs,  as  they  appeared  to 
Galileo.  In  1857  James  Clerk  Maxwell  showed 
theoretically  that  the  rings  cannot  be  solid  but 
must  be  composed  of  small  particles. 

The  rings  are  made  of  normal  water  ice.  The 
size  of  the  ring  particles  ranges  from  microns  to 
truck- size  chunks.  Most  of  the  particles  are  in 
range  of  centimetres  to  metres.  The  width  of  the 
ring  system  is  more  than  60,000  km  (about  the  ra¬ 
dius  of  Saturn)  and  the  thickness,  at  most  100  m, 
and  possibly  only  a  few  metres.  Cassini  space¬ 
craft  discovered  also  molecular  oxygen  around 
the  rings,  probably  as  a  product  of  the  disintegra¬ 
tion  of  water  ice  from  the  rings. 

According  to  Earth-based  observations,  the 
rings  are  divided  into  three  parts,  called  simply 
A,  B,  and  C.  The  innermost  C  ring  is  17,000  km 
wide  and  consists  of  very  thin  material.  There  is 
some  material  even  inside  this  (referred  to  as  the 
D  ring),  and  a  haze  of  particles  may  extend  down 
to  the  clouds  of  Saturn. 

The  B  ring  is  the  brightest  ring.  Its  total  width 
is  26,000  km,  but  the  ring  is  divided  into  thou¬ 
sands  of  narrow  ringlets,  seen  only  by  the  space¬ 
craft.  From  the  Earth,  the  ring  seems  more  or  less 


Fig.  8.24  At  a  close  distance,  the  rings  can  be  seen  to  be 
divided  into  thousands  of  narrow  ringlets.  (JPL/NASA) 


uniform.  Between  B  and  A,  there  is  a  3000  km 
wide  gap,  the  Cassini  division.  It  is  not  totally 
void,  as  was  previously  believed;  some  material 
and  even  narrow  ringlets  have  been  found  in  the 
division  by  the  Voyager  space  probes. 

The  A  ring  is  not  divided  into  narrow  ringlets 
as  clearly  as  the  B  ring.  There  is  one  narrow  but 
obvious  gap,  Encke ’s  division ,  close  to  the  outer 
edge  of  the  ring.  The  outer  edge  is  very  sharp,  due 
to  the  “shepherd”  moon,  some  800  km  outside  the 
ring.  The  moon  prevents  the  ring  particles  from 
spreading  out  to  larger  orbits.  It  is  possible  that 
the  appearance  of  B  is  due  to  yet-undiscovered 
moonlets  inside  the  ring. 

The  F  ring,  discovered  in  1979,  is  about 
3000  km  outside  A.  The  ring  is  only  a  few  hun¬ 
dred  kilometres  wide.  On  both  sides  there  is 
a  small  moon;  these  shepherds  prevent  the  ring 
from  spreading.  An  interior  moon  passing  a  ring 
particle  causes  the  particle  to  move  to  a  larger  or¬ 
bit.  Similarly,  at  the  outer  edge  of  the  ring,  a  sec¬ 
ond  moon  forces  the  particles  inward.  The  net 
result  is  that  the  ring  is  kept  narrow. 

Outside  the  F  ring,  there  are  some  zones  of 
very  sparse  material,  sometimes  referred  to  as  the 
G  and  E  rings.  These  are  merely  collections  of 
small  particles. 

The  Saturnian  rings  were  possibly  formed  to¬ 
gether  with  Saturn  and  are  not  debris  from  some 
cosmic  catastrophe,  like  remnants  of  a  broken 
moon.  The  total  mass  of  the  rings  is  10-7  of  the 
mass  of  Saturn.  If  all  ring  particles  were  collected 
together,  they  would  form  an  ice  ball,  600  km  in 
diameter. 

In  2009  a  very  sparse  ring  was  observed  at 
the  distance  of  the  moon  Phoebe.  It  has  the  same 
inclination  as  Phoebe,  nearly  30°.  It  has  proba¬ 
bly  originated  in  material  shattered  from  Phoebe 
by  micrometeorites.  The  ring  extends  inward  to 
the  orbit  of  Iapetus.  The  hemispheres  of  Iapetus 
are  quite  different,  one  side  is  very  dark.  Since 
Iapetus  always  turns  the  same  side  towards  Sat¬ 
urn,  only  one  hemisphere  is  hit  by  ring  particles. 
Eventually  ice  evaporates  away  leaving  only  the 
dark  cinder. 

A  total  of  62  moons  (in  2016)  of  Saturn 
are  known.  Many  of  the  large  Saturnian  moons 
(Fig.  8.25)  were  observed  by  Pioneer  1 1  and  Voy¬ 
ager  1  and  2  spacecrafts.  Some  of  the  large  moons 
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Fig.  8.25  Saturnian  moons  photographed  by  the  Cassini 
spacecraft  in  2005-2006  (a)  Hyperion,  (b)  Enceladus, 
(c)  Iapetus  and  (d)  Tethys.  (e)  A  radar  picture  of  the  north¬ 
like  Enceladus  and  Tethys  are  composed  mainly 
of  ice.  The  temperature  of  the  primeval  nebula  at 
the  distance  of  Saturn  was  so  low  that  bodies  of 
pure  ice  could  form  and  survive. 

Some  moons  are  dynamically  interesting; 
some  have  an  exotic  geological  past.  Outside  the 
F  ring,  there  are  two  moonlets,  Epimetheus  and 
Janus ,  almost  in  the  same  orbit;  the  difference  of 
the  semimajor  axes  is  about  50  km,  less  than  the 
radii  of  the  moons.  The  inner  moon  is  gaining  on 
the  outer  one.  The  moons  will  not  collide,  since 


em  latitudes  of  Titan,  taken  by  Cassini  in  summer  2006. 
The  black  patches  are  probably  methane  lakes.  The  width 
of  the  picture  is  about  450  km.  (Photo  NASA) 

the  speed  of  the  trailing  moon  increases  and  the 
moon  moves  outward.  The  speed  of  the  leading 
moon  decreases  and  it  drops  inward.  The  moons 
exchange  their  roles  roughly  every  four  years. 
There  are  also  several  shepherding  satellites,  like 
Atlas ,  Prometheus  and  Pandora  that  keep  rings  in 
their  place.  Their  gravitational  pull  prevents  ring 
particles  from  drifting  away. 

The  innermost  of  the  “old”  moons  is  Mimas. 
There  is  a  huge  crater,  named  Herschel,  on  Mi¬ 
mas’  surface  with  a  diameter  of  100  km  and 
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Fig.  8.25  (Continued) 


a  depth  of  9  km  (Fig.  8.25).  Bigger  craters  exist 
in  the  solar  system,  but  relative  to  the  size  of  the 
parent  body,  this  is  almost  the  biggest  crater  there 
could  be  room  for  (otherwise  the  crater  would  be 
bigger  than  Mimas  itself).  On  the  opposite  side, 
some  grooves  can  be  seen,  possibly  signifying 
that  impact  has  almost  torn  the  moon  apart. 

The  surface  of  the  next  moon,  Enceladus ,  con¬ 
sists  of  almost  pure  ice,  and  one  side  is  nearly 
craterless.  Craters  and  grooves  can  be  found  on 
the  other  hemisphere.  Tidal  forces  result  in  vol¬ 
canic  activity  where  water  (not  lava  or  other  “hot” 
material)  is  discharged  to  the  surface. 

Titan  is  the  largest  of  the  Saturnian  moons. 
Its  diameter  is  5100  km,  so  it  is  only  slightly 
smaller  than  Jupiter’s  moon  Ganymede.  Titan  is 
the  only  moon  with  a  dense  atmosphere.  The  at¬ 
mosphere  is  mainly  nitrogen  (99  %)  and  methane, 
and  the  pressure  at  the  surface  is  1.6  bar.  The 
temperature  is  about  90  K.  Reddish  clouds  form 
the  visible  surface  some  200  km  above  the  solid 
body.  The  Huygens  lander  made  measurements 
and  sent  images  of  Titan  in  2005.  It  detected  lakes 
of  methane  as  well  as  geysers  ejecting  methane, 
ammonia  and  water. 


8.7  Uranus 

The  planets  from  Mercury  to  Saturn  were  already 
known  in  antiquity.  The  next  planet,  Uranus,  can¬ 
not  be  seen  with  the  naked  eye. 


The  famous  German-English  amateur  astrono¬ 
mer  William  Herschel  discovered  Uranus  in  1781 . 
Herschel  himself  first  thought  that  the  new  ob¬ 
ject  was  a  comet.  However,  the  extremely  slow 
motion  revealed  that  the  body  was  far  beyond 
the  orbit  of  Saturn.  Based  on  the  first  observa¬ 
tions,  the  Finnish  astronomer  Anders  Lexell  cal¬ 
culated  a  circular  orbit.  He  was  one  of  the  first 
to  propose  that  the  newly  discovered  object  was 
a  planet.  Johann  Bode  of  the  Berlin  Observatory 
suggested  the  name  Uranus  but  more  than  five 
decades  passed  before  the  name  was  unanimously 
accepted. 

The  mean  distance  of  Uranus  is  19  au,  and  the 
orbital  period  84  years.  The  inclination  of  the  ro¬ 
tation  axis  is  98°,  which  is  totally  different  from 
the  other  planets.  Due  to  this  uncommon  geom¬ 
etry,  the  poles  are  either  lit  or  in  darkness  for 
decades.  The  rotation  period,  confirmed  by  the 
Voyager  2  magnetometric  measurements  in  1986, 
is  17.3  hours;  the  exact  period  had  been  uncertain 
prior  to  the  fly-by. 

Uranus  is  greenish,  as  viewed  through  a  tele¬ 
scope.  Its  colour  is  due  to  the  strong  methane  ab¬ 
sorption  bands  in  the  near-infrared.  A  part  of  the 
red  light  is  also  absorbed,  leaving  the  green  and 
blue  part  of  the  spectrum  untouched.  Uranus  is 
almost  featureless  (Fig.  8.26)  because  its  clouds 
are  below  a  thick  haze  or  smog.  The  origin  of  the 
smog  may  be  methane  that  solar  radiation  broke 
into  radicals  which  then  formed  e.g.  asethylene 
and  ethane. 
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Fig.  8.26  Two  views  of  Uranus.  The  left  picture  shows  Uranus  surrounded  by  its  rings.  Also  10  satellites  are  vis- 
Uranus  as  it  would  appear  to  the  naked  eye.  (NASA).  ible  in  the  original  picture.  (Seidelmann,  U.S.  Naval  Ob- 
At  the  right  there  is  a  Hubble  Space  Telescope  view  of  servatory,  and  NASA) 


Fig.  8.27  Left:  The  rings  of  Uranus  are  very  narrow  and 
composed  of  a  dark  material.  Nine  rings  are  visible  in  the 
picture  of  Voyager  in  1986.  Right:  Rings  seen  in  the  light 

The  strong  limb  darkening  caused  by  the  smog 
makes  the  terrestrial  determination  of  the  Uranus’ 
size  difficult.  Therefore,  the  radius  was  not  accu¬ 
rately  determined  until  1977  during  a  stellar  oc- 
cultation  caused  by  Uranus.  The  rings  of  Uranus 
were  discovered  at  the  same  time. 

The  internal  structure  of  Uranus  is  thought 
to  be  slightly  different  from  that  of  other  giant 


scattered  forward  when  the  Voyager  spacecraft  was  in  the 
shadow  of  the  planet.  (NASA) 


planets.  Above  the  innermost  rocky  core,  there  is 
a  layer  of  water,  which,  in  turn,  is  surrounded  by 
a  mantle  of  hydrogen  and  helium.  The  mixture  of 
water  and  ammonia  and  methane  therein  are  dis¬ 
sociated  to  ions  under  the  heavy  pressure.  This 
mixture  behaves  more  like  a  molten  salt  than  wa¬ 
ter.  The  convection  flows  in  this  electrically  con¬ 
ductive  “sea”  give  rise  to  the  Uranian  magnetic 
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Fig.  8.28  Four  Uranian  moons  (from  top  left  to  lower  right):  Miranda,  Ariel,  Titania  and  Umbriel.  (NASA) 


field.  The  strength  of  the  magnetic  field  at  the 
cloud  tops  is  comparable  to  the  terrestrial  field. 
However,  Uranus  is  much  larger  then  the  Earth, 
so  the  true  strength  of  the  field  is  50  times  greater 
than  that  of  the  Earth.  The  Uranian  magnetic  field 
is  tilted  59°  with  respect  to  the  rotation  axis.  No 
other  planet  has  such  a  highly  inclined  magnetic 
field. 

The  Uranian  rings  (Fig.  8.27)  were  discovered 
in  1977,  during  a  stellar  occultation.  Secondary 
occultations  were  observed  before  and  after  the 
main  event.  A  total  of  13  rings  are  known,  nine  of 


which  were  discovered  in  the  occultation.  The  in¬ 
nermost  ring  is  broad  and  diffuse.  All  other  rings 
are  dark  and  very  narrow,  only  a  few  hundred  me¬ 
tres  or  a  few  kilometres  wide.  The  Voyager  2  re¬ 
sults  showed  that  the  rings  contain  very  little  dust, 
unlike  those  of  Jupiter  and  Saturn.  The  mean  size 
of  the  ring  particles  is  more  than  1  metre.  The  ring 
particles  are  darker  than  practically  any  known 
material  in  the  solar  system;  the  cause  of  this  dark 
colour  is  unknown. 

There  are  27  moons  (2016  number)  orbiting 
around  Uranus,  ten  of  which  were  discovered  by 
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Voyager  2.  The  geological  history  of  some  moons 
is  puzzling,  and  many  features  reminiscent  of  an 
active  past  can  be  found. 

The  innermost  of  the  large  moons,  Miranda , 
is  one  of  the  most  peculiar  objects  discovered 
(Fig.  8.28).  It  has  several  geological  formations 
also  found  elsewhere  (but  here  they  are  all  mixed 
together),  in  addition  to  the  quite  unique  V- 
shaped  formations.  It  is  possible  that  Miranda’s 
present  appearance  is  the  result  of  a  vast  colli¬ 
sion  that  broke  the  moon  apart;  some  pieces  may 
have  later  settled  down,  inside  out.  Another  pe¬ 
culiar  object  is  Umbriel.  It  belongs  to  the  ever 
increasing  family  of  unusual  dark  bodies  (such  as 
the  Uranian  rings,  one  side  of  Iapetus  and  Hal¬ 
ley’s  comet).  The  dark  surface  of  Umbriel  is  cov¬ 
ered  by  craters  without  any  traces  of  geological 
activity. 


8.8  Neptune 

The  orbit  of  Uranus  was  already  well  known  in 
the  beginning  of  the  19th  century.  However,  some 
unknown  perturbations  displaced  Uranus  from 
its  predicted  orbit.  Based  on  these  perturbations, 
John  Couch  Adams ,  of  Cambridge,  and  Urbain 


Jean-Joseph  Le  Verrier ,  of  Paris,  independently 
predicted  the  position  of  the  unknown  perturbing 
planet. 

The  new  planet  was  discovered  in  1 846  by  Jo¬ 
hann  Gottfried  Galle  at  the  Berlin  Observatory; 
Le  Verrier’ s  prediction  was  found  to  be  only  1° 
off.  The  discovery  gave  rise  to  a  heated  contro¬ 
versy  as  to  who  should  be  given  the  honour  of 
the  discovery,  since  Adams’  calculations  were 
not  published  outside  the  Cambridge  Observa¬ 
tory.  When  the  quarrel  was  settled  years  later, 
both  men  were  equally  honoured.  The  discovery 
of  Neptune  was  also  a  great  triumph  of  the  New¬ 
tonian  theory  of  gravitation. 

The  semimajor  axis  of  the  orbit  of  Neptune 
is  30  au  and  the  orbital  period  around  the  Sun 
165  years.  The  internal  rotation  period,  confirmed 
by  Voyager  2  in  1989,  is  16  hours  7  minutes 
and  the  rotation  period  of  the  outer  layers  of  the 
clouds  is  about  17  hours.  The  obliquity  of  the  ro¬ 
tation  axis  is  29°  but  the  magnetic  field  is  tilted 
some  47°  with  respect  to  the  rotation  axis.  The 
magnetic  field  is  tilted  like  in  Uranus,  but  the  field 
strength  is  much  smaller. 

The  density  of  Neptune  is  1638  kgm-3,  and 
the  diameter  49,500  km.  Thus  the  density  of  Nep- 


Fig.  8.29  (Left)  Neptune  shows  more  features  than 
Uranus.  In  the  picture  of  Voyager  2  the  Great  Dark  Spot, 
accompanied  by  bright,  white  clouds  is  well  visible.  Their 
appearance  is  changing  rapidly.  To  the  south  of  the  Great 
Dark  Spot  is  a  bright  feature  and  still  farther  south  is  an¬ 
other  dark  spot.  Each  feature  moves  eastward  at  a  different 


velocity.  (Right)  Details  of  the  Southern  Dark  Spot.  The 
V-shaped  structure  near  the  right  edge  of  the  bright  area 
indicates  that  the  spot  rotates  clockwise.  Unlike  the  Great 
Red  Spot  on  Jupiter,  which  rotates  counterclockwise,  the 
material  in  the  Neptune’s  dark  oval  will  be  descending. 
(NASA/JPL) 
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Fig.  8.30  The  rings  of  Neptune.  Ring  particles  are  small 
and  best  visible  in  the  forward  scattered  light.  There  are 
several  brightening s  in  the  outermost  ring.  One  of  the  rings 
appears  to  have  a  twisted  structure.  Neptune  at  left  is  over¬ 
exposed.  (NASA/JPL) 

tune  is  higher  than  that  of  other  giant  planets.  The 
internal  structure  is  quite  simple:  The  core,  com¬ 
posed  of  silicates  (rocks)  is  about  16,000  km  in 
diameter.  This  is  surrounded  by  a  layer  of  wa¬ 
ter  and  liquid  methane  and  the  outermost  gaseous 
layer,  atmosphere,  is  mainly  composed  of  hydro¬ 
gen  and  helium,  methane  and  ethane  being  a  mi¬ 
nor  components. 

Cloud  structures  are  more  complicated  than  on 
Uranus,  and  some  dark  spots,  like  in  Jupiter,  were 
visible  during  the  Voyager  fly-by  (Fig.  8.29).  The 
speed  of  the  winds  are  high,  up  to  400  m/s. 

Like  other  giants,  Neptune  also  has  rings 
(Fig.  8.30).  The  rings  were  discovered  by  Voy¬ 
ager  2,  although  their  existence  was  already  ex¬ 
pected  prior  the  fly-by.  Two  relatively  bright  but 
very  narrow  rings  are  at  a  distance  of  53,000  and 
62,000  km  from  the  centre  of  the  planet.  More¬ 
over,  there  are  some  faint  areas  of  fine  dust. 

There  are  14  known  moons,  six  of  which  were 
discovered  by  Voyager  2.  Before  the  Voyager  fly- 


Fig.  8.31  The  southern  hemisphere  of  Triton,  Neptune’s 
largest  satellite  in  a  picture  taken  in  1989  by  Voyager  2. 
The  dark  spots  may  indicate  eruptions  of  “icy  volcanoes”. 
Voyager  2  images  showed  active  geyser-like  eruptions 
spewing  nitrogen  gas  and  dark  dust  particles  several  kilo¬ 
metres  into  the  atmosphere.  (NASA) 

by,  only  two  of  them,  Triton  and  Nereid,  were 
known. 

The  largest  of  the  moons,  Triton ,  is  2700  km 
in  diameter,  and  it  has  a  thin  atmosphere,  mainly 
composed  of  nitrogen.  The  albedo  is  high:  Triton 
reflects  60-80  %  of  the  incident  light.  The  sur¬ 
face  is  relatively  young,  without  any  considerable 
impact  craters  (Fig.  8.31).  There  are  some  ac¬ 
tive  “geysers”  of  liquid  nitrogen,  which  partly  ex¬ 
plains  the  high  albedo  and  the  lack  of  the  craters. 
The  low  surface  temperature  of  Triton,  37  K, 
means  that  the  nitrogen  is  solid  and  covers  the 
surface  like  snow.  It  is  the  lowest  surface  temper¬ 
ature  known  in  the  solar  system. 


8.9  Dwarf  Planets 

Besides  actual  planets  there  are  some  planet  like 
objects  that  orbit  the  Sun  and  are  nearly  spheri¬ 
cal  but  have  not  managed  to  clear  the  neighbour¬ 
hood  of  their  orbits.  The  upper  and  lower  lim¬ 
its  to  the  size  and  mass  of  dwarf  planets  are  not 
strictly  specified.  The  lower  limit,  however,  is  de¬ 
termined  by  the  hydrostatic  equilibrium  shape, 
but  the  size  at  which  this  happens  may  vary  ac¬ 
cording  to  the  composition  and  history  of  the  ob¬ 
ject.  It  is  estimated  that  up  to  40-50  dwarf  planets 
will  be  discovered  in  coming  years. 
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Fig.  8.32  A  small  portion  of  the  pair  of  pictures  where  Pluto  was  discovered  in  1930.  The  planet  is  marked  with  an 
arrow.  (Lowell  Observatory) 


Currently,  there  are  five  dwarf  planets  in 
the  Solar  System,  namely  Ceres ,  Pluto ,  Eris , 
Haumea ,  and  Makemake.  Ceres  was  formerly 
counted  as  an  asteroid,  Pluto  was  a  planet  and 
Eris  (2003  UB  313,  known  also  by  the  nickname 
Xena)  was  the  first  Trans-Neptunian  object  which 
turned  out  to  be  larger  than  Pluto. 

Pluto  Pluto  was  discovered  in  1930  at  the  Low¬ 
ell  Observatory,  Arizona,  after  an  extensive  pho¬ 
tographic  search  (Fig.  8.32).  This  search  had  al¬ 
ready  been  initiated  in  the  beginning  of  the  cen¬ 
tury  by  Percival  Lowell ,  on  the  basis  of  the  pertur¬ 
bations  observed  in  the  orbits  of  Uranus  and  Nep¬ 
tune.  Finally,  Clyde  Tombaugh  discovered  Pluto 
less  than  6°  off  the  predicted  position.  However, 
Pluto  turned  out  to  be  far  too  small  to  cause  any 
perturbations  on  Uranus  or  Neptune.  Thus  the 
discovery  was  purely  accidental,  and  the  pertur¬ 
bations  observed  were  not  real,  but  caused  by  mi¬ 
nor  errors  of  old  observations. 

The  orbit  of  Pluto  is  different  from  planetary 
orbits.  The  eccentricity  is  0.25  and  the  inclination 
is  17°.  During  its  250  year  orbit,  Pluto  is  closer 
to  the  Sun  than  Neptune  for  20  years;  one  such 
period  lasted  from  1979  to  1999.  There  is  no  dan¬ 
ger  of  Pluto  and  Neptune  colliding,  since  Pluto 
is  high  above  the  ecliptic  when  at  the  distance  of 
Neptune.  Pluto’s  orbital  period  is  in  a  3:2  reso¬ 
nance  with  Neptune. 

Pluto  has  no  visible  disk  as  seen  with  terres¬ 
trial  telescopes;  instead,  it  resembles  a  point,  like 


a  star.  This  fact  gave  an  upper  limit  for  the  di¬ 
ameter  of  Pluto,  which  turned  out  to  be  about 
3000  km.  The  exact  mass  was  unknown  until 
the  discovery  of  the  Plutonian  moon,  Charon ,  in 
1978.  The  mass  of  Pluto  is  only  0.2  %  of  the 
mass  of  the  Earth.  The  orbital  period  of  Charon 
is  6.39  days,  and  this  is  also  the  period  of  rota¬ 
tion  of  both  bodies.  Pluto  and  Charon  rotate  syn¬ 
chronously,  each  turning  the  same  side  towards 
the  other  body.  The  rotation  axis  of  Pluto  is  close 
to  the  orbital  plane:  the  tilt  is  122°. 

Mutual  occultations  of  Pluto  and  Charon  in 
1985-1987  gave  accurate  diameters  of  each  body: 
The  diameter  of  Pluto  is  2300  km  and  that  of 
Charon,  1200  km.  The  density  of  Pluto  turned 
out  to  be  about  2100  kgm-3.  Thus  Pluto  is  not 
a  huge  iceball  but  about  2/3  of  its  mass  is  com¬ 
posed  of  rocks.  The  relatively  small  abundance  of 
ices  is  possibly  due  to  the  low  temperature  dur¬ 
ing  the  planetary  accretion  when  most  of  the  free 
oxygen  was  combined  with  carbon  forming  car¬ 
bon  monoxide.  The  computed  lower  limit  for  wa¬ 
ter  ice  is  about  30  %  which  is  fairly  close  to  the 
value  observed  in  Pluto. 

Pluto  has  a  thin  methane  atmosphere  and  there 
is  possibly  a  thin  haze  over  the  surface.  The  sur¬ 
face  pressure  is  10~5-10-6  atm.  It  has  been  spec¬ 
ulated  that  when  Pluto  is  far  from  perihelion,  the 
whole  atmosphere  will  become  frozen  and  fall  on 
the  surface. 

Pluto  has  five  satellites.  Two  of  them  were  dis¬ 
covered  by  the  Hubble  Space  Telescope  in  2005 
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Fig.  8.33  The  Hubble  Space  Telescope  photographed 
Pluto  and  its  three  known  moons  in  February  2006.  The 
smaller  moons  were  found  in  2005,  and  were  later  named 
Nix  and  Hydra.  Their  diameter  is  estimated  as  40-160 
km.  (M.  Mutchler  (STScI),  A.  Stem  (SwRI),  and  the  HST 
Pluto  Companion  Search  Team,  ESA,  NASA) 


Fig.  8.34  The  only  detailed  images  of  Pluto  were  sent  by 
the  New  Horizons  probe  in  2015.  The  light  heart  shaped 
area  was  named  the  Tombaugh  region  to  honour  the  finder 
of  Pluto.  The  dark  area  to  the  left  of  is  called  the  Cthulhu 
region;  Cthulhu  is  a  mythical  demon  in  H.P.  Lovecraft’s 
horror  stories  (NASA) 


ing  the  limit  to  be  in  the  hydrostatic  equilibrium. 
Thus  it  is  no  more  an  asteroid  but  a  dwarf  planet. 

The  average  opposition  magnitude  of  Ceres  is 
7.9.  Thus  it  can  be  seen  with  binoculars  but  not 
with  the  naked  eye. 

Nearby  images  taken  by  a  space  probe  show  a 
few  highly  reflecting  spots. 

Eris  In  2003  a  Trans-Neptunian  object  was 
found,  initially  labelled  as  2003  UB313.  One  of 
the  initial  nicknames  was  Xena  according  to  the 
heiress  of  a  tv  series.  When  a  moon  orbiting  Xena 
was  found,  it  was  obviously  names  as  Gabrielle. 
However,  these  names  never  became  official.  The 
bigger  object  was  given  the  name  Eris  and  its 
companion  Dysnomia. 

The  size  of  Eris  is  similar  to  that  of  Pluto.  The 
size  is,  however,  an  estimate  based  on  infrared  ra¬ 
diation.  Yet  it  was  big  enough  to  be  classified  as 
a  planet  if  Pluto  had  retained  its  planetary  status. 
Now  Eris  is  a  dwarf  planet.  The  semimajor  axis 
of  the  orbit  is  97  au,  orbital  period  560  years  and 
inclination  45°. 


8.10  Minor  Bodies 

In  addition  to  planets,  moons  orbiting  the  planets, 
dwarf  planets  and  other  biggish  objects  the  solar 
system  contains  a  lot  of  smaller  objects  that  can 
be  collectively  called  minor  bodies. 

The  class  of  minor  bodies  includes  aster¬ 
oids ,  comets ,  meteoroids ,  and  interplanetary  dust. 
However,  there  is  no  clear  distinction  between  the 
groups.  Some  asteroids  are  cometlike,  some  as¬ 
teroids  coming  close  to  the  Earth  may  have  earlier 
been  comets  that  have  lost  all  of  their  volatile  ma¬ 
terial.  The  classification  is  based  more  on  the  vi¬ 
sual  appearance  and  tradition  than  physical  prop¬ 
erties. 


(Fig.  8.33).  They  orbit  Pluto  counterclockwise 
twice  the  distance  of  Charon.  The  New  Horizons 
probe  found  two  more  moons. 

Ceres  Ceres  was  the  first  asteroid  discovered  in 
1801  by  Giuseppe  Piazzi .  It  is  also  the  largest  of 
the  main  belt  asteroids  moving  inside  Jupiter’s  or¬ 
bit.  Its  diameter  is  about  1000  km,  thus  exceed¬ 


8.1 1  Asteroids 

Asteroids  form  a  large  and  scattered  group  of 
Sun-orbiting  bodies.  The  oldest  and  best-known 
group  form  the  main  asteroid  belt  between  Mars 
and  Jupiter,  with  distances  of  2. 2-3. 3  au  from  the 
Sun  (Fig.  8.38).  The  most  distant  asteroids  are  far 
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Fig.  8.35  Top :  Ceres  was  the  first  detected  asteroid. 
Spacecraft  images  shows  its  spherical  shape,  and  so  it  be¬ 
came  a  dwarf  planet.  Middle'.  Vesta  is  clearly  nonspheri- 
cal  and  has  remained  as  a  minor  body.  It  is  the  brightest 
asteroid:  at  opposition  under  ideal  conditions  it  could  be 
seen  even  with  the  naked  eye.  Bottom :  The  asteroid  (243) 
Ida  and  a  small  moon  Dactyl  orbiting  the  asteroid.  The  ir¬ 
regular  shape  is  typical  for  most  asteroids.  Ida  is  the  first 
asteroid  known  to  have  a  satellite 


beyond  the  orbit  of  Pluto,  called  Trans-Neptunian 
objects.  There  are  also  a  number  of  asteroids  that 
come  closer  to  the  Sun  than  the  Earth. 

Earlier  asteroids  were  called  minor  planets  or 
even  planetoids.  These  names  were  actually  more 
natural  than  asteroid  that  only  means  that  the  ob¬ 
ject  looks  a  starlike  dot,  although  it  has  nothing 
to  do  with  stars. 

The  first  asteroid  was  found  in  1801  (Ceres, 
now  a  dwarf  planet),  and  at  the  beginning  of 
2016  there  were  more  than  750,000  catalogued 
and  15,000  named  asteroids.  There  were  also  a 
huge  number  of  asteroids  with  an  approximate  or¬ 
bit.  The  number  of  catalogued  asteroids  increases 
currently  by  thousands  every  month.  It  has  been 
estimated  that  more  than  about  half  a  million  as¬ 
teroids  larger  than  1  km  exist  in  the  Solar  Sys¬ 
tem.  Diameters  of  asteroids  vary  from  hundreds 
of  meters  to  hundreds  of  kilometers.  The  largest 
asteroid  Ceres  is  classified  as  a  dwarf  planet  and 
the  border  between  smallest  asteroids  and  mete¬ 
oroids  is  not  specified. 

An  asteroid  observer  needs  a  telescope,  since 
even  the  brightest  asteroids  are  too  faint  to  be 
seen  with  the  naked  eye.  Asteroids  are  points  of 
light  like  a  star,  even  if  seen  through  a  large  tele¬ 
scope;  only  their  slow  motion  against  the  stellar 
background  reveals  that  they  are  members  of  the 
solar  system. 

The  rotation  of  an  asteroid  gives  rise  to  a  reg¬ 
ular  light  variation.  The  amplitude  of  light  varia¬ 
tion  is  in  most  cases  well  below  1  magnitude  and 
typical  rotation  periods  range  from  4  to  15  hours. 

The  characteristics  of  the  main  belt  asteroids 
are  best  known.  Total  mass  of  the  main  belt  as¬ 
teroids  is  less  than  1/1000  of  the  mass  of  the 
Earth.  The  centre  of  the  asteroid  belt  is  at  a  dis¬ 
tance  of  approximately  2.8  au,  as  predicted  by  the 
Titius-Bode  law  (Sect.  7.11).  According  to  a  for¬ 
merly  popular  theory,  asteroids  were  thought  to 
be  debris  from  the  explosion  of  a  planet.  This  the¬ 
ory,  like  catastrophe  theories  in  general,  has  been 
abandoned. 

The  currently  accepted  theory  assumes  that  as¬ 
teroids  were  formed  simultaneously  with  the  ma¬ 
jor  planets.  The  primeval  asteroids  were  large 
chunks,  most  of  them  orbiting  between  the  orbits 
of  Mars  and  Jupiter.  Due  to  mutual  collisions  and 
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Fig.  8.36  Left:  Asteroid  (951)  Gaspra  was  photographed 
by  the  Galileo  spacecraft  in  October  1991.  The  illumi¬ 
nated  part  of  the  asteroid  is  about  16x12  km.  The  smallest 
craters  in  this  view  are  about  300  m  across.  Right:  A  mo- 


Fig.  8.37  Quasimoons  or  pseudomoons  are  asteroid 
moving  on  rather  peculiar  orbits.  They  are  not  real  moons, 
since  they  orbit  the  Sun,  but  the  orbital  period  is  very  close 
to  that  of  some  planet.  Thus  the  asteroid  seems  to  oscillate 
around  the  planet.  A  quasimoon  moves  on  an  eccentric  or¬ 
bit  inside  and  outside  the  orbit  of  the  planet.  Eventually  the 
asteroid  will  drift  away  from  the  planet  but  will  return  after 
a  long  time.  Venus  and  Neptune  each  have  one  quasimoon 
and  the  Earth  has  several.  The  orbits  of  the  quasimoons 
have  been  studied  e.g.  Seppo  Mikkola  in  Finland  and  the 
Canadians  Kimmo  Innanen  and  Paul  Wiegert 


saic  of  asteroid  (433)  Eros  was  taken  by  the  NEAR  space¬ 
craft  from  a  distance  of  200  km.  The  crater  on  top  is  about 
5  km  in  diameter.  The  NEAR  spacecraft  orbited  Eros  for 
one  year  and  finally  landed  on  it  in  2001.  (JPL/NASA) 

fragmentation,  the  present  asteroids  are  debris  of 
those  primordial  bodies  which  were  never  able  to 
form  a  large  planet.  Some  of  the  biggest  aster¬ 
oids  may  be  those  original  bodies.  The  orbital  el¬ 
ements  of  some  asteroids  are  very  similar.  These 
are  called  the  Hirayama families.  They  are  proba¬ 
bly  remnants  of  a  single,  large  body  that  was  bro¬ 
ken  into  a  group  of  smaller  asteroids.  There  are 
tens  of  identified  Hirayama  families,  the  largest 
ones  including  Hungarias,  Floras,  Eos,  Themis, 
and  Hildas  (named  after  the  main  asteroid  in  the 
group). 

The  distribution  of  asteroids  inside  the  aster¬ 
oid  belt  is  uneven  (Fig.  8.38);  they  seem  to  avoid 
some  areas  known  as  the  Kirkwood  gaps.  The 
most  prominent  void  areas  are  at  distances  where 
the  orbital  period  of  an  asteroid  around  the  Sun 
(given  by  Kepler’s  third  law)  is  in  the  ratio  1:3, 
2:5,  3:7,  or  1:2  to  the  orbital  period  of  Jupiter. 
The  motion  of  an  asteroid  orbiting  in  such  a  gap 
would  be  in  resonance  with  Jupiter,  and  even 
small  perturbations  would  tend  to  grow  with  time. 
The  body  would  eventually  be  moved  to  another 
orbit.  However,  the  resonance  effects  are  not  so 
simple:  sometimes  an  orbit  is  “locked”  to  a  reso- 
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Fig.  8.38  (a)  Most  of  the 
asteroids  orbit  the  Sun  in 
the  asteroid  belt  between 
Mars  and  Jupiter.  The 
figure  shows  the  positions 
of  about  96,000  catalogued 
asteroids  on  January  1, 
2000,  and  the  orbits  and 
the  positions  of  some 
major  planets.  The  orbital 
elements  of  the  asteroids 
are  from  the  Lowell 
Observatory  data  base. 

(b)  The  total  number  of 
asteroids  as  a  function  of 
the  distance  from  the  Sun. 
Each  bin  corresponds  to 
0.1  AU.  The  empty  areas, 
the  Kirkwood  gaps,  are  at 
those  points,  where  the 
orbital  period  of  an  asteroid 
is  in  a  simple  ratio  to  the 
orbital  period  of  Jupiter 


nance,  e.g.  the  Trojans  move  along  the  same  orbit 
as  Jupiter  (1:1  resonance),  and  the  Hilda  group  is 
in  the  2:3  resonance. 

Many  groups  of  asteroids  orbit  the  Sun  outside 
the  main  belt.  These  include  the  above-mentioned 
Trojans,  which  orbit  60°  behind  and  ahead  of 
Jupiter.  The  Trojans,  which  are  close  to  the  spe¬ 
cial  points  L4  and  L5  of  the  solution  of  the  re¬ 
stricted  three-body  problem.  At  these  Lagrangian 
points,  a  massless  body  can  remain  stationary 
with  respect  to  the  massive  primaries  (in  this  case, 
Jupiter  and  the  Sun).  In  fact,  the  asteroids  are 
oscillating  around  the  stationary  points,  but  the 
mean  orbits  can  be  shown  to  be  stable  against  per¬ 
turbations. 

Another  large  family  is  the  Apollo-Amor  as¬ 
teroids.  The  perihelia  of  Apollo  and  Amor  are  in¬ 


side  the  Earth’s  orbit  and  between  the  orbits  of  the 
Earth  and  Mars,  respectively.  These  asteroids  are 
all  small,  less  than  30  km  in  diameter.  The  most 
famous  is  433  Eros  (Fig.  8.36),  which  was  used  in 
the  early  20th  century  for  determining  the  length 
of  the  astronomical  unit.  When  closest  to  the 
Earth,  Eros  is  at  a  distance  of  only  20  million  km 
and  the  distance  can  be  directly  measured  using 
the  trigonometric  parallax.  Some  of  the  Apollo- 
Amor  asteroids  could  be  remnants  of  short-period 
comets  that  have  lost  all  their  volatile  elements. 

There  is  a  marginal  probability  that  some 
Earth-crossing  asteroids  will  collide  with  the 
Earth.  It  has  been  estimated  that,  on  the  average, 
a  collision  of  a  large  asteroid  causing  a  global 
catastrophe  may  take  place  once  in  one  million 
years.  Collisions  of  smaller  bodies,  causing  dam- 
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Fig.  8.39  The  Kuiper  Belt 
is  a  disk- shaped  cloud  of 
distant  icy  bodies  inside 
the  halo  of  the  Oort  cloud. 
The  short-period  comets 
originate  in  the  Kuiper  belt, 
whereas  a  huge  amount  of 
icy  bodies  that  form 
a  source  of  long  period 
comets  resides  in  the  Oort 
cloud  (see  Sect.  8.12). 
(JPL/NASA) 


age  similar  to  a  nuclear  bomb,  may  happen  once 
per  century.  It  has  been  estimated  that  there  are 
500-1000  near-Earth  asteroids  larger  than  one 
kilometre  in  diameter  but  possibly  tens  of  thou¬ 
sands  smaller  objects.  Programs  have  been  started 
to  detect  and  catalogue  all  near-Earth  asteroids 
and  to  predict  the  probabilities  of  hazardous  col¬ 
lisions. 

Distant  asteroids  form  the  third  large  group 
outside  the  main  asteroid  belt.  The  first  asteroid 
belonging  to  this  group  (2060)  Chiron,  was  dis¬ 
covered  in  1977.  Chiron’s  aphelion  is  close  to  the 
orbit  of  Uranus  and  the  perihelion  is  slightly  in¬ 
side  the  orbit  of  Saturn.  Distant  asteroids  are  very 
faint  and  thus  difficult  to  find. 

Already  in  the  1950’s  Gerard  Kuiper  sug¬ 
gested  that  comet-like  debris  from  the  formation 
of  the  solar  system  can  exist  beyond  the  orbit  of 
Neptune  as  an  additional  source  of  comets  to  the 
more  distant  Oort  cloud.  Later,  computer  simula¬ 
tions  of  the  solar  system’s  formation  showed  that 
a  disk  of  debris  should  form  at  the  outer  edge  of 
the  solar  system.  The  disk  is  now  known  as  the 
Kuiper  belt  (Fig.  8.39). 

The  first  Trans -Neptunian  asteroid  (1992  QB 1) 
was  discovered  in  1992,  and  in  the  beginning  of 


year  2006  there  were  about  1000  known  mem¬ 
bers.  The  total  number  of  Kuiper  belt  objects 
larger  than  100  km  in  diameter  is  estimated  to 
be  over  70,000.  Some  of  them  may  be  even  larger 
than  Pluto.  The  Kuiper  belt  objects  are  remnants 
from  the  early  accretion  phases  of  the  solar  sys¬ 
tem.  Several  of  the  Trans-Neptunian  objects  are 
in  or  near  a  3:2  orbital  period  resonance  with 
Neptune,  the  same  resonance  as  Pluto.  Therefore 
they  are  called  plutinos. 

The  exact  sizes  of  asteroids  were  long  un¬ 
known.  Edward  E.  Barnard  of  the  Lick  Ob¬ 
servatory  determined  visually  the  diameters  of 
(1)  Ceres,  (2)  Vesta,  (3)  Juno,  and  (4)  Pallas  in 
the  1890’s  (Fig.  8.39).  Practically  no  other  re¬ 
liable  results  existed  prior  to  the  1960’s,  when 
indirect  methods  applying  photometry  and  spec¬ 
troscopy  were  adopted.  Moreover,  several  stellar 
occultations  caused  by  asteroids  have  been  ob¬ 
served  since  1980’s. 

The  first  images  of  asteroids  were  obtained  in 
the  early  1990’s.  In  1991  the  Galileo  spacecraft 
passed  asteroid  (951)  Gaspra,  and  in  1993  as¬ 
teroid  (243)  Ida,  on  its  long  way  to  Jupiter  (see 
Sect.  8.15).  Finally,  in  2001,  the  NEAR  space- 
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Fig.  8.40  Sizes  of  some  asteroids  compared  with  the 
Moon.  (Moon  image,  NASA) 

craft  landed  on  asteroid  (433)  Eros  after  orbiting 
it  for  one  year. 

The  images  of  asteroids  (Fig.  8.35)  show  ir¬ 
regular,  crater-filled  bodies  with  regolith  and  pul¬ 
verised  rock  on  their  surface.  Some  asteroids  may 
once  have  been  two  separate  objects  that  merged 
into  one.  In  1992  asteroid  (4179)  Toutatis  passed 
the  Earth  only  by  4  million  kilometres.  Radar  im¬ 
ages  revealed  a  two-body  system,  where  the  com¬ 
ponents  were  touching  each  other.  Double  aster¬ 
oids  may  be  quite  common,  and  there  exist  light 
curves  of  some  asteroids  which  have  been  inter¬ 
preted  as  results  of  twin  bodies.  Another  example 
of  a  twin  asteroid  is  243  Ida  that  has  a  “moon”, 
a  smaller  body  gravitationally  bound  to  it. 

The  composition  of  main  belt  asteroids  is  sim¬ 
ilar  to  that  of  iron,  stone  and  iron-stone  mete¬ 
orites.  Most  asteroids  can  be  divided  into  three 
groups,  according  to  their  photometric  and  po- 
larimetric  properties.  95  %  of  the  classified  aster¬ 
oids  belong  to  the  types  C  and  S  types.  Metal-rich 
M  type  asteroids  are  rarer. 

About  75  percent  of  asteroids  belong  to  the 
type  C  type.  The  C  asteroids  are  dark  due  to 
radiation  darkening  (geometric  albedo  p  &  0.06 
or  less),  and  they  contain  a  considerable  amount 
of  carbon  (mnemonic  C  for  carbon).  They  re¬ 
semble  stony  meteorites .  The  material  is  undif¬ 


ferentiated  and  thus  they  belong  to  the  most  pri¬ 
mordial  bodies  of  the  solar  system.  The  reflec¬ 
tivity  of  silicate-rich  S  asteroids  is  higher  and 
their  spectra  are  close  to  those  of  stone-iron  me¬ 
teorites.  Their  spectra  show  signs  of  silicates, 
such  as  olivine,  e.g.  fosterite  Mg2SiC>4  or  fay- 
alite  Fe2Si04.  M  type  asteroids  have  more  met¬ 
als,  mostly  nickel  and  iron;  they  have  undergone 
at  least  a  partial  differentiation. 

The  compositions  and  even  sizes  of  the  Trans- 
Neptunian  objects  are  difficult  to  determine.  They 
are  dim,  and  due  to  their  low  temperature,  the 
black-body  radiation  maximum  is  around  60  pm. 
This  wavelength  is  almost  impossible  to  observe 
on  the  Earth.  Even  the  estimations  of  the  albedos, 
and  therefore  the  diameter  are  very  uncertain. 

Colours  of  TNOs  range  from  blue-grey  to  red 
and  the  distribution  appears  to  be  uniform.  How¬ 
ever,  population  of  the  low-inclination  objects 
seem  to  be  red  and  high-inclination  objects  blue. 
The  unperturbed  orbits  of  the  low-inclination  ob¬ 
jects  suggest  that  they  represent  a  relic  of  the 
original  population  of  the  Kuiper  belt. 

Interpretations  of  the  spectra  are  ambiguous 
and  spectra  may  not  describe  the  composition  of 
the  whole  object.  The  surface  is  altered  by  in¬ 
tense  radiation,  solar  wind  and  micrometeorites 
and  it  can  be  quite  different  from  the  regolith  and 
deeper  layers  underneath. 

Small  TNOs  are  probably  mixtures  of  rock 
and  ice  with  some  organic  surface  material.  The 
composition  is  similar  to  the  comets.  High  den¬ 
sity  (2000-3000  kgm-3)  of  some  large  objects 
suggests  a  high  non-ice  content,  similar  to  Pluto. 


8.12  Comets 

Comets  are  agglomerates  of  ice,  snow,  and  dust; 
a  typical  diameter  is  of  the  order  of  10  km  or  less. 
The  nucleus  contains  icy  chunks  and  frozen  gases 
with  embedded  rock  and  dust.  At  its  centre,  there 
can  be  a  small,  rocky  core. 

Comets  are  the  only  celestial  objects  named 
after  their  finders.  The  letter  P  (periodic)  preced¬ 
ing  some  names  indicates  that  planetary  perturba¬ 
tions  have  changed  the  orbit  so  that  the  comet  has 
remained  orbiting  the  Sun. 
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A  comet  is  invisible  when  far  from  the  Sun; 
when  it  gets  closer  than  about  2  au,  the  heat  of 
the  Sun  starts  to  melt  the  ice  and  snow.  The  out¬ 
flowing  gas  and  dust  form  an  envelope,  the  coma 
around  the  nucleus.  Radiation  pressure  and  the 
solar  wind  push  ionised  gas  and  dust  away  from 
the  Sun,  resulting  in  the  typical  long-tailed  shape 
of  a  comet  (Fig.  8.42). 

The  tail  is  always  pointing  away  from  the  Sun, 
a  fact  which  was  noticed  in  the  16th  century.  Usu¬ 
ally,  there  are  two  tails,  an  ion  tail  (gas  tail)  and 
a  dust  tail.  The  partly  ionised  gas  and  very  fine 
dust  in  the  ion  tail  are  driven  by  the  solar  wind. 
Some  of  the  light  is  reflected  solar  light,  but  the 
brightness  of  the  ion  tail  is  mostly  due  to  emis¬ 
sion  by  the  excited  atoms.  The  dust  tail  is  caused 
by  the  radiation  pressure.  Because  the  velocities 
of  the  particles  of  the  dust  tail  are  lower  than  the 
velocities  in  the  ion  tail,  the  dust  tail  is  often  more 
curved  than  the  ion  tail. 

Fred  Whipple  introduced  in  1950’s  a  “dirty 
snowball”  theory  to  describe  the  cometary  struc¬ 
ture.  According  to  this  model,  cometary  nuclei 


Fig.  8.41  Lower  right:  A  composite  image  of  the  nu¬ 
cleus  of  comet  P/Halley  taken  by  ESA  Giotto  spacecraft  in 
1986.  The  size  of  the  nucleus  is  approximately  13x7  km. 
Dust  jets  are  originating  from  two  regions  on  the  nucleus. 
(ESA/Max  Planck  Institut  fur  Aeronomie) 


are  composed  of  ice  mixed  with  gravel  and  dust. 
The  observations  have  revealed  that  the  classi¬ 
cal  dirty  snowball  model  is  not  quite  accurate; 
at  least  the  surface  is  more  dirt  than  snow,  also 
containing  organic  compounds.  Several  chemical 
compounds  have  been  observed,  including  water 
ice,  which  probably  makes  up  75-80  %  of  the 
volatile  material.  Other  common  compounds  are 
carbon  monoxide  (CO),  carbon  dioxide  (CO2), 
methane  (CH4),  ammonia  (NH3),  and  formalde¬ 
hyde  (H2CO). 

The  most  famous  (and  also  best  known)  pe¬ 
riodic  comet  is  Halley’s  comet  (Fig.  8.42).  Its  or¬ 
bital  period  is  about  76  years;  it  was  last  in  perihe¬ 
lion  in  1986.  During  the  last  apparition,  the  comet 
was  also  observed  by  spacecraft,  revealing  the 
solid  cometary  body  itself  for  the  first  time.  Hal¬ 
ley  is  a  13  x  7  km,  peanut-shaped  chunk  whose 
surface  is  covered  by  an  extremely  black  layer  of 
a  possibly  tar-like  organic  or  other  similar  mate¬ 
rial.  Violent  outbursts  of  gas  and  dust  make  an  ex¬ 
act  prediction  of  its  brightness  impossible,  as  of¬ 
ten  noticed  when  cometary  magnitudes  have  been 
predicted.  Near  the  perihelion,  several  tons  of  gas 
and  dust  burst  out  every  second. 

Cometary  material  is  very  loose.  Ablation  of 
gas  and  dust,  large  temperature  variations  and 
tidal  forces  sometimes  cause  the  whole  comet  to 
break  apart.  Comet  Shoemaker-Levy  9  which  im¬ 
pacted  into  Jupiter  in  1994  was  torn  apart  two 
years  earlier  when  it  passed  Jupiter  at  a  dis¬ 
tance  of  21,000  km  (Fig.  8.44).  The  impact  of 
Shoemaker-Levy  9  showed  that  there  can  be  den¬ 
sity  variation  (and  perhaps  variation  in  composi¬ 
tion,  too)  inside  the  original  cometary  body. 

Comets  are  rather  ephemeral  things,  surviving 
only  a  few  thousand  revolutions  around  the  Sun 
or  less.  The  short-period  comets  are  all  newcom¬ 
ers  and  can  survive  only  a  short  time  here,  in  the 
central  part  of  the  solar  system. 

Since  comets  in  the  central  solar  system  are 
rapidly  destroyed,  there  has  to  be  some  source 
of  new  short-period  comets.  In  1950  Jan  Oort 
discovered  a  strong  peak  for  aphelia  of  long  pe¬ 
riod  comets  at  a  distance  of  about  50,000  au,  and 
that  there  is  no  preferential  direction  from  which 
comets  come  (Fig.  8.43).  He  proposed  that  there 
is  a  vast  cloud  of  comets  at  the  outer  reaches  of 
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Fig.  8.42  Top:  Comet  Mrkos  in  1957.  The  tail  typical  to 
comets  is  caused  by  the  solar  wind  and  radiation  pressure 
when  the  comet  approaches  the  Sun  (Palomar  Observa- 

the  solar  system,  now  know  as  the  Oort  cloud 
(Fig.  8.45).  The  total  mass  of  the  Oort  cloud  is 
estimated  to  be  tens  of  Earth  masses,  containing 
more  than  10 12  comets. 

A  year  later  Gerard  Kuiper  showed  that  there 
is  a  separate  population  of  comets.  Many  of 
the  short  period  comets,  with  periods  less  than 
200  years,  have  the  orbital  inclination  less  than 
40°,  and  they  orbit  the  Sun  in  the  same  direction 
as  the  Earth.  The  orbital  inclination  of  long  period 
comets  are  not  peaked  around  the  plane  of  the 
ecliptic  but  they  are  more  random.  Kuiper  argued 


tory).  Down :  The  comet  Churyumov-Gerasimenko  is  the 
first  comet  where  a  space  probe  has  landed  and  sent  de¬ 
tailed  closeup  images.  (ESA) 

that  the  short  period  comets  originate  from  a  sep¬ 
arate  population  of  comets  that  resides  in  a  disk¬ 
like  cloud  beyond  the  orbit  of  Neptune.  The  area 
is  now  known  as  the  Kuiper  belt  (Fig.  8.39). 

Occasionally  perturbations  from  passing  stars 
send  some  of  the  comets  in  the  Oort  cloud  into  or¬ 
bits,  which  bring  them  into  the  central  parts  of  the 
solar  system,  where  they  are  seen  as  long-period 
comets.  Around  a  dozen  “new”  comets  are  dis¬ 
covered  each  year.  Most  of  these  are  visible  only 
with  a  telescope,  and  only  a  couple  of  times  per 
decade  one  can  see  a  bright  naked-eye  comet. 
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Some  of  the  long  period  comets  are  put  into 
short  period  orbits  by  the  perturbations  of  Jupiter 
and  Saturn,  whereas  some  others  can  be  ejected 
from  the  solar  system.  However,  there  are  no 
comets  that  have  been  proven  to  come  from  inter¬ 
stellar  space,  and  the  relative  abundances  of  sev¬ 
eral  isotopes  in  the  cometary  matter  are  the  same 
as  in  other  bodies  of  our  solar  system. 

The  origin  of  the  Oort  cloud  and  Kuiper  belt 
is  different.  The  Oort  cloud  objects  were  formed 
near  the  giant  planets  and  have  been  ejected  to 
the  outer  edge  of  the  solar  system  by  gravitational 
perturbations  soon  after  the  formation  of  the  solar 
system.  Small  objects  beyond  the  orbit  of  Nep- 


Fig.  8.43  Orbits  of  short  period  comets  projected  to  the 
plane  of  the  ecliptic 


tune  had  no  such  interactions  and  they  remained 
near  the  accretion  disk. 

In  2014  ESA’s  Rosetta  probe  settled  on  an  or¬ 
bit  around  the  comet  Churyumov-Gerasimenko 
and  sent  the  Philae  lander  to  its  surface.  The  lan¬ 
der  found  several  organic  compounds  and  hard 
ice  below  a  dust  layer  about  20  centimetres  thick. 
The  water  vapour  emanating  from  the  comet  was 
different  from  the  terrestrial  water:  the  ratio  of 
deuterium  to  ordinary  hydrogen  was  three  times 
higher  than  on  the  Earth.  This  suggests  that  the 
source  of  water  on  the  Earth  is  not  comets,  at  least 
not  of  those  like  Churyumov-Gerasimenko. 

8.13  Meteoroids 

Solid  bodies  smaller  than  asteroids  are  called  me¬ 
teoroids.  The  boundary  between  asteroids  and 
meteoroids,  however,  is  diffuse;  it  is  a  matter  of 
taste  whether  a  ten  metre  body  is  called  an  as¬ 
teroid  or  a  meteoroid.  We  could  say  that  it  is  an 
asteroid  if  it  has  been  observed  so  often  that  its 
orbital  elements  are  known. 

When  a  meteoroid  hits  the  atmosphere,  an 
optical  phenomenon,  called  a  meteor  (“shooting 
star”)  is  seen  (Fig.  8.47).  The  smallest  bodies 
causing  meteors  have  a  mass  of  about  1  gram;  the 
(micro)meteoroids  smaller  than  this  do  not  result 
in  optical  phenomena.  However,  even  these  can 
be  observed  with  radar  which  is  able  to  detect 
the  column  of  ionised  air.  Micrometeoroids  can 


Fig.  8.44  Comet  Shoemaker-Levy  9  five  months  before  its  collision  to  Jupiter  as  seen  by  the  Hubble  Space  Telescope. 
(JPL/NASA) 
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Fig.  8.45  A  schematic  diagram  of  the  distribution  of  the 
semimajor  axes  of  long-period  comets.  The  abscissa  is 
the  inverse  of  the  semimajor  axis,  1/a  [AU]-1.  The  Oort 
cloud  is  visible  as  a  strong  peak  at  the  very  small  positive 
values  of  1/a.  The  orbits  shown  here  are  the  “original  or¬ 
bits”,  i.e.  computed  backward  in  time  to  remove  all  known 
perturbations 

also  be  studied  with  particle  detectors  installed 
in  satellites  and  space  crafts.  Bright  meteors  are 
called  bolides. 

The  number  of  meteoroids  increases  rapidly  as 
their  size  diminishes.  It  has  been  estimated  that 
at  least  105  kg  of  meteoritic  material  falls  on  the 
Earth  each  day.  Most  of  this  material  is  microm¬ 
eteoroids  and  causes  no  visible  phenomena. 

Due  to  perspective,  all  meteors  coming  from 
the  same  direction  seem  to  radiate  from  the  same 
point.  Such  meteor  streams  (meteor  showers)  are, 
e.g.  the  Perseids  in  August  and  the  Geminides  in 
December;  the  names  are  given  according  to  the 
constellation  in  which  the  radiation  point  seems 
to  be.  On  the  average,  one  can  see  a  few  sporadic 
meteors  (Fig.  8.46)  per  hour.  During  a  strong  me¬ 
teor  shower  one  can  see  even  tens  of  meteors  per 
minute,  although  a  normal  rate  is  some  tens  per 
hour. 

Most  of  the  meteoroids  are  small  and  burn  to 
ashes  at  a  height  of  100  km.  However,  larger  bod¬ 
ies  may  come  through  and  fall  to  the  Earth.  These 
are  called  meteorites.  The  relative  speed  of  a  typi¬ 
cal  meteoroid  varies  in  the  range  10-70  km/s.  The 
speed  of  the  largest  bodies  does  not  diminish  in 
the  atmosphere;  thus,  they  hit  the  Earth  at  their 
cosmic  speeds,  resulting  in  large  impact  craters. 
Smaller  bodies  slow  down  and  drop  like  stones 


Fig.  8.46  When  the  Earth  is  moving  on  its  orbit,  the 
morning  hemisphere  is  the  leading  one.  Hence  most  of  the 
meteoroids  coming  from  random  directions  hit  the  morn¬ 
ing  hemisphere.  Thus  most  meteors  can  be  seen  between 
midnight  and  sunrise 


Fig.  8.47  Meteors  are  easy  to  capture  on  him:  one  just 
leaves  a  camera  loaded  with  a  sensitive  him  on  a  tripod 
with  the  shutter  open  for  an  hour  or  so.  Stars  make  curved 
trails  on  the  him.  (L.  Hakkinen) 

but  impacts  of  large  bodies  (diameter  meters  or 
more)  may  cause  large-scale  disaster. 

Iron  meteorites  or  irons,  composed  of  almost 
pure  nickel-iron,  comprise  about  one  quarter  of 
all  meteorites.  Actually  the  irons  are  in  a  minor¬ 
ity  among  meteoroids,  but  they  survive  their  vio¬ 
lent  voyage  through  the  atmosphere  more  easily 
than  weaker  bodies.  Three-quarters  are  stony  me¬ 
teorites ,  or  stone-iron  meteorites. 
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Fig.  8.48  A  projection  of 
the  entire  infrared  sky 
created  from  observations 
of  the  COBE  satellite.  The 
bright  horizontal  band  is 
the  Milky  Way.  The  dust  of 
the  solar  system,  visible  on 
the  Earth  as  zodiacal  light 
is  an  S- shaped  glow  across 
the  image.  (G.  Greaney  and 
NASA) 


Meteoroids  themselves  can  be  divided  into 
three  groups  of  roughly  equal  size.  One-third 
is  ordinary  stones,  chondrites.  The  second  class 
contains  weaker  carbonaceous  chondrites  and 
the  third  class  includes  cometary  material,  loose 
bodies  of  ice  and  snow  which  are  unable  to  sur¬ 
vive  down  to  the  Earth. 

Many  meteor  streams  are  in  the  same  orbit  as 
a  known  comet,  so  at  least  some  meteoroids  are  of 
cometary  origin.  Near  a  perihelion  passage,  every 
second  several  tons  of  gravel  is  left  on  the  orbit  of 
a  comet.  There  are  several  examples  of  meteorites 
that  have  their  origin  in  the  Moon  or  Mars.  De¬ 
bris  of  large  impacts  may  have  been  ejected  into 
space  and  finally  ended  up  on  the  Earth.  Some 
meteoroids  are  debris  of  asteroids. 


8.1 4  Interplanetary  Dust  and  Other 
Particles 

Two  faint  light  phenomena,  namely  zodiacal  light 
and  gegenschein  (counterglow)  make  it  possible 
to  observe  interplanetary  dust,  small  dust  parti¬ 
cles  reflecting  the  light  of  the  Sun.  This  weak 
glow  can  be  seen  above  the  rising  or  setting 
Sun  (zodiacal  light)  or  exactly  opposite  the  Sun 
(gegenschein).  The  interplanetary  dust  is  concen¬ 
trated  near  the  plane  of  the  ecliptic.  The  typ¬ 
ical  sizes  of  the  particles  are  in  the  range  of 
10-100  pm. 

Solar  Wind  Elementary  particles  hitting  the 
Earth  originate  both  in  the  Sun  and  outside  the 
solar  system.  Charged  particles,  mainly  protons, 


electrons  and  alpha  particles  (helium  nuclei)  flow 
continuously  out  of  the  Sun.  At  the  distance  of 
the  Earth,  the  average  speed  of  this  solar  wind  is 
about  500  km/s.  The  velocity  and  number  of  par¬ 
ticles  depend  on  eruptions  and  activity  of  the  Sun 
(Sects.  13.3  and  13.4). 

The  charged  particles  interact  with  the  solar 
magnetic  field.  The  strength  of  the  solar  magnetic 
field  at  the  Earth’s  distance  is  about  1/1000  of 
that  of  the  Earth.  Particles  coming  from  outside 
the  solar  system  are  called  cosmic  rays,  which 
contain  both  elementary  particles  and  heavier  nu¬ 
clei. 


8.15  Examples 

Example  8.1  (The  Roche  Limit)  A  French  math¬ 
ematician  Edouard  Roche  computed  in  1848 
a  limit  where  a  moon  will  be  torn  apart  due  to 
the  tidal  forces  if  it  approaches  its  parent  planet. 
Roche  proposed  that  the  Saturnian  rings  were 
formed  in  that  way. 


We  can  compute  the  Roche  limit  for  a  body  at 
a  distance  R  of  a  planet  the  mass  of  which  is  M 
approximating  the  body  with  two  small  spheres 
of  radius  r  and  mass  m.  The  difference  of  grav¬ 
itation  affecting  the  small  spheres  by  the  planet 
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is 


A  F  =  G  Mm 


1 


1 


G  Mm 


( R  —  r)2 
4  r 


(R  +  r)2 


R3' 


The  gravitational  force  between  the  small  spheres 
is 

,  Gm2 
F'  = 


4  r2 


If  AF  >  F' ,  the  small  spheres  will  be  pulled 
apart.  The  forces  are  equal  at  the  Roche  limit: 

4r  Gm2 
G  M  ///  —  =  — — r. 

R3  4  r2 


Thus  the  distance  of  the  Roche  limit  R  is 


16  r3M 


m 


Inserting  the  masses  of  the  planet  and  the  spheres 
in  terms  of  the  radii  of  the  planet  S  and  spheres  r, 
and  assuming  that  the  densities  p  are  equal,  m  = 
^Ttr3 p,  M  =  | TtS3p ,  we  obtain 

R  ~  2.5  x  S. 


Our  result  is  valid  only  for  a  body  without  any 
internal  strength.  Smaller  bodies  with  internal 
strength  can  survive  inside  the  Roche  limit.  You, 
dear  reader,  act  as  an  excellent  example  of  this, 
because  you  read  this  example  well  inside  the 
Roche  limit  of  the  Earth.  A  100  km  stony  asteroid 
will  survive  even  if  it  orbits  the  Earth  just  above 
the  atmosphere  but  a  sphere  of  water  of  the  same 
size  would  break  apart. 


8.16  Exercises 

Exercise  8.1  The  interval  between  two  opposi¬ 
tions  of  a  planet  was  398.9  d.  The  angular  di¬ 


ameter  of  the  planet  at  the  opposition  was  47.2". 
Find  the  sidereal  period,  semimajor  axis,  and  the 
true  diameter  (in  kilometres)  of  the  planet.  Which 
planet  was  it? 

Exercise  8.2  (a)  Assume  that  three  bodies  move 
along  circular  orbits  with  angular  velocities  (mean 
motions)  n\,  112  and  n 3.  Show  that  these  bodies 
have  a  common  synodic  period  if  and  only  if  there 
are  nonzero  integers  k\ ,  k^  and  k 3  such  that 

k\Yl\  +  £2^2  +  ^3^3  =0,  k\  +  £2  +  £3  =  0. 

(b)  The  resonance  of  the  Galilean  satellites  can 
be  expressed  in  terms  of  their  mean  motions  as 

n\o  3/iEur0pa  +  2/lGanyme(ie  =  0. 

Find  the  synodic  period  of  these  three  moons. 

Exercise  8.3  The  rotation  period  of  Mars  is 
24.62  hours  and  the  orbital  period  of  Deimos 
30.30  hours.  How  long  is  the  time  between  two 
successive  risings  of  Deimos  and  how  much 
Deimos  moves  in  one  hour  as  seen  from  Mars? 

Exercise  8.4  Find  the  distance  of  the  Roche 
limit  for  all  planets  and  see  if  any  of  their  moons 
is  orbiting  the  limit.  And  how  are  the  rings  of  gi¬ 
ant  planets  situated  relative  to  the  Roche  limit? 

Exercise  8.5  Assume  that  the  hydrogen  atom  is 
a  solid  sphere  with  a  radius  of  5.3  x  10  11  m  (the 
Bohr  radius)  and  mass  1.67  x  10-27  kg.  Use  such 
balls  to  build  a  regular  (infinite)  grid  where  each 
ball  is  touching  its  six  neighbours.  What  is  the 
density  of  such  matter?  Compare  with  the  density 
of  Jupiter. 
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All  our  information  about  the  physical  proper¬ 
ties  of  stars  comes  more  or  less  directly  from 
studies  of  their  spectra.  In  particular,  by  study¬ 
ing  the  strength  of  various  absorption  lines,  stel¬ 
lar  masses,  temperatures  and  compositions  can  be 
deduced.  The  line  shapes  contain  detailed  infor¬ 
mation  about  atmospheric  processes. 

As  we  have  seen  in  Chap.  3,  the  light  of  a  star 
can  be  dispersed  into  a  spectrum  by  means  of 
a  prism  or  a  diffraction  grating.  The  distribution 
of  the  energy  flux  density  over  frequency  can  then 
be  derived.  The  spectra  of  stars  consist  of  a  con¬ 
tinuous  spectrum  or  continuum  with  narrow  spec¬ 
tral  lines  superimposed  (Fig.  9.1).  The  lines  in 
stellar  spectra  are  mostly  dark  absorption  lines , 
but  in  some  objects  bright  emission  lines  also  oc¬ 
cur. 

In  a  very  simplified  way  the  continuous  spec¬ 
trum  can  be  thought  of  as  coming  from  the  hot 
surface  of  the  star.  Atoms  in  the  atmosphere 
above  the  surface  absorb  certain  characteristic 
wavelengths  of  this  radiation,  leaving  dark  “gaps” 
at  the  corresponding  points  in  the  spectrum.  In 
reality  there  is  no  such  sharp  separation  between 
surface  and  atmosphere.  All  layers  emit  and  ab¬ 
sorb  radiation,  but  the  net  result  of  these  pro¬ 
cesses  is  that  less  energy  is  radiated  at  the  wave¬ 
lengths  of  the  absorption  lines. 

The  spectra  of  stars  are  classified  on  the  ba¬ 
sis  of  the  strengths  of  the  spectral  lines.  Isaac 
Newton  observed  the  solar  spectrum  in  1666,  but, 
properly  speaking,  spectroscopy  began  in  1814 
when  Joseph  Fraunhofer  observed  the  dark  lines 
in  the  spectrum  of  the  Sun.  He  assigned  capi¬ 


tal  letters,  like  D,  G,  H  and  K,  to  some  of  the 
stronger  dark  lines  without  knowing  the  elements 
responsible  for  the  origin  of  the  lines  (Sect.  9.2). 
The  absorption  lines  are  also  known  as  Fraun¬ 
hofer  lines.  In  1860,  Gustav  Robert  Kirchhoff  and 
Robert  Bunsen  identified  the  lines  as  the  charac¬ 
teristic  lines  produced  by  various  elements  in  an 
incandescent  gas. 


9.1  Measuring  Spectra 

The  most  important  methods  of  forming  a  spec¬ 
trum  are  by  means  of  an  objective  prism  or  a  slit 
spectrograph.  In  the  former  case  one  obtains 
a  photograph,  where  each  stellar  image  has  been 
spread  into  a  spectrum.  Up  to  several  hundred 
spectra  can  be  photographed  on  a  single  plate 
and  used  for  spectral  classification.  The  amount 
of  detail  that  can  be  seen  in  a  spectrum  depends 
on  its  dispersion ,  the  range  of  wavelengths  per 
millimetre  on  the  plate  (or  per  pixel  on  a  CCD). 
The  dispersion  of  an  objective  prism  is  a  few 
tens  of  nanometres  per  millimetre.  More  detailed 
observations  require  a  slit  spectrograph,  which 
can  reach  a  dispersion  1-0.01  nm/mm.  The  de¬ 
tailed  shape  of  individual  spectral  lines  can  then 
be  studied. 

The  photograph  of  the  spectrum  is  converted 
to  an  intensity  tracing  showing  the  flux  density  as 
a  function  of  wavelength.  This  is  done  by  means 
of  a  microdensitometer,  measuring  the  amount  of 
light  transmitted  by  the  recorded  spectrum.  Since 
the  blackening  of  a  photographic  plate  is  not  lin¬ 
early  related  to  the  amount  of  energy  it  has  re- 
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Fig.  9.1  A  typical  stellar  spectrum.  The  continuous  spec-  superimposed  on  the  continuum.  The  spectrum  of  the  star, 
trum  is  brightest  at  about  550  nm  and  gets  fainter  towards  r\  Pegasi,  is  very  similar  to  that  of  our  Sun.  (Mt.  Wilson 
shorter  and  longer  wavelengths.  Dark  absorption  lines  are  Observatory) 


b)  588  590  592  594  596  598  600  c)  588  590  592  594  596  598  600 
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Fig.  9.2  (a)  A  section  of  a  photograph  of  a  stellar  spec¬ 
trum  and  the  corresponding  rectified  microdensitometer 
intensity  tracing.  The  original  spectrum  was  taken  at  the 
Crimean  Observatory,  (b)  A  more  extensive  part  of  the 

ceived,  the  measured  blackening  has  to  be  cali¬ 
brated  by  comparison  with  known  exposures. 

Nowadays  CCD  cameras  are  used  also  in  spec¬ 
trographs  and  the  intensity  curve  is  determined 
directly  without  the  intervening  step  of  a  pho¬ 
tographic  plate.  Some  image  processing  is  still 
needed  since  the  height  of  the  spectrum  is  sev¬ 
eral  pixels  and  the  spectrum  may  not  be  aligned 
with  the  pixel  rows  of  the  camera. 

For  measurements  of  line  strengths  the  spec¬ 
trum  is  usually  rectified  by  dividing  by  the  con¬ 
tinuum  intensity.  Figure  9.2  shows  a  photograph 
of  the  spectrum  of  a  star  and  the  intensity  curve 
obtained  from  a  calibrated  and  rectified  micro¬ 


spectrum.  (c)  The  picture  the  intensity  curve  of  the  first 
picture  has  been  rectified  by  normalising  the  value  of  the 
continuum  intensity  to  one.  (Pictures  by  J.  Kyrolainen  and 
H.  Virtanen,  Helsinki  Observatory) 

densitometer  tracing.  The  second  pair  of  pic¬ 
tures  shows  the  intensity  curve  before  and  after 
the  normalisation.  The  absorption  lines  appear  as 
troughs  of  various  sizes  in  the  curve.  In  addition 
to  the  clear  and  deep  lines,  there  are  large  num¬ 
bers  of  weaker  lines  that  can  barely  be  discerned. 
The  graininess  of  the  photographic  emulsion  is 
a  source  of  noise  which  appears  as  irregular  fluc¬ 
tuations  of  the  intensity  curve.  Some  lines  are  so 
close  together  that  they  appear  blended  at  this  dis¬ 
persion. 

The  detailed  shape  of  a  spectral  line  is  called 
the  line  profile  (Sect.  5.3).  The  true  shape  of  the 
line  reflects  the  properties  of  the  stellar  atmo- 
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sphere,  but  the  observed  profile  is  also  spread  out 
by  the  measuring  instrument.  However,  the  total 
absorption  in  the  line,  usually  expressed  in  terms 
of  the  equivalent  width ,  is  less  sensitive  to  obser¬ 
vational  effects  (see  Fig.  5.6). 

The  equivalent  width  of  a  spectral  line  de¬ 
pends  on  how  many  atoms  in  the  atmosphere  are 
in  a  state  in  which  they  can  absorb  the  wave¬ 
length  in  question.  The  more  atoms  there  are,  the 
stronger  and  broader  the  spectral  line  is.  For  ex¬ 
ample,  a  typical  equivalent  width  of  a  metal  line 
(Fe)  in  the  solar  spectrum  is  about  10  pm.  Line 
widths  are  often  expressed  in  angstroms  (1  A  = 
10“ 10  m  =  0.1  nm). 

Only  in  weak  lines  the  equivalent  width  de¬ 
pends  linearly  on  the  number  of  absorbing  atoms. 
The  equivalent  width  as  a  function  of  the  amount 
of  absorbing  atoms  is  known  as  the  curve  of 
growth.  It  is,  however,  beyond  the  scope  of  this 
book. 

Line  profiles  are  also  broadened  by  the  Dopp¬ 
ler  effect.  In  stellar  atmospheres  there  are  motions 
of  small  and  large  scale,  like  thermal  motion  of 
the  atoms  and  convective  flows. 

The  chemical  composition  of  the  atmosphere 
can  be  determined  from  the  strengths  of  the  spec¬ 
tral  lines.  With  the  introduction  of  large  com¬ 
puters  it  has  become  feasible  to  construct  quite 
detailed  models  of  the  structure  of  stellar  atmo¬ 
spheres,  and  to  compute  the  emergent  spectrum 
for  a  given  model.  The  computed  synthetic  spec¬ 
trum  can  be  compared  with  the  observations  and 
the  theoretical  model  modified  until  a  good  fit 
is  obtained.  The  theoretical  models  then  give  the 
number  of  absorbing  atoms,  and  hence  the  ele¬ 
ment  abundances,  in  the  atmosphere.  The  con¬ 
struction  of  model  atmospheres  will  be  discussed 
in  Sect.  9.6. 


9.2  The  Harvard  Spectral 
Classification 

The  spectral  classification  scheme  in  present  use 
was  developed  at  Harvard  Observatory  in  the 
United  States  in  the  early  20th  century.  The  work 
was  begun  by  Henry  Draper  who  in  1872  took 
the  first  photograph  of  the  spectrum  of  Vega. 
Later  Draper’s  widow  donated  the  observing 


equipment  and  a  sum  of  money  to  Harvard  Ob¬ 
servatory  to  continue  the  work  of  classification. 

The  main  part  of  the  classification  was  done  by 
Annie  Jump  Cannon  using  objective  prism  spec¬ 
tra.  The  Henry  Draper  Catalogue  (HD)  was  pub¬ 
lished  in  1918-1924.  It  contains  225,000  stars  ex¬ 
tending  down  to  9  magnitudes.  Altogether  more 
than  390,000  stars  were  classified  at  Harvard. 

The  Harvard  classification  is  based  on  lines 
that  are  mainly  sensitive  to  the  stellar  tempera¬ 
ture,  rather  than  to  gravity  or  luminosity.  Impor¬ 
tant  lines  are  the  hydrogen  B aimer  lines,  the  lines 
of  neutral  helium,  the  iron  lines,  the  H  and  K  dou¬ 
blet  of  ionised  calcium  at  396.8  and  393.3  nm, 
the  G  band  due  to  the  CH  molecule  and  some 
metals  around  43 1  nm,  the  neutral  calcium  line  at 
422.7  nm  and  the  lines  of  titanium  oxide  (TiO). 

The  main  types  in  the  Harvard  classification 
are  denoted  by  capital  letters.  They  were  ini¬ 
tially  ordered  in  alphabetical  sequence,  but  subse¬ 
quently  it  was  noticed  that  they  could  be  ordered 
according  to  temperature.  With  the  temperature 
decreasing  towards  the  right  the  sequence  is 

C 

O-B- A-F-G-K-M-L-T . 

S 

Additional  notations  are  Q  for  novae,  P  for  plan¬ 
etary  nebulae  and  W  for  Wolf-Ray et  stars.  The 
class  C  consists  of  the  earlier  types  R  and  N. 
The  spectral  classes  C  and  S  represent  parallel 
branches  to  types  G-M,  differing  in  their  surface 
chemical  composition.  The  most  recent  addition 
are  the  spectral  classes  L  and  T  continuing  the 
sequence  beyond  M,  representing  brown  dwarfs. 
There  is  a  well-known  mnemonic  for  the  spectral 
classes,  but  due  to  its  chauvinistic  tone  we  refuse 
to  tell  it. 

The  spectral  classes  are  divided  into  sub¬ 
classes  denoted  by  the  numbers  0, . . . ,  9;  some¬ 
times  decimals  are  used,  e.g.  B  0.5  (Figs.  9.3 
and  9.4). 

Classes  at  the  beginning  of  the  sequence  are 
sometimes  called  early  classes  and  those  at  the 
end  of  the  sequence  late  classes.  This  is  no  way 
related  to  stellar  evolution;  it  only  reflects  the  po¬ 
sition  in  the  sequence  O-B-A-F-G-K-M. 
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Fig.  9.3  Spectra  of  early  and  late  spectral  type  stars  be¬ 
tween  375  and  390  nm.  (a)  The  upper  star  is  Vega,  of 
spectral  type  AO,  and  (b)  the  lower  one  is  Aldebaran,  of 

Spectra  of  brown  dwarfs  are  shown  in  Fig.  9.4 
and  compared  with  those  of  M  dwarfs. 

The  main  characteristics  of  the  different  classes 
are: 

O  Blue  stars,  surface  temperature  20, GOO- 
35, 000  K.  Spectrum  with  lines  from  multiply 
ionised  atoms,  e.g.  Hell,  CIII,  NIII,  OIII, 
Si  V.  He  I  visible,  H I  lines  weak. 

B  Blue- white  stars,  surface  temperature  about 
15,000  K.  Hell  lines  have  disappeared,  He  I 
(403  nm)  lines  are  strongest  at  B2,  then  get 
weaker  and  have  disappeared  at  type  B9.  The 
K  line  of  Call  becomes  visible  at  type  B3. 
H  I  lines  getting  stronger.  O II,  Si  II  and  Mg  II 
lines  visible. 

A  White  stars,  surface  temperature  about 
9000  K.  The  HI  lines  are  very  strong  at 
A0  and  dominate  the  whole  spectrum,  then 
get  weaker.  H  and  K  lines  of  Call  getting 
stronger.  He  I  no  longer  visible.  Neutral  metal 
lines  begin  to  appear. 

F  Yellow- white  stars,  surface  temperature  about 
7000  K.  HI  lines  getting  weaker,  H  and  K  of 
Call  getting  stronger.  Many  other  metal  lines, 
e.g.  Fel,  Fell,  CrII,  Till,  clear  and  getting 
stronger. 

G  Yellow  stars  like  the  Sun,  surface  tempera¬ 
ture  about  5500  K.  The  HI  lines  still  getting 
weaker,  H  and  K  lines  very  strong,  strongest 
at  GO.  Metal  lines  getting  stronger.  G  band 
clearly  visible.  CN  lines  seen  in  giant  stars. 

K  Orange-yellow  stars,  surface  temperature 
about  4000  K.  Spectrum  dominated  by  metal 
lines.  HI  lines  insignificant.  Cal  422.7  nm 
clearly  visible.  Strong  H  and  K  lines  and 
G  band.  TiO  bands  become  visible  at  K5. 


spectral  type  K5.  The  hydrogen  B  aimer  lines  are  strong  in 
the  spectrum  of  Vega;  in  that  of  Aldebaran,  there  are  many 
metal  lines.  (Lick  Observatory) 

M  Red  stars,  surface  temperature  about  3000  K. 
TiO  bands  getting  stronger.  Cal  422.7  nm 
very  strong.  Many  neutral  metal  lines. 

L  Brown  (actually  dark  red)  stars,  surface  tem¬ 
perature  about  2000  K.  The  TiO  and  VO 
bands  disappear  for  early  L  class.  Very  strong 
and  broad  lines  of  Na  I  and  K I. 

T  Brown  dwarfs,  surface  temperature  about 
1000  K.  Very  strong  molecular  absorption 
bands  of  CH4  and  H2O. 

C  Carbon  stars,  previously  R  and  N.  Very  red 
stars,  surface  temperature  about  3000  K. 
Strong  molecular  bands,  e.g.  C2,  CN  and  CH. 
No  TiO  bands.  Line  spectrum  like  in  the  types 
K  and  M. 

S  Red  low-temperature  stars  (about  3000  K). 
Very  clear  ZrO  bands.  Also  other  molecular 
bands,  e.g.  YO,  LaO  and  TiO. 

The  main  characteristics  of  the  classification 
scheme  can  be  seen  in  Fig.  9.5  showing  the  vari¬ 
ations  of  some  typical  absorption  lines  in  the  dif¬ 
ferent  spectral  classes.  Different  spectral  features 
are  mainly  due  to  different  effective  temperatures. 
Different  pressures  and  chemical  compositions  of 
stellar  atmospheres  are  not  very  important  factors 
in  the  spectral  classification,  except  in  some  pe¬ 
culiar  stars. 

The  early,  i.e.  hot,  spectral  classes  are  char¬ 
acterised  by  the  lines  of  ionised  atoms,  whereas 
the  cool,  or  late,  spectral  types  have  lines  of  neu¬ 
tral  atoms.  In  hot  stars  molecules  dissociate  into 
atoms;  thus  the  absorption  bands  of  molecules  ap¬ 
pear  only  in  the  spectra  of  cool  stars  of  late  spec¬ 
tral  types. 

To  see  how  the  strengths  of  the  spectral  lines 
are  determined  by  the  temperature,  we  consider, 
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Fig.  9.4  (a)  Intensity  curves  for  various  spectral  classes 
showing  characteristic  spectral  features.  The  name  of  the 
star  and  its  spectral  and  luminosity  class  are  given  next 
to  each  curve,  and  the  most  important  spectral  features 
are  identified.  (Drawing  by  J.  Dufay.)  (b)  Optical  spec- 

for  example,  the  neutral  helium  lines  at  402.6  nm 
and  447.2  nm.  These  are  only  seen  in  the  spec¬ 
tra  of  hot  stars.  The  reason  for  this  is  that  the 
lines  are  due  to  absorption  by  excited  atoms, 


tra  of  M  stars  and  brown  dwarfs.  In  an  approximate  sense, 
the  brown  dwarfs  continue  the  spectral  sequence  towards 
lower  temperatures,  although  in  many  respects  they  dif¬ 
fer  from  the  earlier  spectral  types.  (J.D.  Kirkpatrick  2005, 
ARAA  43,  205) 

and  that  a  high  temperature  is  required  to  pro¬ 
duce  any  appreciable  excitation.  As  the  stellar 
temperature  increases,  more  atoms  are  in  the  re¬ 
quired  excited  state,  and  the  strength  of  the  he- 
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Fig.  9.5  Equivalent 
widths  of  some  important 
spectral  lines  in  the  various 
spectral  classes.  [Struve,  O. 
(1959):  Elementary 
Astronomy  (Oxford 
University  Press,  New 
York)  p.  259] 
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lium  lines  increases.  When  the  temperature  be¬ 
comes  even  higher,  helium  begins  to  be  ionised, 
and  the  strength  of  the  neutral  helium  lines  be¬ 
gins  to  decrease.  In  a  similar  way  one  can  under¬ 
stand  the  variation  with  temperature  of  other  im¬ 
portant  lines,  such  as  the  calcium  H  and  K  lines. 
These  lines  are  due  to  singly  ionised  calcium,  and 
the  temperature  must  be  just  right  to  remove  one 
electron  but  no  more. 

The  hydrogen  B aimer  lines  H^,  H y  and  are 
strongest  in  the  spectral  class  A2.  These  lines  cor¬ 
respond  to  transitions  to  the  level  the  principal 
quantum  number  of  which  is  n  =  2.  If  the  tem¬ 
perature  is  too  high  the  hydrogen  is  ionised  and 
such  transitions  are  not  possible. 


9.3  The  Yerkes  Spectral 
Classification 

The  Harvard  classification  only  takes  into  ac¬ 
count  the  effect  of  the  temperature  on  the  spec¬ 
trum.  For  a  more  precise  classification,  one  also 
has  to  take  into  account  the  luminosity  of  the  star, 
since  two  stars  with  the  same  effective  tempera¬ 
ture  may  have  widely  different  luminosities. 

A  two-dimensional  system  of  spectral  classi¬ 
fication  was  introduced  by  William  W.  Morgan , 
Philip  C.  Keenan  and  Edith  Kellman  of  Yerkes 
Observatory.  This  system  is  known  as  the  MKK 
or  Yerkes  classification.  (The  MK  classification  is 
a  modified,  later  version.)  The  MKK  classifica¬ 
tion  is  based  on  the  visual  scrutiny  of  slit  spec¬ 
tra  with  a  dispersion  of  11.5  nm/mm.  It  is  care¬ 
fully  defined  on  the  basis  of  standard  stars  and  the 


specification  of  luminosity  criteria.  Six  different 
luminosity  classes  are  distinguished: 

-  Ia  most  luminous  supergiants, 

-  Ib  less  luminous  supergiants, 

-  II  luminous  giants, 

-  Ill  normal  giants, 

-  IV  subgiants, 

-  V  main  sequence  stars  (dwarfs). 

The  luminosity  class  is  denoted  by  a  Roman 
numeral  after  the  spectral  class.  For  example,  the 
class  of  the  Sun  is  G2  V. 

The  luminosity  class  is  determined  from  spec¬ 
tral  lines  that  depend  strongly  on  the  stellar  sur¬ 
face  gravity,  which  is  closely  related  to  the  lu¬ 
minosity.  The  masses  of  giants  and  dwarfs  are 
roughly  similar,  but  the  radii  of  giants  are  much 
larger  than  those  of  dwarfs.  Therefore  the  grav¬ 
itational  acceleration  g  =  GM/R2  at  the  surface 
of  a  giant  is  much  smaller  than  for  a  dwarf.  In 
consequence,  the  gas  density  and  pressure  in  the 
atmosphere  of  a  giant  is  much  smaller.  This  gives 
rise  to  luminosity  effects  in  the  stellar  spectrum, 
which  can  be  used  to  distinguish  between  stars  of 
different  luminosities. 

1 .  For  spectral  types  B-F,  the  lines  of  neutral  hy¬ 
drogen  are  deeper  and  narrower  for  stars  of 
higher  luminosities.  The  reason  for  this  is  that 
the  metal  ions  give  rise  to  a  fluctuating  elec¬ 
tric  field  near  the  hydrogen  atoms.  This  field 
leads  to  shifts  in  the  hydrogen  energy  levels 
(the  Stark  effect),  appearing  as  a  broadening 
of  the  lines.  The  effect  becomes  stronger  as 
the  density  increases.  Thus  the  hydrogen  lines 
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Fig.  9.6  Luminosity 
effects  in  the  hydrogen  Hy 
line  in  A  stars.  The  vertical 
axis  gives  the  normalised 
intensity.  HD  223385 
0 upper  left )  is  an 
A2  supergiant,  where  the 
line  is  very  weak, 

6  Aurigae  A  is  a  giant  star 
and  a 2  Geminorum  is 
a  main  sequence  star, 
where  the  line  is  very 
broad.  [Aller,  L.H.  (1953): 
Astrophysics.  The 
Atmospheres  of  the  Sun 
and  Stars  (The  Ronald 
Press  Company,  New  York) 
p.  318] 


0 


0 


are  narrow  in  absolutely  bright  stars,  and  be¬ 
come  broader  in  main  sequence  stars  and  even 
more  so  in  white  dwarfs  (Fig.  9.6). 

2.  The  lines  from  ionised  elements  are  relatively 
stronger  in  high-luminosity  stars.  This  is  be¬ 
cause  the  higher  density  makes  it  easier  for 
electrons  and  ions  to  recombine  to  neutral 
atoms.  On  the  other  hand,  the  rate  of  ionisa¬ 
tion  is  essentially  determined  by  the  radiation 
field,  and  is  not  appreciably  affected  by  the  gas 
density.  Thus  a  given  radiation  field  can  main¬ 
tain  a  higher  degree  of  ionisation  in  stars  with 
more  extended  atmospheres.  For  example,  in 
the  spectral  classes  F-G,  the  relative  strengths 
of  the  ionised  strontium  (Sr  II)  and  neutral  iron 
(Fe  I)  lines  can  be  used  as  a  luminosity  indica¬ 


tor.  Both  lines  depend  on  the  temperature  in 
roughly  the  same  way,  but  the  Sr  II  lines  be¬ 
come  relatively  much  stronger  as  the  luminos¬ 
ity  increases. 

3.  Giant  stars  are  redder  than  dwarfs  of  the  same 
spectral  type.  The  spectral  type  is  determined 
from  the  strengths  of  spectral  lines,  including 
ion  lines.  Since  these  are  stronger  in  giants, 
a  giant  will  be  cooler,  and  thus  also  redder, 
than  a  dwarf  of  the  same  spectral  type. 

4.  There  is  a  strong  cyanogen  (CN)  absorption 
band  in  the  spectra  of  giant  stars,  which  is  al¬ 
most  totally  absent  in  dwarfs.  This  is  partly 
a  temperature  effect,  since  the  cooler  atmo¬ 
spheres  of  giants  are  more  suitable  for  the  for¬ 
mation  of  cyanogen. 
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Fig.  9.7  Peculiar  spectra,  (a)  R  Geminorum  {above)  is  an 
emission  line  star,  with  bright  emission  lines,  indicated  by 
arrows,  in  its  spectrum;  (b)  the  spectrum  of  a  normal  star 


9.4  Peculiar  Spectra 

The  spectra  of  some  stars  differ  from  what  one 
would  expect  on  the  basis  of  their  temperature 
and  luminosity  (see,  e.g.,  Fig.  9.7).  Such  stars  are 
celled  peculiar.  The  most  common  peculiar  spec¬ 
tral  types  will  now  be  considered. 

The  Wolf-Ray et  stars  are  very  hot  stars;  the 
first  examples  were  discovered  by  Charles  Wolf 
and  Georges  Rayet  in  1867.  The  spectra  of  Wolf- 
Rayet  stars  have  broad  emission  lines  of  hydro¬ 
gen  and  ionised  helium,  carbon,  nitrogen  and 
oxygen.  There  are  hardly  any  absorption  lines. 
The  Wolf-Rayet  stars  are  thought  to  be  very 
massive  stars  that  have  lost  their  outer  layers  in 
a  strong  stellar  wind.  This  has  exposed  the  stellar 
interior,  which  gives  rise  to  a  different  spectrum 
than  the  normal  outer  layers.  Many  Wolf-Rayet 
stars  are  members  of  binary  systems. 

In  some  O  and  B  stars  the  hydrogen  absorp¬ 
tion  lines  have  weak  emission  components  ei¬ 
ther  at  the  line  centre  or  in  its  wings.  These  stars 
are  called  Be  and  shell  stars  (the  letter  e  after 
the  spectral  type  indicates  that  there  are  emis¬ 
sion  lines  in  the  spectrum).  The  emission  lines 
are  formed  in  a  rotationally  flattened  gas  shell 
around  the  star.  The  shell  and  Be  stars  show  ir¬ 
regular  variations,  apparently  related  to  structural 
changes  in  the  shell.  About  15  %  of  all  O  and 
B  stars  have  emission  lines  in  their  spectra. 


is  compared  with  one  in  which  the  zirconium  lines  are  un¬ 
usually  strong.  (Mt.  Wilson  Observatory  and  Helsinki  Ob¬ 
servatory) 

The  strongest  emission  lines  are  those  of  the 
P  Cygni  stars,  which  have  one  or  more  sharp  ab¬ 
sorption  lines  on  the  short  wavelength  side  of 
the  emission  line.  It  is  thought  that  the  lines 
are  formed  in  a  thick  expanding  envelope.  The 
P  Cygni  stars  are  often  variable.  For  example, 
P  Cygni  itself  has  varied  between  three  and  six 
magnitudes  during  the  past  centuries.  At  present 
its  magnitude  is  about  5. 

The  peculiar  A  stars  or  Ap  stars  (p  =  peculiar) 
are  usually  strongly  magnetic  stars,  where  the 
lines  are  split  into  several  components  by  the 
Zeeman  effect.  The  lines  of  certain  elements, 
such  as  magnesium,  silicon,  europium,  chromium 
and  strontium,  are  exceptionally  strong  in  the  Ap 
stars.  Lines  of  rarer  elements  such  as  mercury, 
gallium  or  krypton  may  also  be  present.  Other¬ 
wise,  the  Ap  stars  are  like  normal  main  sequence 
stars. 

The  Am  stars  (m  =  metallic)  also  have  anoma¬ 
lous  element  abundances,  but  not  to  the  same  ex¬ 
tent  as  the  Ap  stars.  The  lines  of  e.g.  the  rare 
earths  and  the  heaviest  elements  are  strong  in 
their  spectra;  those  of  calcium  and  scandium  are 
weak. 

We  have  already  mentioned  the  S  and  C  stars, 
which  are  special  classes  of  K  and  M  giants  with 
anomalous  element  abundances.  In  the  S  stars, 
the  normal  lines  of  titanium,  scandium  and  vana¬ 
dium  oxide  are  replaced  with  oxides  of  heavier 
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Fig.  9.8  The  Hertzsprung-Russell  diagram.  The  horizon¬ 
tal  coordinate  can  be  either  the  colour  index  B  —  V,  ob¬ 
tained  directly  from  observations,  or  the  spectral  class.  In 
theoretical  studies  the  effective  temperature  Tt  is  com¬ 
monly  used.  These  correspond  to  each  other  but  the  depen¬ 
dence  varies  somewhat  with  luminosity.  The  vertical  axis 
gives  the  absolute  magnitude.  In  a  (lg(L/L©),  lgTe)  plot 
the  curves  of  constant  radius  are  straight  lines.  The  dens- 

elements,  zirconium,  yttrium  and  barium.  A  large 
fraction  of  the  S  stars  are  irregular  variables.  The 
name  of  the  C  stars  refers  to  carbon.  The  metal 
oxide  lines  are  almost  completely  absent  in  their 
spectra;  instead,  various  carbon  compounds  (CN, 
C2,  CH)  are  strong.  The  abundance  of  carbon  rel¬ 
ative  to  oxygen  is  4-5  times  greater  in  the  C  stars 
than  in  normal  stars.  The  C  stars  are  divided  into 
two  groups,  hotter  R  stars  and  cooler  N  stars. 

Another  type  of  giant  stars  with  abundance 
anomalies  are  the  barium  stars.  The  lines  of  bar¬ 
ium,  strontium,  rare  earths  and  some  carbon  com¬ 
pounds  are  strong  in  their  spectra.  Apparently  nu¬ 
clear  reaction  products  have  been  mixed  up  to  the 
surface  in  these  stars. 


est  areas  are  the  main  sequence  and  the  horizontal,  red  gi¬ 
ant  and  asymptotic  branches  consisting  of  giant  stars.  The 
supergiants  are  scattered  above  the  giants.  To  the  lower 
left  are  some  white  dwarfs  about  10  magnitudes  below  the 
main  sequence.  The  apparently  brightest  stars  (m  <  4)  are 
marked  with  crosses  and  the  nearest  stars  (r  <  50  ly)  with 
dots.  The  data  are  from  the  Hipparcos  catalogue 


9.5  The  Hertzsprung-Russell 
Diagram 

Around  1910,  Ejnar  Hertzsprung  and  Henry  Nor¬ 
ris  Russell  studied  the  relation  between  the  abso¬ 
lute  magnitudes  and  the  spectral  types  of  stars. 
The  diagram  showing  these  two  variables  is  now 
known  as  the  Hertzsprung-Russell  diagram  or 
simply  the  HR  diagram  (Fig.  9.8).  It  has  turned 
out  to  be  an  important  aid  in  studies  of  stellar  evo¬ 
lution. 

In  view  of  the  fact  that  stellar  radii,  luminosi¬ 
ties  and  surface  temperatures  vary  widely,  one 
might  have  expected  the  stars  to  be  uniformly  dis¬ 
tributed  in  the  HR  diagram.  However,  it  is  found 
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that  most  stars  are  located  along  a  roughly  diag¬ 
onal  curve  called  the  main  sequence.  The  Sun  is 
situated  about  the  middle  of  the  main  sequence. 

The  HR  diagram  also  shows  that  the  yellow 
and  red  stars  (spectral  types  G-K-M)  are  clus¬ 
tered  into  two  clearly  separate  groups:  the  main 
sequence  of  dwarf  stars  and  the  giants.  The  gi¬ 
ant  stars  fall  into  several  distinct  groups.  The 
horizontal  branch  is  an  almost  horizontal  se¬ 
quence,  about  absolute  visual  magnitude  zero. 
The  red  giant  branch  rises  almost  vertically  from 
the  main  sequence  at  spectral  types  K  and  M  in 
the  HR  diagram.  Finally,  the  asymptotic  branch 
rises  from  the  horizontal  branch  and  approaches 
the  bright  end  of  the  red  giant  branch.  These  var¬ 
ious  branches  represent  different  phases  of  stellar 
evolution  (cf.  Sects.  12.3  and  12.4):  dense  areas 
correspond  to  evolutionary  stages  in  which  stars 
stay  a  long  time. 

A  typical  horizontal  branch  giant  is  about 
a  hundred  times  brighter  than  the  Sun.  Since  gi¬ 
ants  and  dwarfs  of  the  same  spectral  class  have 
nearly  the  same  surface  temperature,  the  differ¬ 
ence  in  luminosity  must  be  due  to  a  difference  in 
radius  according  to  (5.21).  For  example  Arcturus, 
which  is  one  of  the  brightest  stars  in  the  sky,  has 
a  radius  about  thirty  times  that  of  the  Sun. 

The  brightest  red  giants  are  the  supergiants 
with  magnitudes  up  to  My  =  —  7.  One  example 
is  Betelgeuze.  in  Orion,  with  a  radius  of  400  so¬ 
lar  radii  and  20,000  times  more  luminous  than  the 
Sun. 

About  10  magnitudes  below  the  main  se¬ 
quence  are  the  white  dwarfs.  They  are  quite  nu¬ 
merous  in  space,  but  faint  and  difficult  to  find. 
The  best-known  example  is  Sirius  B,  the  com¬ 
panion  of  Sirius. 

There  are  some  stars  in  the  HR  diagram 
which  are  located  below  the  giant  branch,  but 
still  clearly  above  the  main  sequence.  These  are 
known  as  subgiants.  Similarly,  there  are  stars 
below  the  main  sequence,  but  brighter  than  the 
white  dwarfs,  known  as  subdwarfs. 

When  interpreting  the  HR  diagram,  one  has 
to  take  into  account  selection  effects:  absolutely 
bright  stars  are  more  likely  to  be  included  in  the 
sample,  since  they  can  be  discovered  at  greater 
distances.  If  only  stars  within  a  certain  distance 


from  the  Sun  are  included,  the  distribution  of 
stars  in  the  HR  diagram  looks  quite  different.  This 
can  be  seen  in  Fig.  9.8:  there  are  no  giant  or  bright 
main  sequence  stars  among  these. 

The  HR  diagrams  of  star  clusters  are  particu¬ 
larly  important  for  the  theory  of  stellar  evolution. 
They  will  be  discussed  in  Chap.  17. 


9.6  Model  Atmospheres 

The  stellar  atmosphere  consists  of  those  layers  of 
the  star  where  the  radiation  that  is  transmitted  di¬ 
rectly  to  the  observer  originates.  Thus  in  order  to 
interpret  stellar  spectra,  one  needs  to  be  able  to 
compute  the  structure  of  the  atmosphere  and  the 
emerging  radiation. 

In  actual  stars  there  are  many  factors,  such  as 
rotation  and  magnetic  fields,  that  complicate  the 
problem  of  computing  the  structure  of  the  atmo¬ 
sphere.  We  shall  only  consider  the  classical  prob¬ 
lem  of  finding  the  structure,  i.e.  the  distribution  of 
pressure  and  temperature  with  depth,  in  a  static, 
unmagnetised  atmosphere.  In  that  case  a  model 
atmosphere  is  completely  specified  by  giving  the 
chemical  composition,  the  gravitational  acceler¬ 
ation  at  the  surface,  g,  and  the  energy  flux  from 
the  stellar  interior,  or  equivalently,  the  effective 
temperature  Te. 

The  basic  principles  involved  in  computing 
a  model  stellar  atmosphere  are  the  same  as  for 
stellar  interiors  and  will  be  discussed  in  Chap.  11. 
Essentially,  there  are  two  differential  equations 
to  be  solved:  the  equation  of  hydrostatic  equilib¬ 
rium,  which  fixes  the  distribution  of  pressure,  and 
an  equation  of  energy  transport,  which  will  have 
a  different  form  depending  on  whether  the  atmo¬ 
sphere  is  radiative  or  convective,  and  which  de¬ 
termines  the  temperature  distribution. 

The  values  of  the  various  physical  quantities 
in  an  atmosphere  are  usually  given  as  functions  of 
some  suitably  defined  continuum  optical  depth  r . 
Thus  pressure,  temperature,  density,  ionisation 
and  the  population  numbers  of  various  energy 
levels  can  all  be  obtained  as  functions  of  r .  When 
these  are  known,  the  intensity  of  radiation  emerg¬ 
ing  from  the  atmosphere  can  be  computed.  In 
Box  9.1,  it  is  shown  that  approximately  the  emer¬ 
gent  spectrum  originates  at  unit  optical  depth, 
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measured  along  each  light  ray.  On  this  basis,  one 
can  predict  whether  a  given  spectral  line  will  be 
present  in  the  spectrum. 

Consider  a  spectral  line  formed  when  an  atom 
(or  ion)  in  a  given  energy  state  absorbs  a  photon. 
From  the  model  atmosphere,  the  occupation  num¬ 
ber  of  the  absorbing  level  is  known  as  a  function 
of  the  (continuum)  optical  depth  r.  If  now  there 
is  a  layer  above  the  depth  r  =  1  where  the  ab¬ 
sorbing  level  has  a  high  occupancy,  the  optical 
depth  in  the  line  will  become  unity  before  r  =  1 , 
i.e.  the  radiation  in  the  line  will  originate  higher 
in  the  atmosphere.  Because  the  temperature  in¬ 
creases  inwards,  the  intensity  in  the  line  will  cor¬ 
respond  to  a  lower  temperature,  and  the  line  will 
appear  dark.  On  the  other  hand,  if  the  absorbing 
level  is  unoccupied,  the  optical  depth  at  the  line 
frequency  will  be  the  same  as  the  continuum  op¬ 
tical  depth.  The  radiation  at  the  line  frequency 
will  then  come  from  the  same  depth  as  the  ad¬ 
jacent  continuum,  and  no  absorption  line  will  be 
formed. 

The  expression  for  the  intensity  derived  in 
Box  9.1  also  explains  the  phenomenon  of  limb 
darkening  seen  in  the  Sun  (Sect.  12.2).  The  radia¬ 
tion  that  reaches  us  from  near  the  edge  of  the  solar 
disk  emerges  at  a  very  oblique  angle  (0  near  90°), 
i.e.  cos  0  is  small.  Thus  this  radiation  originates  at 
small  values  of  r,  and  hence  at  low  temperatures. 
In  consequence,  the  intensity  coming  from  near 
the  edge  will  be  lower,  and  the  solar  disk  will  ap¬ 
pear  darker  towards  the  limb.  The  amount  of  limb 
darkening  also  gives  an  empirical  way  of  deter¬ 
mining  the  temperature  distribution  in  the  solar 
atmosphere. 

Our  presentation  of  stellar  atmospheres  has 
been  highly  simplified.  In  practice,  the  spectrum 
is  computed  numerically  for  a  range  of  parameter 
values.  The  values  of  TQ  and  element  abundances 
for  various  stars  can  then  be  found  by  comparing 
the  observed  line  strengths  and  other  spectral  fea¬ 
tures  with  the  theoretical  ones.  We  shall  not  go 
into  details  on  the  procedures  used. 


9.7  What  Do  the  Observations  Tell 
Us? 

To  conclude  this  chapter,  we  shall  give  a  sum¬ 
mary  of  the  properties  of  stars  revealed  by  the  ob¬ 


servations.  At  the  end  of  the  book,  there  are  tables 
of  the  brightest  and  of  the  nearest  stars. 

Of  the  brightest  stars,  four  have  a  negative 
magnitude.  Some  of  the  apparently  bright  stars 
are  absolutely  bright  supergiants,  others  are  sim¬ 
ply  nearby. 

In  the  list  of  the  nearest  stars,  the  dominance  of 
faint  dwarf  stars,  already  apparent  in  the  HR  dia¬ 
gram,  is  worth  noting.  Most  of  these  belong  to  the 
spectral  types  K  and  M.  Some  nearby  stars  also 
have  very  faint  companions  with  masses  about 
that  of  Jupiter,  i.e.  planets.  They  have  not  been 
included  in  the  table. 

Stellar  spectroscopy  offers  an  important  way 
of  determining  fundamental  stellar  parameters,  in 
particular  mass  and  radius.  However,  the  spec¬ 
tral  information  needs  to  be  calibrated  by  means 
of  direct  measurements  of  these  quantities.  These 
will  be  considered  next. 

The  masses  of  stars  can  be  determined  in  the 
case  of  double  stars  orbiting  each  other.  (The  de¬ 
tails  of  the  method  will  be  discussed  in  Chap.  9.) 
These  observations  have  shown  that  the  larger 
the  mass  of  a  main  sequence  star  becomes,  the 
higher  on  the  main  sequence  it  is  located.  One 
thus  obtains  an  empirical  mass-luminosity  rela¬ 
tion,  which  can  be  used  to  estimate  stellar  masses 
on  the  basis  of  the  spectral  type. 

The  observed  relation  between  mass  and  lu¬ 
minosity  is  shown  in  Fig.  9.9.  The  luminosity  is 
roughly  proportional  to  the  power  3.8  of  the  mass: 

L  oc  M3'8.  (9.1) 

The  relations  is  only  approximate.  According  to 
it,  a  ten  solar  mass  star  is  about  6300  times 
brighter  than  the  Sun,  corresponding  to  9.5  mag¬ 
nitudes. 

The  smallest  observed  stellar  masses  are  about 
1  /20  of  the  solar  mass,  corresponding  to  stars  in 
the  lower  right-hand  part  of  the  HR  diagram.  The 
masses  of  white  dwarfs  are  less  than  one  solar 
mass.  The  masses  of  the  most  massive  main  se¬ 
quence  and  supergiant  stars  are  between  10  and 
possibly  even  150 M©. 

Direct  interferometric  measurements  of  stel¬ 
lar  angular  diameters  have  been  made  for  only 
a  few  dozen  stars.  When  the  distances  are  known, 
these  immediately  yield  the  value  of  the  radius.  In 
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eclipsing  binaries,  the  radius  can  also  be  directly 
measured  (see  Sect.  10.4).  Altogether,  close  to 
a  hundred  stellar  radii  are  known  from  direct 
measurements.  In  other  cases,  the  radius  must  be 
estimated  from  the  absolute  luminosity  and  effec¬ 
tive  temperature. 

In  discussing  stellar  radii,  it  is  convenient  to 
use  a  version  of  the  HR  diagram  with  lg  TQ  on  the 
horizontal  and  Mb0i  or  lg(L/L©)  on  the  vertical 
axis.  If  the  value  of  the  radius  R  is  fixed,  then 
(5.21)  yields  a  linear  relation  between  the  bolo- 
metric  magnitude  and  1  gTe.  Thus  lines  of  con¬ 
stant  radius  in  the  HR  diagram  are  straight.  Lines 
corresponding  to  various  values  of  the  radius  are 
shown  in  Fig.  9.8.  The  smallest  stars  are  the  white 
dwarfs  with  radii  of  about  one  per  cent  of  the  so- 


log  M/M . 

Fig.  9.9  Mass-luminosity  relation.  The  picture  is  based 
on  binaries  with  known  masses.  Different  symbols  refer 
to  different  kinds  of  binaries.  (From  Bohm-Vitense:  In¬ 
troduction  to  Stellar  Astrophysics ,  Cambridge  University 
Press  (1989-1992)) 


lar  radius,  whereas  the  largest  supergiants  have 
radii  several  thousand  times  larger  than  the  Sun. 
Not  included  in  the  figure  are  the  compact  stars 
(neutron  stars  and  black  holes)  with  typical  radii 
of  a  few  tens  of  kilometres. 

Since  the  stellar  radii  vary  so  widely,  so  do 
the  densities  of  stars.  The  density  of  giant  stars 
may  be  only  10-4  kg/m3,  whereas  the  density  of 
white  dwarfs  is  about  109  kg/m3. 

The  range  of  values  for  stellar  effective  tem¬ 
peratures  and  luminosities  can  be  immediately 
seen  in  the  HR  diagram.  The  range  of  effective 
temperature  is  2000-40,000  K,  and  that  of  lumi¬ 
nosity  10_4-106  L©. 

The  rotation  of  stars  appears  as  a  broadening 
of  the  spectral  lines.  One  edge  of  the  stellar  disk 
is  approaching  us,  the  other  edge  is  receding,  and 
the  radiation  from  the  edges  is  Doppler  shifted 
accordingly.  The  rotational  velocity  observed  in 
this  way  is  only  the  component  along  the  line  of 
sight.  The  true  velocity  is  obtained  by  dividing 
with  sin  i ,  where  i  is  the  angle  between  the  line 
of  sight  and  the  rotational  axis.  A  star  seen  from 
the  direction  of  the  pole  will  show  no  rotation. 

Assuming  the  axes  of  rotation  to  be  randomly 
oriented,  the  distribution  of  rotational  velocities 
can  be  statistically  estimated.  The  hottest  stars 
appear  to  rotate  faster  than  the  cooler  ones.  The 
rotational  velocity  at  the  equator  varies  from 
200-250  km/s  for  O  and  B  stars  to  about  20  km/s 
for  spectral  type  G.  In  shell  stars,  the  rotational 
velocity  may  reach  500  km/s. 

The  chemical  composition  of  the  outer  layers 
is  deduced  from  the  strengths  of  the  spectral  lines. 
About  three-fourths  of  the  stellar  mass  is  hy¬ 
drogen.  Helium  comprises  about  one-fourth,  and 
the  abundance  of  other  elements  is  very  small. 
The  abundance  of  heavy  elements  in  young  stars 
(about  2  %)  is  much  larger  than  in  old  ones, 
where  it  is  less  than  0.02  %. 


Box  9.1  (The  Intensity  Emerging  from  a  Stellar 
Atmosphere)  The  intensity  of  radiation  emerg¬ 
ing  from  an  atmosphere  is  given  by  the  expres¬ 
sion  (5.45),  i.e. 

poo 

Iv(0,0)  =  /  Sv (tv )  e_ Ty  sec ^  sec  0  dxv . 

Jo 

(9.2) 
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If  a  model  atmosphere  has  been  computed,  the 
source  function  Sv  is  known. 

An  approximate  formula  for  the  intensity 
can  be  derived  as  follows.  Let  us  expand  the 
source  function  as  a  Taylor  series  about  some 
arbitrary  point  r*,  thus 

Sy  =  Sy(z^  +  (^Cy  ~  T*)  Sy  (t*)  +  *  *  * 

where  the  dash  denotes  a  derivative.  With  this 
expression,  the  integral  in  (9.2)  can  be  evalu¬ 
ated,  yielding 

/v(0, 6)  =  Sv( r*)  +  ( cose  -  t*)S'v(t*)  +  •  •  •  . 

If  we  now  choose  r*  =  cos0,  the  second 
term  will  vanish.  In  local  thermodynamic 


equilibrium  the  source  function  will  be  the 
Planck  function  BV(T).  We  thus  obtain  the 
Eddington-Barbier  approximation 

IV(O,0)  =  Bv(T[tv  =cos0]). 

According  to  this  expression,  the  radiation 
emerging  in  a  given  direction  originated  at  unit 
optical  depth  along  that  direction. 


9.8  Exercise 

Exercise  9.1  Arrange  the  below  spectra  in  the 
order  of  decreasing  temperature. 
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Quite  often,  two  stars  may  appear  to  be  close  to¬ 
gether  in  the  sky,  although  they  are  really  at  very 
different  distances.  Such  chance  pairs  are  called 
optical  binary  stars.  However,  many  close  pairs 
of  stars  really  are  at  the  same  distance  and  form 
a  physical  system  in  which  two  stars  are  orbit¬ 
ing  around  each  other.  Less  than  half  of  all  stars 
are  single  stars  like  the  Sun.  More  than  50  %  be¬ 
long  to  systems  containing  two  or  more  members. 
In  general,  the  multiple  systems  have  a  hierarchi¬ 
cal  structure:  a  star  and  a  binary  orbiting  around 
each  other  in  triple  systems,  two  binaries  orbit¬ 
ing  around  each  other  in  quadruple  systems.  Thus 
most  multiple  systems  can  be  described  as  bina¬ 
ries  with  several  levels. 

Binaries  are  classified  on  the  basis  of  the 
method  of  their  discovery.  This  classification  has 
nothing  to  do  with  the  physical  properties  of  the 
stars.  Visual  binaries  can  be  seen  as  two  sepa¬ 
rate  components,  i.e.  the  separation  between  the 
stars  is  larger  than  about  0.1  arc  seconds.  The  rel¬ 
ative  position  of  the  components  changes  over  the 
years  as  they  move  in  their  orbits  (Fig.  10.1).  In 
astrometric  binary  stars  only  one  component  is 
seen,  but  its  variable  proper  motion  shows  that 
a  second  invisible  component  must  be  present. 
The  spectroscopic  binary  stars  are  discovered  on 
the  basis  of  their  spectra.  Either  two  sets  of  spec¬ 
tral  lines  are  seen  or  else  the  Doppler  shift  of  the 
observed  lines  varies  periodically,  indicating  an 
invisible  companion.  The  fourth  class  of  bina¬ 
ries  are  the  photometric  binary  stars  or  eclips¬ 
ing  variables.  In  these  systems  the  components 
of  the  pair  regularly  pass  in  front  of  each  other, 


causing  a  change  in  the  total  apparent  magni¬ 
tude. 

Binary  stars  can  also  be  classified  on  the  ba¬ 
sis  of  their  mutual  separation.  In  distant  binaries 
the  separation  between  the  components  is  tens  or 
hundreds  of  astronomical  units  and  their  orbital 
periods  are  from  tens  to  thousands  of  years.  In 
close  binaries  the  separation  is  from  about  one 
au  down  to  the  radius  of  the  stars.  The  orbital  pe¬ 
riod  ranges  from  a  few  hours  to  a  few  years.  The 
components  of  contact  binaries  are  so  close  that 
they  are  touching  each  other. 

The  stars  in  a  binary  system  move  in  an  ellipti¬ 
cal  orbit  around  the  centre  of  mass  of  the  system. 
In  Chap.  6  it  was  shown  that  the  relative  orbit,  too, 
is  an  ellipse,  and  thus  the  observations  are  often 
described  as  if  one  component  remained  station¬ 
ary  and  the  other  orbited  around  it. 


10.1  Visual  Binaries 

We  consider  a  visual  binary,  assuming  initially 
that  the  brighter  primary  component  is  station¬ 
ary  and  the  fainter  secondary  component  is  or¬ 
biting  around  it.  The  angular  separation  of  the 
stars  and  the  angular  direction  to  the  secondary 
can  be  directly  observed.  Making  use  of  observa¬ 
tions  extending  over  many  years  or  decades,  the 
relative  orbit  of  the  secondary  can  be  determined. 
The  first  binary  orbit  to  be  determined  was  that  of 
§  UMa  in  1830  (Fig.  10.2). 

The  observations  of  visual  binaries  only  give 
the  projection  of  the  relative  orbital  ellipse  on  the 
plane  of  the  sky.  The  shape  and  position  of  the 
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Fig.  10.1  When  a  visual 
binary  is  followed  for 
a  long  time,  the 
components  can  be  seen  to 
move  with  respect  to  each 
other.  Picture  of  Kruger  60. 
(Yerkes  Observatory) 
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Fig.  10.2  In  1830  the  orbit  of  f  Ursae  Majoris  was  the 
first  binary  orbit  determined  observationally 


Fig.  10.3  The  components  of  a  binary  system  move 
around  their  common  centre  of  mass.  A\,  A2  denote  the 
positions  of  the  stars  at  a  given  time  A,  and  similarly  for 
B  and  C 


true  orbit  are  not  known.  However,  they  can  be 
calculated  if  one  makes  use  of  the  fact  that  the 
primary  should  be  located  at  a  focal  point  of  the 
relative  orbit.  The  deviation  of  the  projected  po¬ 
sition  of  the  primary  from  the  focus  of  the  pro¬ 
jected  relative  orbit  allows  one  to  determine  the 
orientation  of  the  true  orbit. 

The  absolute  size  of  the  orbit  can  only  be 
found  if  the  distance  of  the  binary  is  known. 
Knowing  this,  the  total  mass  of  the  system  can 
be  calculated  from  Kepler’s  third  law. 

The  masses  of  the  individual  components 
can  be  determined  by  observing  the  motions  of 
both  components  relative  to  the  centre  of  mass 
(Fig.  10.3).  Let  the  semimajor  axes  of  the  or¬ 
bital  ellipses  of  the  primary  and  the  secondary 
be  a\  and  <22-  Then,  according  to  the  definition  of 
the  centre  of  mass, 


<22  m  1 


where  mi  and  m2  are  the  component  masses.  The 
semimajor  axis  of  the  relative  orbit  is 

a  =<2i+<22.  (10.2) 

For  example,  the  masses  of  the  components  of 
§  UMa  have  been  found  to  be  1.3  and  1.0  solar 
masses. 


1 0.2  Astrometric  Binary  Stars 

In  astrometric  binaries,  only  the  orbit  of  the 
brighter  component  about  the  centre  of  mass  can 
be  observed.  If  the  mass  of  the  visible  component 
is  estimated,  e.g.  from  its  luminosity,  the  mass  of 
the  invisible  companion  can  also  be  estimated. 

The  first  astrometric  binary  was  Sirius,  which 
in  the  1830’s  was  observed  to  have  an  undulat¬ 
ing  proper  motion.  It  was  concluded  that  it  had 
a  small  companion,  which  was  visually  discov¬ 
ered  a  few  decades  later  (Figs.  10.4  and  15.1). 
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The  companion,  Sirius  B,  was  a  completely  new 
type  of  object,  a  white  dwarf  (Sect.  15.1). 

The  proper  motions  of  nearby  stars  have  been 
carefully  studied  in  the  search  for  planetary  sys¬ 
tems.  Although  e.g.  Barnard’s  star  may  have  un¬ 
seen  companions,  the  existence  of  planetary  sys¬ 
tems  around  other  stars  was  not  established  by 
proper  motion  studies  but  with  spectroscopic  ob¬ 
servations  (Sect.  21.8). 


1 0.3  Spectroscopic  Binaries 

The  spectroscopic  binaries  (Fig.  10.5)  appear  as 
single  stars  in  even  the  most  powerful  telescopes, 
but  their  spectra  show  a  regular  variation.  The 
first  spectroscopic  binary  was  discovered  in  the 
1880’s,  when  it  was  found  that  the  spectral  lines 
of  f  UMa  or  Mizar  split  into  two  at  regular  inter¬ 
vals. 

The  Doppler  shift  of  a  spectral  line  is  directly 
proportional  to  the  radial  velocity.  Thus  the  sep¬ 


aration  of  the  spectral  lines  is  largest  when  one 
component  is  directly  approaching  and  the  other 
is  receding  from  the  observer.  The  period  of  the 
variation  is  the  orbital  period  of  the  stars.  Unfor¬ 
tunately,  there  is  no  general  way  of  determining 
the  position  of  the  orbit  in  space.  The  observed 
velocity  v  is  related  to  the  true  velocity  vo  ac¬ 
cording  to 


v  =  VQsmi ,  (10.3) 

where  the  inclination  i  is  the  angle  between  the 
line  of  sight  and  the  normal  of  the  orbital  plane. 

Consider  a  binary  where  the  components 
move  in  circular  orbits  about  the  centre  of  mass. 
Let  the  radii  of  the  orbits  be  a\  and  <22.  From  the 
definition  of  the  centre  of  mass  m\a\  =  m2«2 , 
and  writing  a  =  a\  +  <22,  one  obtains 


a  1 


am  2 

m\  +  m2 


(10.4) 


Fig.  1 0.4  The  apparent 
paths  of  Sirius  and  its 
companion  in  the  sky 


Fig.  10.5  Spectrum  of  the  spectroscopic  binary  k  Arietis.  In  the  upper  spectrum  the  spectral  lines  are  single,  in  the 
lower  one  doubled.  (Lick  Observatory) 
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The  true  orbital  velocity  is 

2na\ 

uo,i  =  -^-> 

where  P  is  the  orbital  period.  The  observed  or¬ 
bital  velocity  according  to  (10.3)  is  thus 


tude  of  the  periastron  can  be  determined.  Know¬ 
ing  these,  the  mass  function  or  the  individual 
masses  can  again  be  determined  to  within  a  fac¬ 
tor  sin3  i . 
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VI 


271  a  i  sin  i 
P 


(10.5) 


Substituting  (10.4),  one  obtains 

lira  m2  sin/ 

v\  = - . 

Pm  i  +  m  2 

Solving  for  a  and  substituting  it  in  Kepler’s  third 
law,  one  obtains  the  mass  function  equation : 


mi  sin3  /  vi  P 

- - - T  =  .  (10.6) 

im\  +  m2)2  2  tzG 

If  one  component  in  a  spectroscopic  binary  is 
so  faint  that  its  spectral  lines  cannot  be  observed, 
only  P  and  v\  are  observed.  Equation  (10.6)  then 
gives  the  value  of  the  mass  function,  which  is 
the  expression  on  the  left-hand  side.  Neither  the 
masses  of  the  components  nor  the  total  mass  can 
be  determined.  If  the  spectral  lines  of  both  com¬ 
ponents  can  be  observed,  V2  is  also  known.  Then 
(10.5)  gives 

v\  a\ 

V2  CL2 

and  furthermore  the  definition  of  the  centre  of 
mass  gives 

m2V2 
m  1  = - . 

v\ 

When  this  is  substituted  in  (10.6),  the  value  of 
m2  sin3  /,  and  correspondingly,  m\  sin3  /,  can  be 
determined.  However,  the  actual  masses  cannot 
be  found  without  knowing  the  inclination. 

The  size  of  the  binary  orbit  (the  semimajor 
axis  a)  is  obtained  from  (10.5)  apart  from  a  fac¬ 
tor  sin  i .  In  general  the  orbits  of  binary  stars  are 
not  circular  and  the  preceding  expressions  cannot 
be  applied  as  they  stand.  For  an  eccentric  orbit, 
the  shape  of  the  velocity  variation  departs  more 
and  more  from  a  simple  sine  curve  as  the  ec¬ 
centricity  increases.  From  the  shape  of  the  veloc¬ 
ity  variation,  both  the  eccentricity  and  the  longi¬ 


In  the  photometric  binaries,  a  periodic  variation 
in  the  total  brightness  is  caused  by  the  motions 
of  the  components  in  a  double  system.  Usu¬ 
ally  the  photometric  binaries  are  eclipsing  vari¬ 
ables ,  where  the  brightness  variations  are  due  to 
the  components  passing  in  front  of  each  other. 
A  class  of  photometric  binaries  where  there  are 
no  actual  eclipses  are  the  ellipsoidal  variables. 
In  these  systems,  at  least  one  of  the  components 
has  been  distorted  into  an  ellipsoidal  shape  by  the 
tidal  pull  of  the  other  one.  At  different  phases  of 
the  orbit,  the  projected  surface  area  of  the  dis¬ 
torted  component  varies.  The  surface  temperature 
will  also  be  lower  at  the  ends  of  the  tidal  bulges. 
Together  these  factors  cause  a  small  variation  in 
brightness. 

The  inclination  of  the  orbit  of  an  eclipsing  bi¬ 
nary  must  be  very  close  to  90° .  These  are  the  only 
spectroscopic  binaries  for  which  the  inclination  is 
known  and  thus  the  masses  can  be  uniquely  deter¬ 
mined. 

The  variation  of  the  magnitude  of  eclips¬ 
ing  variables  as  a  function  of  time  is  called 
the  lightcurve.  According  to  the  shape  of  the 
lightcurve,  they  are  grouped  into  three  main 
types:  Algol ,  /3  Lyrae  and  W  Ursae  Majoris  type 
(Fig.  10.6). 

Algol  Stars  The  Algol-type  eclipsing  variables 
have  been  named  after  /3  Persei  or  Algol.  During 
most  of  the  period,  the  lightcurve  is  fairly  con¬ 
stant.  This  corresponds  to  phases  during  which 
the  stars  are  seen  separate  from  each  other  and 
the  total  magnitude  remains  constant.  There  are 
two  different  minima  in  the  lightcurve,  one  of 
which,  the  primary  minimum,  is  usually  much 
deeper  than  the  other  one.  This  is  due  to  the 
brightness  difference  of  the  stars.  When  the  larger 
star,  which  is  usually  a  cool  giant,  eclipses  the 
smaller  and  hotter  component,  there  is  a  deep 
minimum  in  the  lightcurve.  When  the  small, 
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Fig.  1 0.6  Typical  lightcurves  and  schematic  views  of  Al¬ 
gol,  /3  Lyrae  and  W  Ursae  Majoris  type  binary  systems. 
The  size  of  the  Sun  is  shown  for  comparison 

bright  star  passes  across  the  disk  of  the  giant,  the 
total  magnitude  of  the  system  does  not  change 
by  much. 

The  shape  of  the  minima  depends  on  whether 
the  eclipses  are  partial  or  total.  In  a  partial  eclipse 
the  lightcurve  is  smooth,  since  the  brightness 
changes  smoothly  as  the  depth  of  the  eclipse 
varies.  In  a  total  eclipse  there  is  an  interval  dur¬ 
ing  which  one  component  is  completely  invis¬ 
ible.  The  total  brightness  is  then  constant  and 
the  lightcurve  has  a  flat  bottomed  minimum.  The 


shape  of  the  minima  in  Algol  variables  thus  gives 
information  on  the  inclination  of  the  orbit. 

The  duration  of  the  minima  depends  on  the  ra¬ 
tio  of  the  stellar  radii  to  the  size  of  the  orbit.  If  the 
star  is  also  a  spectroscopic  binary,  the  true  dimen¬ 
sions  of  the  orbit  can  be  obtained.  In  that  case  the 
masses  and  the  size  of  the  orbit,  and  thus  also  the 
radii  can  be  determined  without  having  to  know 
the  distance  of  the  system. 

p  Lyrae  Stars  In  the  ft  Lyrae-type  binaries,  the 
total  magnitude  varies  continuously.  The  stars  are 
so  close  to  each  other  that  one  of  them  has  been 
pulled  into  ellipsoidal  shape.  Thus  the  brightness 
varies  also  outside  the  eclipses.  The  /3  Lyrae  vari¬ 
ables  can  be  described  as  eclipsing  ellipsoidal 
variables.  In  the  ft  Lyrae  system  itself,  one  star 
has  overfilled  its  Roche  lobe  (see  Sect.  12.6)  and 
is  steadily  losing  mass  to  its  companion.  The 
mass  transfer  causes  additional  features  in  the 
lightcurve. 

W  UMa  Stars  In  W  UMa  stars,  the  lightcurve 
minima  are  almost  identical,  very  round  and 
broad.  These  are  close  binary  systems  where  both 
components  overfill  their  Roche  lobes,  forming 
a  contact  binary  system. 

The  observed  lightcurves  of  photometric  bi¬ 
naries  may  contain  many  additional  features  that 
confuse  the  preceding  classification. 

-  The  shape  of  the  star  may  be  distorted  by  the 
tidal  force  of  the  companion.  The  star  may  be 
ellipsoidal  or  fill  its  Roche  surface,  in  which 
case  it  becomes  drop-like  in  shape. 

-  The  limb  darkening  (Sects.  9.6  and  13.2)  of  the 
star  may  be  considerable.  If  the  radiation  from 
the  edges  of  the  stellar  disk  is  fainter  than  that 
from  the  centre,  it  will  tend  to  round  off  the 
lightcurve. 

-  In  elongated  stars  there  is  gravity  darkening: 
the  parts  most  distant  from  the  centre  are  cooler 
and  radiate  less  energy. 

-  There  are  also  reflection  phenomena  in  stars.  If 
the  stars  are  close  together,  they  will  heat  the 
sides  facing  each  other.  The  heated  part  of  the 
surface  will  then  be  brighter. 

-  In  systems  with  mass  transfer,  the  material 
falling  onto  one  of  the  components  will  change 
the  surface  temperature. 
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All  these  additional  effects  cause  difficulties 
in  interpreting  the  lightcurve.  Usually  one  com¬ 
putes  a  theoretical  model  and  the  corresponding 
lightcurve,  which  is  then  compared  with  the  ob¬ 
servations.  The  model  is  varied  until  a  satisfac¬ 
tory  fit  is  obtained. 

So  far  we  have  been  concerned  solely  with  the 
properties  of  binary  systems  in  the  optical  do¬ 
main.  Recently  many  double  systems  that  radiate 
strongly  at  other  wavelengths  have  been  discov¬ 
ered.  Particularly  interesting  are  the  binary  pul¬ 
sars,  where  the  velocity  variation  can  be  deter¬ 
mined  from  radio  observations.  Many  different 
types  of  binaries  have  also  been  discovered  at 
X-ray  wavelengths.  These  systems  will  be  dis¬ 
cussed  in  Chap.  15. 

The  binary  stars  are  the  only  stars  with  ac¬ 
curately  known  masses.  The  masses  for  other 
stars  are  estimated  from  the  mass-luminosity  re¬ 
lation  (Sect.  9.7),  but  that  is  valid  only  for  main- 
sequence  stars  and  has  to  be  calibrated  by  means 
of  binary  observations. 


10.5  Examples 


Example  10.2  (The  Lightcurve  of  a  Binary)  Let 
us  suppose  that  the  line  of  sight  lies  in  the  orbital 
plane  of  an  Algol  type  binary,  where  both  com¬ 
ponents  have  the  same  radius.  The  lightcurve  is 
essentially  as  shown  in  the  figure.  The  primary 
minimum  occurs  when  the  brighter  component  is 
eclipsed.  The  depth  of  the  minima  will  be  calcu¬ 
lated. 


Time 


If  the  effective  temperatures  of  the  stars  are  TA 
and  Tb  and  their  radius  is  R ,  their  luminosities 
are  given  by 

La=4ttR2oT 'X,  Lb  =  4nR2oT%. 

The  flat  part  of  the  curve  corresponds  to  the  total 
luminosity 


Example  10.1  (The  Mass  of  a  Binary  Star)  The 
distance  of  a  binary  star  is  10  pc  and  the  largest 
angular  separation  of  the  components  is  1"  and 
the  smallest  is  l” .  The  orbital  period  is  100  years. 
The  mass  of  the  binary  is  to  be  determined,  as¬ 
suming  that  the  orbital  plane  is  normal  to  the  line 
of  sight. 

From  the  angular  separation  and  the  distance, 
the  semimajor  axis  is 

a  —  4"  x  10  pc  =  40  au. 


According  to  Kepler’s  third  law 


m  i  +  m  2  = 


403 

l002 


Mq  =  6.4  M, 


o 


Let  the  semimajor  axis  of  one  component  be  a\  = 
3"  and  for  the  other  a 2  =  i" .  Now  the  masses  of 
the  components  can  be  determined  separately: 


a,2  m2 

m\ci\  =  m2Ci2  =A  m  1  =  — m2  =  — , 

a\  3 

m\  +  m2  =  6.4  =>  mi  =  1.6,  m2  =4.8. 


7- tot  —  La  T  Lb* 

The  luminosities  may  be  expressed  as  absolute 
bolometric  magnitudes  by  means  of  (4.13).  Since 
the  distance  moduli  of  the  components  are  the 
same,  the  apparent  bolometric  magnitude  at  the 
primary  minimum  will  be 


mA  ~  mtot  —  Ma  —  M\ 


tot 


=  -2.51gU  =  +2.51gLtot 


L 


tot 


La 


=  2.5  lg 


=  2.5  lg(  1  + 


4nR2oTA.  +4jtR2oT 


B 


4ttR2oTa 

Tb 
Ta 


Similarly  the  depth  of  the  secondary  minimum  is 


mB  ~  hi  tot  =  2.5  lg  (  1  + 


Ta 

Tb 


Let  the  effective  temperatures  of  the  stars  be 
Ta  =  5000  K  and  TB  =  12,000  K.  The  depth  of 
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the  primary  minimum  is  then 


m A  -mt ot  =  2.51g(  1  + 


12,000 

5000 


3.8  mag. 


The  secondary  minimum  is 


,  ,  5000  Y 

mg  —  mt  ot  =  2.5  lg  1  +  - 

i  12,000/ 


0.03  mag. 


maximum  separation  of  the  components  of  the 
H y  line. 

Exercise  10.2  A  planet  (mass  m)  is  orbiting 
a  star  (mass  M)  at  a  distance  a.  The  distance  of 
the  star  from  the  centre  of  gravity  of  the  system 
is  a'.  Show  that 

M  P2  =  a2  {a  —  a), 

where  P  is  period  in  years,  distances  are  in  au’s 
and  masses  in  solar  masses. 


1 0.6  Exercises 

Exercise  10.1  The  components  of  a  binary 
move  along  circular  orbits.  The  mutual  distance 
is  1  au,  and  the  mass  of  each  component  is  1  M©. 
An  observer  in  the  plane  of  the  orbit  will  see  pe¬ 
riodic  splitting  of  the  spectral  lines.  What  is  the 


Exercise  10.3  The  distance  of  Barnard’s  star  is 
1.83  pc  and  mass  0.135  M©.  It  has  been  sug¬ 
gested  that  it  oscillates  with  an  amplitude  of 
0.026"  in  25  year  periods.  Assuming  this  oscil¬ 
lation  is  caused  by  a  planet,  find  the  mass  and  ra¬ 
dius  of  the  orbit  of  this  planet. 
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The  stars  are  huge  gas  spheres,  hundreds  of  thou¬ 
sands  or  millions  of  times  more  massive  than  the 
Earth.  A  star  such  as  the  Sun  can  go  on  shin¬ 
ing  steadily  for  thousands  of  millions  of  years. 
This  is  shown  by  studies  of  the  prehistory  of 
the  Earth,  which  indicate  that  the  energy  radi¬ 
ated  by  the  Sun  has  not  changed  by  much  during 
the  last  four  thousand  million  years.  The  equilib¬ 
rium  of  a  star  must  remain  stable  for  such  peri¬ 
ods. 


11.1  Internal  Equilibrium  Conditions 

Mathematically  the  conditions  for  the  internal 
equilibrium  of  a  star  can  be  expressed  as  four 
differential  equations  governing  the  distribution 
of  mass,  gas  pressure  and  energy  production  and 
transport  in  the  star.  These  equations  will  now  be 
derived. 

Hydrostatic  Equilibrium  The  force  of  gravity 
pulls  the  stellar  material  towards  the  centre.  It  is 
resisted  by  the  pressure  force  due  to  the  thermal 
motions  of  the  gas  molecules.  The  first  equilib¬ 
rium  condition  is  that  these  forces  be  in  equilib¬ 
rium. 

Consider  a  cylindrical  volume  element  at  the 
distance  r  from  the  centre  of  the  star  (Fig.  11.1). 
Its  volume  is  dV  =  dAdr,  where  dA  is  its  base 
area  and  dr  its  height;  its  mass  is  dm  =  p  d A  dr, 
where  p  =  p(r)  is  the  gas  density  at  the  radius  r. 
If  the  mass  inside  radius  r  is  Mr ,  the  gravitational 


Fig.  11.1  In  hydrostatic  equilibrium  the  sum  of  the  grav¬ 
itational  and  pressure  force  acting  on  a  volume  element  is 
zero 

force  on  the  volume  element  will  be 

G  Mr  dm  GMrp 
dFS  = - —  = - —dAdr, 

where  G  is  the  gravitational  constant.  The  minus 
sign  in  this  expression  means  that  the  force  is  di¬ 
rected  towards  the  centre  of  the  star.  If  the  pres¬ 
sure  at  the  lower  surface  of  the  volume  element 
is  P  and  at  its  upper  surface  P  +  dP,  the  net  force 
of  pressure  acting  on  the  element  is 

dEp  =  P  dA  —  (P  +  dP)  dA 
=  —dP  dA. 

Since  the  pressure  decreases  outwards,  d P  will 
be  negative  and  the  force  dEp  positive.  The  equi- 
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librium  condition  is  that  the  total  force  acting  on 
the  volume  element  should  be  zero,  i.e. 

0  =  dFg  +  dfp 
GMr  p 

= - -= —  d  A  dr  —  d  P  d  A 

y  ^ 


and  mass  d Mr  at  the  radius  r  (Fig.  11.3).  Let  Lr 
be  the  energy  flux,  i.e.  the  amount  of  energy  pass¬ 
ing  through  the  surface  r  per  unit  time.  If  s  is  the 
energy  production  coefficient,  i.e.  the  amount  of 
energy  released  in  the  star  per  unit  time  and  mass, 
then 


or 


d  P  GMrp 

dr  r2 


(H.l) 


This  is  the  equation  of  hydrostatic  equilibrium. 


Mass  Distribution  The  second  equation  gives 
the  mass  contained  within  a  given  radius.  Con¬ 
sider  a  spherical  shell  of  thickness  dr  at  the  dis¬ 
tance  r  from  the  centre  (Fig.  11.2).  Its  mass  is 

d Mr  =  4nr2p  dr, 


dLr  =  Lr+ dr  —  Lr  =  s  d Mr  =  4jtr  ps  dr. 
Thus  the  energy  conservation  equation  is 

dL  y  O 

- =4Ttr2p£.  (11.3) 

dr 

The  rate  at  which  energy  is  produced  depends  on 
the  distance  to  the  centre.  Essentially  all  of  the 
energy  radiated  by  the  star  is  produced  in  the  hot 
and  dense  core.  In  the  outer  layers  the  energy  pro¬ 
duction  is  negligible  and  Lr  is  almost  constant. 


giving  the  mass  continuity  equation 

d  Mr  9 

- =4jtr2p.  (11.2) 

dr 

Energy  Production  The  third  equilibrium  con¬ 
dition  expresses  the  conservation  of  energy,  re¬ 
quiring  that  any  energy  produced  in  the  star  has 
to  be  carried  to  the  surface  and  radiated  away.  We 
again  consider  a  spherical  shell  of  thickness  dr 


Fig.  11.2  The  mass  of  a  thin  spherical  shell  is  the  product 
of  its  volume  and  its  density 


The  Temperature  Gradient  The  fourth  equi¬ 
librium  equation  gives  the  temperature  change  as 
a  function  of  the  radius,  i.e.  the  temperature  gra¬ 
dient  dL /dr.  The  form  of  the  equation  depends 
on  how  the  energy  is  transported:  by  conduction , 
convection  or  radiation. 

In  the  interiors  of  normal  stars  conduction  is 
very  inefficient,  since  the  electrons  carrying  the 
energy  can  only  travel  a  short  distance  with¬ 
out  colliding  with  other  particles.  Conduction 


Fig.  11.3  The  energy  flowing  out  of  a  spherical  shell  is 
the  sum  of  the  energy  flowing  into  it  and  the  energy  gen¬ 
erated  within  the  shell 
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only  becomes  important  in  compact  stars,  white 
dwarfs  and  neutron  stars,  where  the  mean  free 
path  of  photons  is  extremely  short,  but  that  of 
some  electrons  can  be  relatively  large.  In  nor¬ 
mal  stars  conductive  energy  transport  can  be  ne¬ 
glected. 

In  radiative  energy  transport,  photons  emitted 
in  hotter  parts  of  the  star  are  absorbed  in  cooler 
regions,  which  they  heat.  The  star  is  said  to  be  in 
radiative  equilibrium,  when  the  energy  released 
in  the  stellar  interior  is  carried  outwards  entirely 
by  radiation. 

The  radiative  temperature  gradient  is  related  to 
the  energy  flux  Lr  according  to 


where  a  =  4cr/c  =  7.564  x  10~16  Jm-3  K~4  is 
the  radiation  constant,  c  the  speed  of  light,  and 
p  the  density.  The  mass  absorption  coefficient  k 
gives  the  amount  of  absorption  per  unit  mass.  Its 
value  depends  on  the  temperature,  density  and 
chemical  composition. 

In  order  to  derive  (11 .4),  we  consider  the  equa¬ 
tion  of  radiative  transfer  (5.44).  In  terms  of  the 
variables  used  in  the  present  chapter,  it  may  be 
written 


between  the  flux  density  Fr  and  the  energy 
flux  Lr,  one  obtains  (11.4). 

The  derivative  d T /dr  is  negative,  since  the 
temperature  increases  inwards.  Clearly  there  has 
to  be  a  temperature  gradient,  if  energy  is  to  be 
transported  by  radiation:  otherwise  the  radiation 
field  would  be  the  same  in  all  directions  and  the 
net  flux  Fr  would  vanish. 

If  the  radiative  transfer  of  energy  becomes  in¬ 
efficient,  the  absolute  value  of  the  radiative  tem¬ 
perature  gradient  becomes  very  large.  In  that  case 
motions  are  set  up  in  the  gas,  which  carry  the 
energy  outwards  more  efficiently  than  the  radi¬ 
ation.  In  these  convective  motions,  hot  gas  rises 
upwards  into  cooler  layers,  where  it  loses  its  en¬ 
ergy  and  sinks  again.  The  rising  and  sinking  gas 
elements  also  mix  the  stellar  material,  and  the 
composition  of  the  convective  parts  of  a  star  be¬ 
comes  homogeneous.  Radiation  and  conduction, 
on  the  other  hand,  do  not  mix  the  material,  since 
they  move  only  energy,  not  gas. 

In  order  to  derive  the  temperature  gradient  for 
the  convective  case,  consider  a  rising  bubble.  As¬ 
sume  that  the  gas  moving  with  the  bubble  obeys 
the  adiabatic  equation  of  state 

TocP^y,  (11.5) 


COS0 


dly 

dr 


KvPly  T  jv  • 


where  P  is  the  pressure  of  the  gas  and  y,  the  adi¬ 
abatic  exponent 


In  this  equation  kv  is  replaced  with  a  suitable 
mean  value  k.  The  equation  is  then  multiplied 
with  cos  0  and  integrated  over  all  directions  and 
frequencies.  On  the  left  hand  side,  Iv  may  be  ap¬ 
proximated  with  the  Planck  function  Bv .  The  fre¬ 
quency  integral  may  then  be  evaluated  by  means 
of  (5.16).  On  the  right-hand  side,  the  first  term 
can  be  expressed  in  terms  of  the  flux  density  ac¬ 
cording  to  (4.2)  and  the  integral  over  directions  of 
the  second  gives  zero,  since  jv  does  not  depend 
on  6 .  One  thus  obtains 


471  d 
IT  dr 


— k  pFr . 


Finally,  using  the  relation 

F  y 

P  — _ _ 

r  4nr2  ’ 


y  =  CP/Cv,  (11.6) 

is  the  ratio  of  the  specific  heats  in  constant  pres¬ 
sure  and  constant  volume.  This  ratio  of  the  spe¬ 
cific  heats  depends  on  the  ionisation  of  the  gas, 
and  can  be  computed  when  the  temperature,  den¬ 
sity  and  chemical  composition  are  known. 

Taking  the  derivative  of  (11.5)  we  get  the  ex¬ 
pression  for  the  convective  temperature  gradient 

dr  \  y 

In  the  practical  computation  of  stellar  struc¬ 
ture,  one  uses  either  (11.4)  or  (11.7),  depending 
on  which  equation  gives  a  less  steep  temperature 
gradient.  In  the  outermost  layers  of  stars  heat  ex¬ 
change  with  the  surroundings  must  be  taken  into 


T  d  P 


P  dr 


(11.7) 
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account,  and  (11.7)  is  no  longer  a  good  approxi¬ 
mation.  An  often  used  method  for  calculating  the 
convective  temperature  gradient  in  that  case  is  the 
mixing-length  theory.  The  theory  of  convection  is 
a  difficult  and  still  imperfectly  understood  prob¬ 
lem,  which  is  beyond  the  scope  of  this  presenta¬ 
tion. 

The  convective  motions  set  in  when  the  radia¬ 
tive  temperature  gradient  becomes  larger  in  ab¬ 
solute  value  than  the  adiabatic  gradient,  i.e.  if  ei¬ 
ther  the  radiative  gradient  becomes  steep  or  if  the 
convective  gradient  becomes  small.  From  (11.4) 
it  can  be  seen  that  a  steep  radiative  gradient  is 
expected,  if  either  the  energy  flux  density  or  the 
mass  absorption  coefficient  becomes  large.  The 
convective  gradient  may  become  small,  if  the  adi¬ 
abatic  exponent  approaches  1 . 

Boundary  Conditions  In  order  to  obtain  a  well- 
posed  problem,  some  boundary  conditions  have 
to  be  prescribed  for  the  preceding  differential 
equations: 

-  There  are  no  sources  of  energy  or  mass  at  the 
centre  inside  the  radius  r  —  0;  thus  Mq  =  0  and 
L0  =  0. 

-  The  total  mass  within  the  radius  R  of  the  star 
is  fixed,  Mr  =  M. 

-  The  temperature  and  pressure  at  the  stellar  sur¬ 
face  have  some  determinate  values,  Tr  and  Pr. 
These  will  be  very  small  compared  to  those  in 
the  interior,  and  thus  it  is  usually  sufficient  to 
take  Tr=0  and  Pr  =  0. 


11.2  Physical  State  of  the  Gas 

Due  to  the  high  temperature  the  gas  in  the  stars  is 
almost  completely  ionised.  The  interactions  be¬ 
tween  individual  particles  are  small,  so  that,  to 
a  good  approximation,  the  gas  obeys  the  perfect 
gas  equation  of  state, 

k 

P  = - pT,  (11.8) 

/imn 

where  k  is  Boltzmann’s  constant,  /x  the  mean 
molecular  weight  in  units  of  mn,  and  mu  the 
mass  of  the  hydrogen  atom. 

The  mean  molecular  weight  can  be  approxi¬ 
mately  calculated  assuming  complete  ionisation. 
An  atom  with  nuclear  charge  Z  then  produces 
Z  +  1  free  particles  (the  nucleus  and  Z  electrons). 
Hydrogen  gives  rise  to  two  particles  per  atomic 
mass  unit;  helium  gives  rise  to  three  particles  per 
four  atomic  mass  units.  For  all  elements  heavier 
than  hydrogen  and  helium  it  is  usually  sufficient 
to  take  Z  +  1  to  be  half  the  atomic  weight.  (Exact 
values  could  easily  be  calculated,  but  the  abun¬ 
dance  of  heavy  elements  is  so  small  that  this  is 
usually  not  necessary.)  In  astrophysics  the  rela¬ 
tive  mass  fraction  of  hydrogen  is  conventionally 
denoted  by  X,  that  of  helium  by  Y  and  that  of  all 
heavier  elements  by  Z,  so  that 

x  +  y  +  z  =  i.  (ii.9) 

(The  Z  occurring  in  this  equation  should  not  be 
confused  with  the  nuclear  charge,  which  is  un¬ 
fortunately  denoted  by  the  same  letter.)  Thus  the 
mean  molecular  weight  will  be 


In  addition  to  these  boundary  conditions  one 
needs  an  expression  for  the  pressure,  which  is 
given  by  the  equation  of  state  as  well  as  expres¬ 
sions  for  the  mass  absorption  coefficient  and  the 
energy  generation  rate,  which  will  be  considered 
later.  The  solution  of  the  basic  differential  equa¬ 
tions  give  the  mass,  temperature,  density  and  en¬ 
ergy  flux  as  functions  of  the  radius.  The  stellar 
radius  and  luminosity  can  then  be  calculated  and 
compared  with  the  observations. 

The  properties  of  a  stellar  equilibrium  model 
are  essentially  determined  once  the  mass  and  the 
chemical  composition  have  been  given.  This  re¬ 
sult  is  known  as  the  Vogt-Russell  theorem. 


/x  = 


1 

2X  +  \  Y  +  i  Z ' 


(11.10) 


At  high  temperatures  the  radiation  pressure 
has  to  be  added  to  the  gas  pressure  described  by 
the  perfect  gas  equation.  The  pressure  exerted  by 
radiation  is  (see  Box  11.2) 


nad  =  \aT^  (ii.ii) 

where  a  is  the  radiation  constant.  Thus  the  total 
pressure  is 


k 

liniu 


1 

pT  H — aT 
H  3 
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(11.12) 
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The  perfect  gas  law  does  not  apply  at  very  high 
densities. 

The  Pauli  exclusion  principle  states  that  an 
atom  with  several  electrons  cannot  have  more 
than  one  electron  with  all  four  quantum  numbers 
equal.  This  can  also  be  generalised  to  a  gas  con¬ 
sisting  of  electrons  (or  other  fermions).  A  phase 
space  can  be  used  to  describe  the  electrons.  The 
phase  space  is  a  6-dimensional  space,  three  coor¬ 
dinates  of  which  give  the  position  of  the  particle 
and  the  other  three  coordinates  the  momenta  in 
v,  y  and  z  directions.  A  volume  element  of  the 
phase  space  is 


AV  =  AxAyAzApxApyApz.  (11.13) 


From  the  uncertainty  principle  it  follows  that  the 
smallest  meaningful  volume  element  is  of  the  or¬ 
der  ofh3.  According  to  the  exclusion  principle 
there  can  be  only  two  electrons  with  opposite 
spins  in  such  a  volume  element.  When  density 
becomes  high  enough,  all  volume  elements  of  the 
phase  space  will  be  filled  up  to  a  certain  limiting 
momentum.  Such  matter  is  called  degenerate. 

Electron  gas  begins  to  degenerate  when  the 
density  is  of  the  order  107  kg/m3.  In  ordinary 
stars  the  gas  is  usually  nondegenerate,  but  in 
white  dwarfs  and  in  neutron  stars,  degeneracy  is 
of  central  importance.  The  pressure  of  a  degener¬ 
ate  electron  gas  is  (see  Box  11.2) 


P  % 


(11.14) 


where  me  is  the  electron  mass  and  N/V  the  num¬ 
ber  of  electrons  per  unit  volume.  This  equation 
may  be  written  in  terms  of  the  density 


p  =  A/xemH/F, 


where  /re  is  the  mean  molecular  weight  per  free 
electron  in  units  of  mu.  An  expression  for  /xe  may 
be  derived  in  analogy  with  (11.10): 


Me  = 


1 

x+f r+±z 


2 

x  +  r 


(11.15) 


For  the  solar  hydrogen  abundance  this  yields 


/xe  =  2/(0. 71  +  1)  =  1.17. 


The  final  expression  for  the  pressure  is 


P  % 


P 


5/3 


Me^H 


(11.16) 


This  is  the  equation  of  state  of  a  degenerate  elec¬ 
tron  gas.  In  contrast  to  the  perfect  gas  law  the 
pressure  no  longer  depends  on  the  temperature, 
only  on  the  density  and  on  the  particle  masses. 

In  normal  stars  the  degenerate  gas  pressure  is 
negligible,  but  in  the  central  parts  of  giant  stars 
and  in  white  dwarfs,  where  the  density  is  of  the 
order  of  108  kg/m3,  the  degenerate  gas  pressure 
is  dominant,  in  spite  of  the  high  temperature. 

At  even  higher  densities  the  electron  momenta 
become  so  large  that  their  velocities  approach  the 
speed  of  light.  In  this  case  the  formulas  of  the 
special  theory  of  relativity  have  to  be  used.  The 
pressure  of  a  relativistic  degenerate  gas  is 

/ N\4/ 3  /  p  \4/3 

P^hc  —  =hc  — —  .  (11.17) 


In  the  relativistic  case  the  pressure  is  proportional 
to  the  density  to  the  power  4/3,  rather  than  5/3 
as  for  the  nonrelativistic  case.  The  transition  to 
the  relativistic  situation  takes  place  roughly  at  the 
density  109  kg/m3. 

In  general  the  pressure  inside  a  star  depends 
on  the  temperature  (except  for  a  completely  de¬ 
generate  gas),  density  and  chemical  composition. 
In  actual  stars  the  gas  will  never  be  totally  ionised 
or  completely  degenerate.  The  pressure  will  then 
be  given  by  more  complicated  expressions.  Still 
it  can  be  calculated  for  each  case  of  interest.  One 
may  then  write 


P  =  P(T,  p,  X,  Y,  Z),  (11.18) 


giving  the  pressure  as  a  known  function  of  the 
temperature,  density  and  chemical  composition. 

The  opacity  of  the  gas  describes  how  difficult 
it  is  for  radiation  to  propagate  through  it.  The 
change  d I  of  the  intensity  in  a  distance  dr  can 
be  expressed  as 


d I  =  —la  dr, 

where  a  is  the  opacity  (Sect.  4.5).  The  opacity 
depends  on  the  chemical  composition,  tempera- 
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ture  and  density  of  the  gas.  It  is  usually  written  as 
a  =  Kp,  where  p  is  the  density  of  the  gas  and  k 
the  mass  absorption  coefficient  ([/c]  =  m2/kg). 

The  inverse  of  the  opacity  represents  the  mean 
free  path  of  radiation  in  the  medium,  i.e.  the  dis¬ 
tance  it  can  propagate  without  being  scattered 
or  absorbed.  The  different  types  of  absorption 
processes  (bound-bound,  bound-free,  free-free) 
have  been  described  in  Sect.  5.1.  The  opacity  of 
the  stellar  material  due  to  each  process  can  be 
calculated  for  relevant  values  of  temperature  and 
density. 


1 1 .3  Stellar  Energy  Sources 

When  the  equations  of  stellar  structure  were  de¬ 
rived,  the  character  of  the  source  of  stellar  energy 
was  left  unspecified.  Knowing  a  typical  stellar  lu¬ 
minosity,  one  can  calculate  how  long  different 
energy  sources  would  last.  For  instance,  normal 
chemical  burning  could  produce  energy  for  only 
a  few  thousand  years.  The  energy  released  by  the 
contraction  of  a  star  would  last  slightly  longer, 
but  after  a  few  million  years  this  energy  source 
would  also  run  out. 

Terrestrial  biological  and  geological  evidence 
shows  that  the  solar  luminosity  has  remained 
fairly  constant  for  at  least  a  few  thousand  mil¬ 
lion  years.  Since  the  age  of  the  Earth  is  about 
5000  million  years,  the  Sun  has  presumably  ex¬ 
isted  at  least  for  that  time.  Since  the  solar  lu¬ 
minosity  is  4  x  1026  W,  it  has  radiated  about 
6  x  1043  J  in  5  x  109  years.  The  Sun’s  mass  is 
2  x  103°  kg;  thus  it  must  be  able  to  produce  at 
least  3  x  1013  J/kg. 

The  general  conditions  in  the  solar  interior 
are  known,  regardless  of  the  exact  energy  source. 
Thus,  in  Example  11.5,  it  will  be  estimated  that 
the  temperature  at  half  the  radius  is  about  5  mil¬ 
lion  degrees.  The  central  temperature  must  be 
about  ten  million  kelvins,  which  is  high  enough 
for  thermonuclear  fusion  reactions  to  take  place. 

In  fusion  reactions  light  elements  are  trans¬ 
formed  into  heavier  ones.  The  final  reaction  prod¬ 
ucts  have  a  smaller  total  mass  than  the  initial  nu¬ 
clei.  This  mass  difference  is  released  as  energy 
according  to  Einstein’s  relation  E  =  me2.  Ther¬ 


monuclear  reactions  are  commonly  referred  to  as 
burning,  although  they  have  no  relation  to  the 
chemical  burning  of  ordinary  fuels. 

The  atomic  nucleus  consists  of  protons  and 
neutrons,  together  referred  to  as  nucleons.  We  de¬ 
fine 

mp  =  proton  mass, 
mn  =  neutron  mass, 

Z  =  nuclear  charge  =  atomic  number, 

N  =  neutron  number, 

A  =  Z  +  N  =  atomic  weight, 
m(Z,  N)  =  mass  of  the  nucleus. 

The  mass  of  the  nucleus  is  smaller  than  the 
sum  of  the  masses  of  all  its  nucleons.  The  dif¬ 
ference  is  called  the  binding  energy.  The  binding 
energy  per  nucleon  is 

Q  =  -(Zmp  +  Nmn-m(Z,N))c2.  (11.19) 

It  turns  out  that  Q  increases  towards  heavier  ele¬ 
ments  up  to  iron  (Z  =  26).  Beyond  iron  the  bind¬ 
ing  energy  again  begins  to  decrease  (Fig.  11.4). 

It  is  known  that  the  stars  consist  mostly  of  hy¬ 
drogen.  Let  us  consider  how  much  energy  would 
be  released  by  the  fusion  of  four  hydrogen  nu¬ 
clei  into  a  helium  nucleus.  The  mass  of  a  pro¬ 
ton  is  1.672  x  1(T27  kg  and  that  of  a  helium  nu¬ 
cleus  is  6.644  x  10-27  kg.  The  mass  difference, 
4.6  x  10~29  kg,  corresponds  to  an  energy  differ¬ 
ence  E  —  4.1  x  10-12  J.  Thus  0.7  %  of  the  mass 
is  turned  into  energy  in  the  reaction,  correspond¬ 
ing  to  an  energy  release  of  6.4  x  10 14  J  per  one 
kilogram  of  hydrogen.  This  should  be  compared 
with  our  previous  estimate  that  3  x  10 13  J/kg  is 
needed. 

Already  in  the  1930’s  it  was  generally  ac¬ 
cepted  that  stellar  energy  had  to  be  produced  by 
nuclear  fusion.  In  1938  Hans  Be  the  and  inde¬ 
pendently  Carl  Friedrich  von  Weizsacker  put  for¬ 
ward  the  first  detailed  mechanism  for  energy  pro¬ 
duction  in  the  stars,  the  carbon-nitrogen-oxygen 
( CNO )  cycle.  The  other  important  energy  gener- 
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Fig.  11.4  The  nuclear 
binding  energy  per  nucleon 
as  a  function  of  the  atomic 
weight.  Among  isotopes 
with  the  same  atomic 
weight  the  one  with  the 
largest  binding  energy  is 
shown.  The  points 
correspond  to  nuclei  with 
even  proton  and  neutron 
numbers,  the  crosses  to 
nuclei  with  odd  mass 
numbers.  Preston,  M.A. 
(1962):  Physics  of  the 
Nucleus  (Addison-Wesley 
Publishing  Company,  Inc., 
Reading,  Mass.) 


ation  processes  (the  proton-proton  chain  and  the 
triple-alpha  reaction)  were  not  proposed  until  the 
1950’s. 

The  Proton-Proton  Chain  (Fig.  1 1 .5).  In  stars 
with  masses  of  about  that  of  the  Sun  or  smaller, 
the  energy  is  produced  by  the  proton-proton  (pp) 
chain.  It  consists  of  the  following  steps: 


ppl:  (1) 

1H+1H 

-*  2H  +  e+  +  v, 

*H+  1H  +  e-  - 

-*2H  +  Ve, 

(2) 

2H+1H 

-*  3He  +  y. 

(3) 

3  He  +  3  He 

-*  4He  +  2JH. 

For  each  reaction  (3)  the  reactions  (1)  and  (2) 
have  to  take  place  twice.  The  first  reaction  step 
has  a  very  small  probability,  which  has  not  been 
measured  in  the  laboratory.  At  the  central  den¬ 
sity  and  temperature  of  the  Sun,  the  expected 
time  for  a  proton  to  collide  with  another  one  to 
form  a  deuteron  is  10 10  years  on  the  average.  It  is 
only  thanks  to  the  slowness  of  this  reaction  that 
the  Sun  is  still  shining.  If  it  were  faster,  the  Sun 
would  have  burnt  out  long  ago.  The  neutrino  pro¬ 
duced  in  the  reaction  (1)  can  escape  freely  from 
the  star  and  carries  away  some  of  the  energy  re¬ 
leased.  The  positron  e+  is  immediately  annihi¬ 
lated  together  with  an  electron,  giving  rise  to  two 
gamma  quanta. 


The  second  reaction,  where  a  deuteron  and 
a  proton  unite  to  form  the  helium  isotope  3  He,  is 
very  fast  compared  to  the  preceding  one.  Thus  the 
abundance  of  deuterons  inside  stars  is  very  small. 

The  last  step  in  the  pp  chain  can  take  three  dif¬ 
ferent  forms.  The  ppl  chain  shown  above  is  the 
most  probable  one.  In  the  Sun  91  %  of  the  energy 
is  produced  by  the  ppl  chain.  It  is  also  possible 
for  3  He  nuclei  to  unite  into  4  He  nuclei  in  two  ad¬ 
ditional  branches  of  the  pp  chain. 


PpH:  (3) 

(4) 

(5) 

ppm:  (3) 

(4) 

(5) 

(6) 


3He  +  4He 
7Be  +  e- 
7Li  +  1H 


3He  +  4He 
7Be  +  lH 
8B 
8Be 


7Be  +  y, 

7  Li  +  ve, 

4  He  +  4He, 

7Be  +  y, 

8B  +  y, 

8Be  +  e+  +  ve, 
4  He  +  4He. 


The  Carbon  Cycle  (See  Fig.  11.6.)  At  tem¬ 
peratures  below  20  million  degrees  the  pp  chain 
is  the  main  energy  production  mechanism.  At 
higher  temperatures  corresponding  to  stars  with 
masses  above  1.5  M©,  the  carbon  (CNO)  cycle 
becomes  dominant,  because  its  reaction  rate  in¬ 
creases  more  rapidly  with  temperature.  In  the 
CNO  cycle  carbon,  oxygen  and  nitrogen  act  as 
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Fig.  11.5  The 

proton-proton  chain.  In  the 
ppl  branch,  four  protons 
are  transformed  into  one 
helium  nucleus,  two 
positrons,  two  neutrinos 
and  radiation.  The  relative 
weights  of  the  reactions  are 
given  for  conditions  in  the 
Sun.  The  pp  chain  is  the 
most  important  energy 
source  in  stars  with  mass 
below  1.5  Mq 


0.1  %  He 


very  temperature  sensitive 

I 


catalysts.  The  reaction  cycle  is  the  following: 

(1)  12C  +  1H^  13N  +  k, 

(2)  13N  —*■  13C  +  e+  +  ve, 

(3)  13C+  lR  14N  +  x, 

(4)  14N  +  l  H  — >  130  +  y, 

(5)  lsO  — »  15N  +  y  +  ve, 

(6)  15N  +  1H^  12C+4He. 

Reaction  (4)  is  the  slowest,  and  thus  deter¬ 
mines  the  rate  of  the  CNO  cycle.  At  a  tempera¬ 
ture  of  20  million  degrees  the  reaction  time  for 
the  reaction  (4)  is  a  million  years. 

The  fraction  of  energy  released  as  radiation  in 
the  CNO  cycle  is  slightly  smaller  than  in  the  pp 
chain,  because  more  energy  is  carried  away  by 
neutrinos. 


The  Triple  Alpha  Reaction  As  a  result  of  the 
preceding  reactions,  the  abundance  of  helium  in 
the  stellar  interior  increases.  At  a  temperature 
above  108  degrees  the  helium  can  be  transformed 
into  carbon  in  the  triple  alpha  reaction: 

(1)  4He  +  4He  **  8Be, 

(2)  8Be  +  4He  -*  12C  +  y. 

Here  8  Be  is  unstable  and  decays  into  two  he¬ 
lium  nuclei  or  alpha  particles  in  2.6  x  10-16  sec- 
onds.  The  production  of  carbon  thus  requires  the 
almost  simultaneous  collision  of  three  particles. 
The  reaction  is  often  written 

34He  — »■  12C  +  y. 

Once  helium  burning  has  been  completed,  at 
higher  temperatures  other  reactions  become  pos- 
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Proton  in 


Proton  in 


Fig.  11.6  The  CNO  cycle  is  catalysed  by  12C.  It  trans¬ 
forms  four  protons  into  a  helium  nucleus,  two  positrons, 
two  neutrinos  and  radiation.  It  is  the  dominant  energy 
source  for  stars  more  massive  than  1.5  M© 


sible,  in  which  heavier  elements  up  to  iron  and 
nickel  are  built  up.  Examples  of  such  reactions 
are  various  alpha  reactions  and  oxygen,  carbon 
and  silicon  burning. 


Alpha  Reactions  During  helium  burning  some 
of  the  carbon  nuclei  produced  react  with  helium 
nuclei  to  form  oxygen,  which  in  turn  reacts  to 
form  neon,  etc.  These  reactions  are  fairly  rare  and 
thus  are  not  important  as  stellar  energy  sources. 
Examples  are 


C+4He  - 

>  160  +  y. 

0  +  4He  - 

*  20Ne  +  y, 

Ne  +  4He  - 

*  24Mg  +  y. 

Carbon  Burning  After  the  helium  is  exhausted, 
carbon  burning  sets  in  at  the  temperature  (5-8)  x 
1010  K: 

12C  +  12C  24Mg  +  y 

23Na+'H 
20Ne  +  4He 
— 23Mg  +  n 
16Q  +  24He. 


Oxygen  Burning  Oxygen  is  consumed  at 
slightly  higher  temperatures  in  the  reactions 

160  +  160  -*  32S  +  Y 
-*  31P+  rH 
-*  28Si  +  4He 
-*  31S  +  n 
->  24Mg  +  24He. 

Silicon  Burning  After  several  intermediate 
steps  the  burning  of  silicon  produces  nickel  and 
iron.  The  total  process  may  be  expressed  as 

28Si  +  28Si  — >  56Ni  +  y, 

56Ni  — >  56Fe  +  2e+  +  2  ve. 

When  the  temperature  becomes  higher  than 
about  109  K,  the  energy  of  the  photons  becomes 
large  enough  to  destroy  certain  nuclei.  Such  re¬ 
actions  are  called  photonuclear  reactions  or  pho¬ 
todissociations. 

The  production  of  elements  heavier  than  iron 
requires  an  input  of  energy,  and  therefore  such 
elements  cannot  be  produced  by  thermonuclear 
reactions.  Elements  heavier  than  iron  are  almost 
exclusively  produced  by  neutron  capture  dur¬ 
ing  the  final  violent  stages  of  stellar  evolution 
(Sect.  11.5). 

The  rates  of  the  reactions  presented  above  can 
be  determined  by  laboratory  experiments  or  by 
theoretical  calculations.  Knowing  them,  one  can 
calculate  the  rate  at  which  energy  is  released  per 
unit  mass  and  time  as  a  function  of  the  density, 
temperature  and  chemical  composition: 

e  =  e(T,p,X,Y,Z).  (11.20) 

In  reality  the  relative  abundance  of  each  of  the 
heavier  nuclei  needs  to  be  known,  not  just  their 
total  abundance  Z. 


11.4  Stellar  Models 

A  theoretical  stellar  model  is  obtained  if  one 
solves  the  differential  equations  for  stellar  struc¬ 
ture.  As  we  have  already  noted,  the  model  is 
uniquely  defined  once  the  chemical  composition 
and  the  mass  of  the  star  have  been  given. 
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Table  1 1 .1  Properties  of  zero  age  main  sequence  stars.  {Tc  =  central  temperature;  pc  =  central  density;  Mc{  =  relative 
mass  of  convective  interior;  Mce  =  relative  mass  of  convective  envelope) 


M  [Mq] 

R[Rq] 

L[Lq] 

Te  [K] 

Tc  [106  K] 

Pc  [kg/m3] 

Mci  [M] 

Mce  [M] 

30 

6.6 

140,000 

44,000 

36 

3000 

0.60 

0 

15 

4.7 

21,000 

32,000 

34 

6200 

0.39 

0 

9 

3.5 

4500 

26,000 

31 

7900 

0.26 

0 

5 

2.2 

630 

20,000 

27 

26,000 

0.22 

0 

3 

1.7 

93 

14,000 

24 

42,000 

0.18 

0 

1.5 

1.2 

5.4 

8100 

19 

95,000 

0.06 

0 

1.0 

0.87 

0.74 

5800 

14 

89,000 

0 

0.01 

0.5 

0.44 

0.038 

3900 

9.1 

78,000 

0 

0.41 

Stars  just  formed  out  of  the  interstellar  me¬ 
dium  are  chemically  homogeneous.  When  stel¬ 
lar  models  for  homogeneous  stars  are  plotted  in 
the  HR  diagram,  they  fall  along  the  lower  edge  of 
the  main  sequence.  The  theoretical  sequence  ob¬ 
tained  in  this  way  is  called  the  zero  age  main  se¬ 
quence ,  ZAMS.  The  exact  position  of  the  ZAMS 
depends  on  the  initial  chemical  composition.  For 
stars  with  an  initial  abundance  of  heavy  elements 
like  that  in  the  Sun,  the  computed  ZAMS  is  in 
good  agreement  with  the  observations.  If  the  ini¬ 
tial  value  of  Z  is  smaller,  the  theoretical  ZAMS 
falls  below  the  main  sequence  in  the  subdwarf 
region  of  the  HR  diagram.  This  is  related  to  the 
classification  of  stars  into  populations  I  and  II, 
which  is  discussed  in  Sect.  17.2. 

The  theoretical  models  also  provide  an  ex¬ 
planation  for  the  mass-luminosity  relation.  The 
computed  properties  of  zero  age  main  sequence 
stars  of  different  masses  are  given  in  Table  11.1. 
The  chemical  composition  assumed  is  X  =  0.71 
(hydrogen  mass  fraction),  Y  =  0.27  (helium) 
and  Z  =  0.02  (heavier  elements),  except  for  the 
30  Mq  star,  which  has  X  =  0.70  and  Y  =  0.28. 
The  luminosity  of  a  one  solar  mass  star  is  0.74  L© 
and  the  radius  0.87  R©.  Thus  the  Sun  has  bright¬ 
ened  and  expanded  to  some  extent  during  its  evo¬ 
lution.  However,  these  changes  are  small  and  do 
not  conflict  with  the  evidence  for  a  steady  solar 
energy  output.  In  addition  the  biological  evidence 
only  goes  back  about  3000  million  years. 

The  model  calculations  show  that  the  central 
temperature  in  the  smallest  stars  (M  ~  0.08  M©) 
is  about  4  x  106  K,  which  is  the  minimum  tem¬ 
perature  required  for  the  onset  of  thermonuclear 


reactions.  In  the  biggest  stars  (M  &  50  M©),  the 
central  temperature  reaches  4  x  107  K. 

The  changes  in  chemical  composition  caused 
by  the  nuclear  reactions  can  be  computed,  since 
the  rates  of  the  various  reactions  at  different 
depths  in  the  star  are  known.  For  example,  the 
change  AX  of  the  hydrogen  abundance  in  the 
time  interval  At  is  proportional  to  the  rate  of  en¬ 
ergy  generation  s  and  to  At : 

AX  a  —s At.  (11.21) 

The  constant  of  proportionality  is  clearly  the 
amount  of  hydrogen  consumed  per  unit  energy 
[kg/J].  The  value  of  this  constant  of  proportion¬ 
ality  is  different  for  the  pp  chain  and  the  CNO 
cycle.  Therefore  the  contribution  from  each  re¬ 
action  chain  must  be  calculated  separately  in 
(11.21).  For  elements  that  are  produced  by  the 
nuclear  reactions,  the  right-hand  side  contribution 
in  (11.21)  is  positive.  If  the  star  is  convective,  the 
change  in  composition  is  obtained  by  taking  the 
average  of  (11.21)  over  the  convection  zone. 


Box  11.1  (Gas  Pressure  and  Radiation  Pres¬ 
sure)  Let  us  consider  noninteracting  particles 
in  a  rectangular  box.  The  particles  may  also  be 
photons.  Let  the  sides  of  the  box  be  Ax,  Ay 
and  Az,  and  the  number  of  particles,  N.  The 
pressure  is  caused  by  the  collisions  of  the  par¬ 
ticles  with  the  sides  of  the  box.  When  a  par¬ 
ticle  hits  a  wall  perpendicular  to  the  v  axis, 
its  momentum  in  the  v  direction,  px ,  changes 
by  Ap  =  2px.  The  particle  will  return  to  the 
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same  wall  after  the  time  At  =  2Ax/vx.  Thus 
the  pressure  exerted  by  the  particles  on  the  wall 
(surface  area  A  =  AyAz )  is 

p  _  F_  _  J2AP/At  _  HPxVx 
A  A  AxAyAz 

_  N(pxvx) 

~  V  ’ 

where  V  =  AxAyAz  is  the  volume  of  the  box 
and  the  angular  brackets  represent  the  aver¬ 
age  value.  The  momentum  is  px  =mvx  (where 
m  =  hv/c 2  for  photons),  and  hence 

p  =  Nmjvft 
V 

Suppose  the  velocities  of  the  particles  are 
isotropically  distributed.  Then  (v2)  =  (v2)  = 

(v2),  and  thus 

(^2>  =  K2)+(^)+k2)  =  3K2) 


and 

Nm(v2) 

~  3V 

If  the  particles  are  gas  molecules,  the  energy 

1  o 

of  a  molecule  is  s  =  ^mv  .  The  total  energy 
of  the  gas  is  E  =  N (s)  =  ^Nm(v  ),  and  the 
pressure  may  be  written 


P 


2  E 

3  V 


(gas). 


If  the  particles  are  photons,  they  move  with  the 
speed  of  light  and  their  energy  is  e  =  me2.  The 
total  energy  of  a  photon  gas  is  thus  E  =  N  (s)  = 
N me2  and  the  pressure  is 


P 


1  E 
3  V 


(radiation). 


According  to  (4.7),  (4.4)  and  (5.16)  the  en¬ 
ergy  density  of  blackbody  radiation  is 


47T 

—  I  = 

c 


-  F  =  -aT4  =  aT4, 
c  c 


where  a  =  Aa/c  is  the  radiation  constant.  Thus 
the  radiation  pressure  is 


Prad=a  T4/ 3. 


Box  11.2  (The  Pressure  of  a  Degenerate  Gas) 
A  gas  where  all  available  energy  levels  up  to 
a  limiting  momentum  po,  known  as  the  Fermi 
momentum,  are  filled  is  called  degenerate.  We 
shall  determine  the  pressure  of  a  completely 
degenerate  electron  gas. 

Let  the  volume  of  the  gas  be  V.  We  con¬ 
sider  electrons  with  momenta  in  the  range 
[p,  p  +  dp].  Their  available  phase  space  vol¬ 
ume  is  4jtp2dpV.  According  to  the  Heisen¬ 
berg  uncertainty  relation  the  elementary  vol¬ 
ume  in  phase  space  is  h3  and,  according  to  the 
Pauli  exclusion  principle,  this  volume  can  con¬ 
tain  two  electrons  with  opposite  spins.  Thus  the 
number  of  electrons  in  the  momentum  interval 
[p,p  +  dp]  is 


dN  =  2 


4i rp2  dp  V 
h 3 


The  total  number  of  electrons  with  momenta 
smaller  than  po  is 


871  y  fp°  2  871  y  o 

—  )„  ”  dp  =  lFT',«- 


Hence  the  Fermi  momentum  po  is 


P  0  = 


1/3  A  /  JV  \  1/3 

2\v) 


Nonrelativistic  Gas  The  kinetic  energy  of  an 
electron  is  s  =  p2/2me.  The  total  energy  of  the 
gas  is 


f  4ttV  fP°  4 

E=  s  dN  = - ^  /  p4  dp 

J  me/F  Jo 

47T  y  5 

5  meh2 

Introducing  the  expression  for  the  Fermi  mo¬ 
mentum  po ,  one  obtains 

E_n  /  3  \5/3  h2  V(N\5/3 
40  \7T/  \  V/ 

The  pressure  of  the  gas  was  derived  in 
Box  11.1: 

2  E 
P  =  -  — 

3  y 
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1 

20 


2/3/^ 

mQ 


5/3 


(nonrelativistic). 


Here  N /  V  is  the  number  density  of  electrons. 


Relativistic  Gas  If  the  density  becomes  so 
large  that  the  electron  kinetic  energy  e  corre¬ 
sponding  to  the  Fermi  momentum  exceeds  the 
rest  energy  mec2,  the  relativistic  expression  for 
the  electron  energy  has  to  be  used.  In  the  ex¬ 
treme  relativistic  case  s  =  cp  and  the  total  en¬ 
ergy 


f  ZncV  fP0  o 

E=isAN=—l pAp 

2ttcV  4 
=  ~h^p°' 

The  expression  for  the  Fermi  momentum  re¬ 
mains  unchanged,  and  hence 


E 


n  /  3 

¥  \7T 


4/3 

hcV 


n  y 4/3 

v) 


The  pressure  of  the  relativistic  electron  gas  is 
obtained  from  the  formula  derived  for  a  photon 
gas  in  Box  11.1: 


P 


1  E 
3  V 

8  \7T 


N  V 4/3 

v) 


(relativistic). 


We  have  obtained  the  nonrelativistic  and  ex¬ 
treme  relativistic  approximations  to  the  pres¬ 
sure.  In  intermediate  cases  the  exact  expression 
for  the  electron  energy, 

8  =  (mec  +  p  c  ) 


has  to  be  used. 

The  preceding  derivations  are  rigorously 
valid  only  at  zero  temperature.  However,  the 
densities  in  compact  stars  are  so  high  that  the 
effects  of  a  nonzero  temperature  are  negligible, 
and  the  gas  may  be  considered  completely  de¬ 
generate. 


11.5  Examples 

Example  11.1  (The  Gravitational  Acceleration  at 
the  Solar  Surface)  The  expression  for  the  gravi¬ 
tational  acceleration  is 

GMq 

8~~rT- 

Using  the  solar  mass  M  =  1.989  x  1030  kg  and 
radius  R  =  6.96  x  108  m,  one  obtains 

g  ~  274  m s-2  ^2^  go, 

where  go  =  9.81  ms-2  is  the  gravitational  accel¬ 
eration  at  the  surface  of  the  Earth. 


Example  11.2  (The  Average  Density  of  the  Sun) 
The  volume  of  a  sphere  with  radius  R  is 


4 

V  =  -txR 
3 


3. 


thus  the  average  density  of  the  Sun  is 


3  M 
4  7t  R3 


~  1410  kgm  3. 


Example  11.3  (Pressure  at  Half  the  Solar  Radius) 
The  pressure  can  be  estimated  from  the  condition 
for  the  hydrostatic  equilibrium  (11.1).  Suppose 
the  density  is  constant  and  equal  to  the  average 
density  p.  Then  the  mass  within  the  radius  r  is 


and  the  hydrostatic  equation  can  be  written 

d  P  GMr~p  4nG~p2r 

dr  r 2  3 

This  can  be  integrated  from  half  the  solar  radius, 
r  =  Rq /2,  to  the  surface,  where  the  pressure  van¬ 
ishes: 


4  _9 

d  P  =  --TtGp 


r  dr, 
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which  gives 


the  radiation  pressure  given  by  (1 1 . 1 1)  is 


P  =  X-nG-p2Rl 


1 


o 


-it  6.67  x  10-11  x  14102 
2 

x  (6.96  x  108)2  N/m2 
1014  Pa. 


This  estimate  is  extremely  rough,  since  the  den¬ 
sity  increases  strongly  inwards. 

Example  11.4  (The  Mean  Molecular  Weight  of 
the  Sun)  In  the  outer  layers  of  the  Sun  the  ini¬ 
tial  chemical  composition  has  not  been  changed 
by  nuclear  reactions.  In  this  case  one  can  use  the 
values  X  =  0.71,  Y  =  0.27  and  Z  =  0.02.  The 
mean  molecular  weight  (1 1.10)  is  then 


1 


fi  = 


2  x  0.71  +  0.75  x  0.27  +  0.5  x  0.02 


0.61. 


When  the  hydrogen  is  completely  exhausted, 

X  =  0  and  Y  =  0.98,  and  hence 


1 


0.75  x  0.98  +  0.5  x  0.02 


1.34. 


Example  11.5  (The  Temperature  of  the  Sun 
at  r  =  Rq/2)  Using  the  density  from  Exam¬ 
ple  11.2  and  the  pressure  from  Example  11.3, 
the  temperature  can  be  estimated  from  the  per¬ 
fect  gas  law  (11.8).  Assuming  the  surface  value 
for  the  mean  molecular  weight  (Example  11.4), 
one  obtains  the  temperature 


T  = 


limiiP 

kp 

0.61  x  1.67  x  10-27  x  1.0  x  10 
1.38  x  10-23  x  1410 

5  x  106  K. 


14 


Example  11.6  (The  Radiation  Pressure  in  the 
Sun  at  r  =  Rq/2)  In  the  previous  example  we 
found  that  the  temperature  is  T  ~  5  x  106  K.  Thus 


^rad  —  ^  aT 

1 


=  -  X  7.564  X  10“16  X  (5  X  10°) 
3 


6^ 


2  x  1011  Pa. 


This  is  about  a  thousand  times  smaller  than  the 
gas  pressure  estimated  in  Example  11.3.  Thus  it 
confirms  that  the  use  of  the  perfect  gas  law  in  Ex¬ 
ample  11.5  was  correct. 

Example  11.7  (The  Path  of  a  Photon  from  the 
Centre  of  a  Star  to  Its  Surface)  Radiative  en¬ 
ergy  transport  can  be  described  as  a  random  walk, 
where  a  photon  is  repeatedly  absorbed  and  re¬ 
emitted  in  a  random  direction.  Let  the  step  length 
of  the  walk  (the  mean  free  path)  be  d.  Consider, 
for  simplicity,  the  random  walk  in  a  plane.  After 
one  step  the  photon  is  absorbed  at 

x\=  d  cos  0\ ,  yi=d  sin  0\ , 

where  0\  is  an  angle  giving  the  direction  of  the 
step.  After  N  steps  the  coordinates  are 


N 


N 


X 


=  d  cos  6i,  y  =  ^2  d  sin  Of , 


i  =  1 


i  =  1 


and  the  distance  from  the  starting  point  is 


2  2,2 

r  =x  +  y 


=  d‘ 


'  N  \2  /  N  \ 

y^cos 6i  I  +  I  J 


2-i 


The  first  term  in  square  brackets  can  be  written 


N 


yj  cos  0i 


/ 

=  (cos$i  +  cos  62  H - 1-  cos  6n)‘ 


N 


=  ^2  cos2  Qi  +  COS  0i  COS  0j  . 

1  i^j 
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Since  the  directions  ft  are  randomly  distributed 
and  independent, 

T:  cos  ft  cos  6 j  =0. 
i 

The  same  result  applies  for  the  second  term  in 
square  brackets.  Thus 

N 

r 2  =  d2  J](cos2  ft  +  sin2  ft)  =  Aft2, 
l 


After  A  steps  the  photon  is  at  the  distance  r  = 
d\f~N  from  the  starting  point.  Similarly,  a  drunk¬ 
ard  taking  a  hundred  one-metre  steps  at  random 
will  have  wandered  ten  metres  from  his/her  start¬ 
ing  point.  The  same  result  applies  in  three  dimen¬ 
sions. 

The  time  taken  by  a  photon  to  reach  the  sur¬ 
face  from  the  centre  depends  on  the  mean  free 
path  d  =  1/a  =  l/icp.  The  value  of  k  at  half 
the  solar  radius  can  be  estimated  from  the  val¬ 
ues  of  density  and  temperature  obtained  in  Ex¬ 
amples  11.2  and  11.5.  The  mass  absorption  co¬ 
efficient  in  these  conditions  is  found  to  be  k  = 
10  m2/kg.  (We  shall  not  enter  on  how  it  is  calcu¬ 
lated.)  The  photon  mean  free  path  is  then 

1  -4 

d  =  —  ~  10  4  m. 


This  should  be  a  reasonable  estimate  in  most 
of  the  solar  interior.  Since  the  solar  radius  r  = 
109  m,  the  number  of  steps  needed  to  reach  the 
surface  will  be  A  =  ( r/d )2  =  1026.  The  total  path 
travelled  by  the  photon  is  s  =  Nd  =  1022  m,  and 
the  time  taken  is  t  =  s/c  =  106  years;  a  more 
careful  calculation  gives  t  =  107  years.  Thus  it 
takes  10  million  years  for  the  energy  generated 
at  the  centre  to  radiate  into  space.  Of  course  the 
radiation  that  leaves  the  surface  does  not  consist 
of  the  same  gamma  photons  that  were  produced 
near  the  centre.  The  intervening  scattering,  emis¬ 
sion  and  absorption  processes  have  transformed 
the  radiation  into  visible  light  (as  can  easily  be 
seen). 


1 1 .6  Exercises 

Exercise  11.1  How  many  hydrogen  atoms  are 
there  in  the  Sun  per  each  helium  atom? 

Exercise  11.2  (a)  How  many  pp  reactions  take 
place  in  the  Sun  every  second?  The  luminosity 
of  the  Sun  is  3.9  x  1026  W,  the  mass  of  a  pro¬ 
ton  is  1.00728  amu,  and  that  of  the  a  particle 
4.001514  amu  (1  amu  is  1.6604  x  10-27  kg). 

(b)  How  many  neutrinos  produced  in  these  pp 
reactions  will  hit  the  Earth  in  one  second? 

Exercise  11.3  The  mass  absorption  coefficient 
of  a  neutrino  is  k  =  10-21  m2kg_1.  Find  the 
mean  free  path  at  the  centre  of  the  Sun. 
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In  the  preceding  chapter  we  have  seen  how  one 
can  compute  the  evolution  of  a  star  by  starting 
from  a  homogeneous  model  representing  a  newly 
formed  system.  When  the  chemical  composition 
of  the  star  changes  with  time,  a  new  model  is 
computed  each  time.  In  this  chapter  we  shall  con¬ 
sider  the  theoretical  evolutionary  paths  of  sys¬ 
tems  with  various  masses  and  see  how  the  com¬ 
puted  evolution  explains  the  observational  data. 
The  following  discussion  is  rather  qualitative, 
since  the  details  of  the  theoretical  calculations 
are  too  involved  for  the  present  book.  Also,  the 
different  evolutionary  tracks  would  become  pro¬ 
hibitively  complicated  if  more  than  the  very  basic 
stellar  properties  were  included. 


1 2.1  Evolutionary  Time  Scales 


be  consumed  before  other,  more  rapid  evolution¬ 
ary  mechanisms  set  in.  Since  0.7  %  of  the  rest 
mass  is  turned  into  energy  in  hydrogen  burning, 
the  nuclear  time  scale  will  be 


0.007  x  0.1  Me2 

Z 


(12.1) 


For  the  Sun  one  obtains  the  nuclear  time  scale 
1010  years,  and  thus 


t0  «  M^M0  X  1010  a.  (12.2) 
L/Lq 

This  gives  the  nuclear  time  scale  as  a  function 
of  the  mass  M  and  luminosity  L  of  a  given  star. 
For  example,  if  the  mass  is  30  M©,  one  obtains  tn 
about  2  million  years.  The  reason  for  the  shorter 
time  scale  is  that  the  stellar  luminosity  strongly 
increases  for  higher  masses  (Table  12.1). 


Changes  in  a  star  may  take  place  on  quite  differ¬ 
ent  time  scales  at  different  evolutionary  phases. 
There  are  three  important  basic  time  scales:  the 
nuclear  time  scale  tn ,  the  thermal  time  scale  tt  and 
the  dynamical  or  freefall  time  scale  t&. 

The  Nuclear  Time  Scale  The  time  in  which 
a  star  radiates  away  all  the  energy  that  can  be 
released  by  nuclear  reactions  is  called  the  nu¬ 
clear  time  scale.  An  estimate  of  this  time  can  be 
obtained  if  one  calculates  the  time  in  which  all 
available  hydrogen  is  turned  into  helium.  On  the 
basis  of  theoretical  considerations  and  evolution¬ 
ary  computations  it  is  known  that  only  just  over 
10  %  of  the  total  mass  of  hydrogen  in  the  star  can 


The  Thermal  Time  Scale  The  time  in  which 
a  star  would  radiate  away  all  its  thermal  energy 
if  the  nuclear  energy  production  were  suddenly 
turned  off  is  called  the  thermal  time  scale.  This  is 
also  the  time  it  takes  for  radiation  from  the  centre 
to  reach  the  surface.  According  to  the  virial  theo¬ 
rem  (6.51)  the  kinetic  energy  of  the  thermal  mo¬ 
tion  of  the  gas  particles  equals  half  of  the  poten¬ 
tial  energy.  Thus  he  thermal  time  scale  is  roughly 
may  be  estimated  as 


0.5  GM2/R 


( M/Mp )2 
( R/Rq)(L/Lq ) 


x  2  x  107  a, 


(12.3) 
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Table  12.1  Stellar  lifetimes  (unit  106  years) 


Mass  [Mq] 

Spectral  type  on  the 
main  sequence 

Contraction  to  main 
sequence 

Main  sequence 

Main  sequence 
to  red  giant 

Red  giant 

30 

05 

0.02 

4.9 

0.55 

0.3 

15 

B0 

0.06 

10 

1.7 

2 

9 

B2 

0.2 

22 

0.2 

5 

5 

B5 

0.6 

68 

2 

20 

3 

A0 

3 

240 

9 

80 

1.5 

F2 

20 

2000 

280 

1.0 

G2 

50 

10,000 

680 

0.5 

M0 

200 

30,000 

0.1 

M7 

500 

107 

where  G  is  the  constant  of  gravity  and  R  the  stel¬ 
lar  radius.  For  the  Sun  the  thermal  time  scale  is 
about  20  million  years  or  1/500  of  the  nuclear 
time  scale. 


The  Dynamical  Time  Scale  The  third  and 
shortest  time  scale  is  the  time  it  would  take  a  star 
to  collapse  if  the  pressure  supporting  it  against 
gravity  were  suddenly  removed.  It  can  be  esti¬ 
mated  from  the  time  it  would  take  for  a  particle 
to  fall  freely  from  the  stellar  surface  to  the  cen¬ 
tre.  This  is  half  of  the  period  given  by  Kepler’s 
third  law,  where  the  semimajor  axis  of  the  orbit 
corresponds  to  half  the  stellar  radius  R : 


2tt  j(R/ 2)3  _  /  R 3 
Ty  GM  y  GM 


(12.4) 


The  dynamical  time  scale  of  the  Sun  is  about  half 
an  hour. 

The  ordering  of  the  time  scales  is  normally 
like  that  in  the  Sun,  i.e.  t&  h  tn. 


1 2.2  The  Contraction  of  Stars 

Towards  the  Main  Sequence 

The  formation  and  subsequent  gravitational  col¬ 
lapse  of  condensations  in  the  interstellar  medium 
will  be  considered  in  a  later  chapter.  Here  we 
shall  follow  the  behaviour  of  such  a  protostar , 
when  it  is  already  in  the  process  of  contraction. 

When  a  cloud  contracts,  gravitational  potential 
energy  is  released  and  transformed  into  thermal 


energy  of  the  gas  and  into  radiation.  Initially  the 
radiation  can  propagate  freely  through  the  mate¬ 
rial,  because  the  density  is  low  and  the  opacity 
small.  Therefore  most  of  the  liberated  energy  is 
radiated  away  and  the  temperature  does  not  in¬ 
crease.  The  contraction  takes  place  on  the  dynam¬ 
ical  time  scale;  the  gas  is  falling  freely  inwards. 

The  density  and  the  pressure  increase  most 
rapidly  near  the  centre  of  the  cloud.  As  the  den¬ 
sity  increases,  so  does  the  opacity.  A  larger  frac¬ 
tion  of  the  released  energy  is  then  turned  into 
heat,  and  the  temperature  begins  to  rise.  This 
leads  to  a  further  increase  in  the  pressure  that  is 
resisting  the  free  fall.  The  contraction  of  the  cen¬ 
tral  part  of  the  cloud  slows  down.  The  outer  parts, 
however,  are  still  falling  freely. 

At  this  stage,  the  cloud  may  already  be  con¬ 
sidered  a  protostar.  It  consists  mainly  of  hydrogen 
in  molecular  form.  When  the  temperature  reaches 
1800  K,  the  hydrogen  molecules  are  dissociated 
into  atoms.  The  dissociation  consumes  energy, 
and  the  rise  in  temperature  is  slowed  down.  The 
pressure  then  also  grows  more  slowly  and  this  in 
turn  means  that  the  rate  of  contraction  increases. 
The  same  sequence  of  events  is  repeated,  first 
when  hydrogen  is  ionised  at  104  K,  and  then 
when  helium  is  ionised.  When  the  temperature 
has  reached  about  105  K,  the  gas  is  essentially 
completely  ionised. 

The  contraction  of  a  protostar  only  stops  when 
a  large  fraction  of  the  gas  is  fully  ionised  in  the 
form  of  plasma.  The  star  then  settles  into  hy¬ 
drostatic  equilibrium.  Its  further  evolution  takes 
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Fig.  12.1  The  paths  in  the 
HR  diagram  of  stars 
contracting  to  the  main 
sequence  on  the  thermal 
time  scale.  After  a  rapid 
dynamical  collapse  the 
stars  settle  on  the  Hayashi 
track  and  evolve  towards 
the  main  sequence  on  the 
thermal  time  scale. 

(Models  by  Iben,  I.  (1965): 
Astrophys.  J.  141,  993) 


place  on  the  thermal  time  scale,  i.e.  much  more 
slowly.  The  radius  of  the  protostar  has  shrunk 
from  its  original  value  of  about  100  au  to  about 
1/4  au.  It  will  usually  be  located  inside  a  larger 
gas  cloud  and  will  be  accreting  material  from  its 
surroundings.  Its  mass  therefore  grows,  and  the 
central  temperature  and  density  increase. 

The  temperature  of  a  star  that  has  just  reached 
equilibrium  is  still  low  and  its  opacity  corre¬ 
spondingly  large.  Thus  it  will  be  convective  in  its 
centre.  The  convective  energy  transfer  is  quite  ef¬ 
ficient  and  the  surface  of  the  protostar  will  there¬ 
fore  be  relatively  bright. 

We  now  describe  the  evolution  in  the  HR  di¬ 
agram.  Initially  the  protostar  will  be  faint  and 


cool,  and  it  will  reside  at  the  lower  far  right  in 
the  HR  diagram  (outside  Fig.  12.1).  During  the 
collapse  its  surface  rapidly  heats  up  and  bright¬ 
ens  and  it  moves  to  the  upper  right  of  Fig.  12.1. 
At  the  end  of  the  collapse  the  star  will  settle  at 
a  point  corresponding  to  its  mass  on  the  Hayashi 
track.  The  Hayashi  track  (Fig.  12.1)  gives  the  lo¬ 
cation  in  the  HR  diagram  of  completely  convec¬ 
tive  stars.  Stars  to  its  right  cannot  be  in  equi¬ 
librium  and  will  collapse  on  the  dynamic  time 
scale. 

The  star  will  now  evolve  almost  along  the 
Hayashi  track  on  the  thermal  time  scale.  In  the 
HR  diagram  it  moves  almost  vertically  down¬ 
wards,  its  radius  decreases  and  its  luminosity 
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Fig.  12.2  Herbig  -Haro 
object  number  555  lies  at 
the  end  of  the  “elephant’s 
trunk”  in  Pelican  Nebula  in 
Cygnus.  The  small  wings 
are  shockwaves,  which 
give  evidence  for  powerful 
outflows  from  newly 
formed  stars  embedded 
within  the  clouds.  (Photo 
University  of  Colorado, 
University  of  Hawaii  and 
N  OAO/AURA/NSF) 


drops  (Fig.  12.1).  As  the  temperature  goes  on  in¬ 
creasing  in  its  centre,  the  opacity  diminishes  and 
energy  begins  to  be  transported  by  radiation.  The 
mass  of  the  radiative  region  will  gradually  grow 
until  finally  most  of  the  star  is  radiative.  By  then 
the  central  temperature  will  have  become  so  large 
that  nuclear  reactions  begin.  Previously  all  the 
stellar  energy  had  been  released  potential  energy, 
but  now  the  nuclear  reactions  make  a  growing 
contribution  and  the  luminosity  increases.  The 
stellar  surface  temperature  will  also  increase  and 
the  star  will  move  slightly  upwards  to  the  left 
in  the  HR  diagram.  In  massive  stars,  this  turn  to 
the  left  occurs  much  earlier,  because  their  central 
temperatures  are  higher  and  the  nuclear  reactions 
are  initiated  earlier. 

For  solar  mass  stars,  the  rapid  collapse  of  the 
protostellar  cloud  only  lasts  for  a  few  hundred 
years.  The  final  stage  of  condensation  is  much 
slower,  lasting  several  tens  of  millions  of  years. 
This  length  of  time  strongly  depends  on  the  stel¬ 
lar  mass  because  of  the  luminosity  dependence  of 
the  thermal  time  scale.  A  15  M©  star  condenses  to 
the  main  sequence  in  60,000  years,  whereas  for 
a  0. 1  M©  star,  the  time  is  hundreds  of  millions  of 
years. 

Some  of  the  hydrogen  burning  reactions  start 
already  at  a  few  million  degrees.  For  example, 


lithium,  beryllium  and  boron  burn  to  helium  in 
the  ppll  and  ppIII  branches  of  the  pp  chain  long 
before  the  complete  chain  has  turned  on.  Be¬ 
cause  the  star  is  convective  and  thus  well  mixed 
during  the  early  stages,  even  its  surface  material 
will  have  been  processed  in  the  centre.  Although 
the  abundances  of  the  above-mentioned  elements 
are  small,  they  give  important  information  on  the 
central  temperature. 

The  beginning  of  the  main  sequence  phase  is 
marked  by  the  start  of  hydrogen  burning  in  the  pp 
chain  at  a  temperature  of  about  4  million  degrees. 
The  new  form  of  energy  production  completely 
supersedes  the  energy  release  due  to  contraction. 
As  the  contraction  is  halted,  the  star  makes  a  few 
oscillations  in  the  HR  diagram,  but  soon  settles  in 
an  equilibrium  and  the  long,  quiet  main  sequence 
phase  begins. 

It  is  difficult  to  observe  stars  during  contrac¬ 
tion,  because  the  new-born  stars  are  usually  hid¬ 
den  among  dense  clouds  of  dust  and  gas.  How¬ 
ever,  some  condensations  in  interstellar  clouds 
have  been  discovered  and  near  them,  very  young 
stars.  One  example  are  the  TTauri  stars.  Their 
lithium  abundance  is  relatively  high,  which  indi¬ 
cates  that  they  are  newly  formed  stars  in  which 
the  central  temperature  has  not  yet  become  large 
enough  to  destroy  lithium.  Near  the  T  Tauri  stars, 
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small,  bright,  star-like  nebulae,  Herbig-Haro  ob¬ 
jects ,  have  been  discovered.  These  are  thought  to 
be  produced  in  the  interaction  between  a  stellar 
wind  and  the  surrounding  interstellar  medium. 


1 2.3  The  Main  Sequence  Phase 

The  main  sequence  phase  is  that  evolutionary 
stage  in  which  the  energy  released  by  the  burn¬ 
ing  of  hydrogen  in  the  core  is  the  only  source 
of  stellar  energy.  During  this  stage,  the  star  is  in 
stable  equilibrium,  and  its  structure  changes  only 
because  its  chemical  composition  is  gradually  al¬ 
tered  by  the  nuclear  reactions.  Thus  the  evolu¬ 
tion  takes  place  on  a  nuclear  time  scale,  which 
means  that  the  main  sequence  phase  is  the  longest 
part  of  the  life  of  a  star.  For  example,  for  a  so¬ 
lar  mass  star,  the  main  sequence  phase  lasts  for 
about  10,000  million  years.  More  massive  stars 
evolve  more  rapidly,  because  they  radiate  much 
more  power.  Thus  the  main  sequence  phase  of 
a  15  solar  mass  star  is  only  about  10  million 
years.  On  the  other  hand,  less  massive  stars  have 
a  longer  main  sequence  lifetime:  a  0.25  M©  star 
spends  about  70,000  million  years  on  the  main 
sequence. 

Since  stars  are  most  likely  to  be  found  in  the 
stage  of  steady  hydrogen  burning,  the  main  se¬ 
quence  in  the  HR  diagram  is  richly  populated,  in 
particular  at  its  low-mass  end.  The  more  massive 
upper  main  sequence  stars  are  less  abundant  be¬ 
cause  of  their  shorter  main  sequence  lifetimes. 

If  the  mass  of  a  star  becomes  too  large,  the 
force  of  gravity  can  no  longer  resist  the  radiation 
pressure.  Stars  more  massive  than  this  upper  limit 
cannot  form,  because  they  cannot  accrete  addi¬ 
tional  mass  during  the  contraction  phase.  Theo¬ 
retical  computations  give  a  limiting  mass  of  about 
120  M©;  the  most  massive  stars  observed  are 
claimed  to  be  about  150M©.  These  values  are, 
however,  uncertain. 

There  is  also  a  lower-mass  limit  of  the  main 
sequence.  Stars  below  0.08  M©  never  become  hot 
enough  for  hydrogen  burning  to  begin.  They  can 
still  generate  some  luminosity  from  the  burning 
of  deuterium,  but  this  energy  source  is  rapidly  ex¬ 
hausted.  These  brown  dwarfs  have  surface  tem¬ 
peratures  in  the  range  of  1000-2000  K.  Hundreds 


of  brown  dwarfs  have  now  been  found  in  dedi¬ 
cated  surveys.  The  lower  limit  for  brown  dwarf 
mass  is  sometimes  taken  to  be  about  0.015  M©, 
corresponding  to  the  minimum  mass  for  deu¬ 
terium  burning. 

If  the  mass  is  even  lower  there  are  no  nu¬ 
clear  sources  of  energy.  The  smallest  protostars 
therefore  contract  to  planet-like  dwarfs.  During 
the  contraction  phase  they  radiate  because  poten¬ 
tial  energy  is  released,  but  eventually  they  be¬ 
gin  to  cool.  In  the  HR  diagram  such  stars  first 
move  almost  vertically  downwards  and  then  fur¬ 
ther  downwards  to  the  right. 

Is  there  a  difference  between  the  lowest-mass 
brown  dwarfs  and  the  most  massive  planets?  If 
brown  dwarfs  have  formed  by  gravitational  col¬ 
lapse  and  fragmentation  as  described  in  the  pre¬ 
vious  section  and  in  Sect.  15.4,  there  is  no  rea¬ 
son  not  to  count  them  as  stars,  although  they  are 
not  producing  energy  by  nuclear  reactions.  Plan¬ 
ets  in  contrast  are  thought  to  form  much  more 
slowly  by  the  clumping  of  solids  and  accretion  of 
gas  in  a  protoplanetary  disk.  The  objects  formed 
by  this  mechanism  start  out  with  a  quite  differ¬ 
ent  structure.  Whether  such  a  clear-cut  distinction 
between  the  formation  mechanisms  of  dark  stars 
and  planets  really  can  be  made  still  remains  an 
open  question. 

The  Upper  Main  Sequence  The  stars  on  the 
upper  main  sequence  are  so  massive  and  their 
central  temperature  so  high  that  the  CNO  cycle 
can  operate.  On  the  lower  main  sequence  the  en¬ 
ergy  is  produced  by  the  pp  chain.  The  pp  chain 
and  the  CNO  cycle  are  equally  efficient  at  a  tem¬ 
perature  of  18  million  degrees,  corresponding  to 
the  central  temperature  of  a  1.5  M©  star.  The 
boundary  between  the  upper  and  the  lower  main 
sequence  corresponds  roughly  to  this  mass. 

The  energy  production  in  the  CNO  cycle  is 
very  strongly  concentrated  at  the  core.  The  out¬ 
ward  energy  flux  will  then  become  very  large, 
and  can  no  longer  be  maintained  by  radiative 
transport.  Thus  the  upper  main  sequence  stars 
have  a  convective  core ,  i.e.  the  energy  is  trans¬ 
ported  by  material  motions.  These  keep  the  mate¬ 
rial  well  mixed,  and  thus  the  hydrogen  abundance 
decreases  uniformly  with  time  within  the  entire 
convective  region. 
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Fig.  12.3  Stellar 
evolutionary  paths  in  the 
HR  diagram  at  the  main 
sequence  phase  and  later. 
On  the  main  sequence, 
bounded  by  dashed  curves , 
the  evolution  is  on  the 
nuclear  time  scale.  The 
post-main  sequence 
evolution  to  the  red  giant 
phase  is  on  the  thermal 
time  scale.  The  point 
marked  He  corresponds  to 
helium  ignition  and  in 
low-mass  stars  the  helium 
flash.  The  straight  line 
shows  the  location  of  stars 
with  the  same  radius. 

(Iben,  I.  (1967):  Annual 
Rev.  Astron.  Astrophys.  5, 
571;  data  for  30  M0  from 
Stothers,  R.  (1966): 
Astrophys.  J.  143,  91) 
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Outside  the  core,  there  is  radiative  equilib¬ 
rium ,  i.e.  the  energy  is  carried  by  radiation  and 
there  are  no  nuclear  reactions.  Between  the  core 
and  the  envelope,  there  is  a  transition  region 
where  the  hydrogen  abundance  decreases  in¬ 
wards. 

The  mass  of  the  convective  core  will  gradu¬ 
ally  diminish  as  the  hydrogen  is  consumed.  In  the 
HR  diagram  the  star  will  slowly  shift  to  the  upper 
right  as  its  luminosity  grows  and  its  surface  tem¬ 
perature  decreases  (Fig.  12.2).  When  the  central 
hydrogen  supply  becomes  exhausted,  the  core  of 
the  star  will  begin  to  shrink  rapidly.  The  surface 
temperature  will  increase  and  the  star  will  quickly 
move  to  the  upper  left.  Because  of  the  contraction 


of  the  core,  the  temperature  in  the  hydrogen  shell 
just  outside  the  core  will  increase.  It  rapidly  be¬ 
comes  high  enough  for  hydrogen  burning  to  set 
in  again. 

The  Lower  Main  Sequence  On  the  lower  main 
sequence,  the  central  temperature  is  lower  than 
for  massive  stars,  and  the  energy  is  generated  by 
the  pp  chain.  Since  the  rate  of  the  pp  chain  is  not 
as  sensitive  to  temperature  as  that  of  the  CNO  cy¬ 
cle,  the  energy  production  is  spread  over  a  larger 
region  than  in  the  more  massive  stars  (Fig.  12.3). 
In  consequence,  the  core  never  becomes  convec- 
tively  unstable,  but  remains  radiative. 
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In  the  outer  layers  of  lower  main  sequence 
stars,  the  opacity  is  high  because  of  the  low  tem¬ 
perature.  Radiation  can  then  no  longer  carry  all 
the  energy,  and  convection  will  set  in.  The  struc¬ 
ture  of  lower  main  sequence  stars  is  thus  oppo¬ 
site  to  that  of  the  upper  main  sequence:  the  centre 
is  radiative  and  the  envelope  is  convective.  Since 
there  is  no  mixing  of  material  in  the  core,  the  hy¬ 
drogen  is  most  rapidly  consumed  at  the  very  cen¬ 
tre,  and  the  hydrogen  abundance  increases  out¬ 
wards. 

As  the  amount  of  hydrogen  in  the  core  de¬ 
creases,  the  star  will  slowly  move  upwards  in 
the  HR  diagram,  almost  along  the  main  sequence 
(Fig.  12.2).  It  becomes  slightly  brighter  and  hot¬ 
ter,  but  its  radius  will  not  change  by  much.  The 
evolutionary  track  of  the  star  will  then  bend  to  the 
right,  as  hydrogen  in  the  core  nears  its  end.  Even¬ 
tually  the  core  is  almost  pure  helium.  Hydrogen 
will  continue  to  burn  in  a  thick  shell  around  the 
core. 

Stars  with  masses  between  0.08  M©  and 
0.26  Mq  have  a  very  simple  evolution.  During 
their  whole  main  sequence  phase  they  are  fully 
convective,  which  means  that  their  entire  hydro¬ 
gen  content  is  available  as  fuel.  These  stars  evolve 
very  slowly  toward  the  upper  left  in  the  HR  dia¬ 
gram.  Finally,  when  all  their  hydrogen  has  burned 
to  helium,  they  contract  to  become  white  dwarfs. 


1 2.4  The  Giant  Phase 

The  main-sequence  phase  of  stellar  evolution 
ends  when  hydrogen  is  exhausted  at  the  centre. 
The  star  then  settles  in  a  state  in  which  hydrogen 
is  burning  in  a  shell  surrounding  a  helium  core. 
As  we  have  seen,  the  transition  takes  place  grad¬ 
ually  in  lower  main- sequence  stars,  giving  rise  to 
the  Subgiant  Branch  in  the  HR  diagram,  while 
the  upper  main- sequence  stars  make  a  rapid  jump 
at  this  point. 

The  mass  of  the  helium  core  is  increased  by 
the  hydrogen  burning  in  the  shell.  This  leads  to 
the  expansion  of  the  envelope  of  the  star,  which 
moves  almost  horizontally  to  the  right  in  the  HR 
diagram.  As  the  convective  envelope  becomes 
more  extensive,  the  star  approaches  the  Hayashi 
track.  Since  it  cannot  pass  further  to  the  right,  and 


since  its  radius  continues  to  grow,  the  star  has  to 
move  upwards  along  the  Hayashi  track  towards 
larger  luminosities  (Fig.  12.2).  The  star  has  be¬ 
come  a  red  giant. 

In  low-mass  stars  (M  <  2.3  M©),  as  the  mass 
of  the  core  grows,  its  density  will  eventually  be¬ 
come  so  high  that  it  becomes  degenerate.  The 
central  temperature  will  continue  to  rise.  The 
whole  helium  core  will  have  a  uniform  temper¬ 
ature  because  of  the  high  conductivity  of  the  de¬ 
generate  gas.  If  the  mass  of  the  star  is  larger  than 
0.26  Mq  the  central  temperature  will  eventually 
reach  about  100  million  degrees,  which  is  enough 
for  helium  to  burn  to  carbon  in  the  triple  alpha 
process. 

Helium  burning  will  set  in  simultaneously  in 
the  whole  central  region  and  will  suddenly  raise 
its  temperature.  Unlike  a  normal  gas,  the  degener¬ 
ate  core  cannot  expand,  although  the  temperature 
increases  (cf.  (11.16)),  and  therefore  the  increase 
in  temperature  will  only  lead  to  a  further  accel¬ 
eration  of  the  rate  of  the  nuclear  reactions.  When 
the  temperature  increases  further,  the  degeneracy 
of  the  gas  is  removed  and  the  core  will  begin  to 
expand  violently.  Only  a  few  seconds  after  the  ig¬ 
nition  of  helium,  there  is  an  explosion,  the  helium 
flash. 

The  energy  from  the  helium  flash  is  absorbed 
by  the  outer  layers,  and  thus  it  does  not  lead  to 
the  complete  disruption  of  the  star.  In  fact  the  lu¬ 
minosity  of  the  star  drops  in  the  flash,  because 
when  the  centre  expands,  the  outer  layers  con¬ 
tract.  The  energy  released  in  the  flash  is  turned 
into  potential  energy  of  the  expanded  core.  Thus 
after  the  helium  flash,  the  star  settles  into  a  new 
state,  where  helium  is  steadily  burning  to  carbon 
in  a  nondegenerate  core. 

After  the  helium  flash  the  star  finds  itself  on 
the  horizontal  giant  branch  in  the  HR  diagram. 
The  exact  position  of  a  star  on  the  horizontal 
branch  after  the  helium  flash  is  a  sensitive  func¬ 
tion  of  its  envelope  mass.  This  in  turn  depends 
on  the  amount  of  mass  lost  by  the  star  in  the  he¬ 
lium  flash,  which  can  vary  randomly  from  star 
to  star.  While  the  luminosity  does  not  vary  much 
along  the  horizontal  branch,  the  effective  temper¬ 
atures  are  higher  for  stars  with  less  mass  in  the 
envelope.  The  horizontal  branch  is  divided  into 
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a  blue  and  a  red  part  by  a  gap  corresponding  to 
the  pulsational  instability  leading  to  RR  Lyrae 
variables  (see  Sect.  14.2).  The  form  of  the  hori¬ 
zontal  branch  for  a  collection  of  stars  depends  on 
their  metal- abundance,  in  the  sense  that  a  lower 
metal  abundance  is  related  to  a  more  prominent 
blue  horizontal  branch.  Thus  the  blue  horizon¬ 
tal  branch  in  globular  clusters  with  low  metal- 
abundances  is  strong  and  prominent  (Sect.  16.3). 
For  stars  with  solar  element  abundances  the  hori¬ 
zontal  branch  is  reduced  to  a  short  stump,  the  red 
clump ,  where  it  joins  the  red  giant  branch. 

In  intermediate-mass  stars  (2.3  M©  <  M  < 
8  Mq),  the  central  temperature  is  higher  and  the 
central  density  lower,  and  the  core  will  therefore 
not  be  degenerate.  Thus  helium  burning  can  set  in 
non-catastrophically  as  the  central  regions  con¬ 
tract.  As  the  importance  of  the  helium  burning 
core  increases,  the  star  first  moves  away  from 
the  red  giant  branch  towards  bluer  colours,  but 
then  loops  back  towards  the  Hayashi  track  again. 
An  important  consequence  of  these  blue  loops 
is  that  they  bring  the  star  into  the  strip  in  the 
HR  diagram  corresponding  to  the  cepheid  insta¬ 
bility  (Sect.  14.2).  This  gives  rise  to  the  classi¬ 
cal  cepheid  variables,  which  are  of  central  impor¬ 
tance  for  determining  distances  in  the  Milky  Way 
and  to  the  nearest  galaxies. 

In  the  most  massive  stars  helium  burning  starts 
before  the  star  has  had  time  to  reach  the  red  giant 
branch.  Some  stars  will  continue  moving  to  the 
right  in  the  HR  diagram.  For  others  this  will  pro¬ 
duce  a  massive  stellar  wind  and  a  large  mass  loss. 
Stars  in  this  evolutionary  phase,  such  as  P  Cygni 
and  r]  Carinae,  are  known  as  luminous  blue  vari¬ 
ables,  LBV ,  and  are  among  the  brightest  in  the 
Milky  Way.  If  the  star  can  retain  its  envelope  it 
will  become  a  red  supergiant.  Otherwise  it  will 
turn  back  towards  the  blue  side  of  the  HR  dia¬ 
gram,  ending  up  as  a  Wolf-Rayet  star. 

The  Asymptotic  Giant  Branch  The  evolu¬ 
tion  that  follows  core  helium  burning  depends 
strongly  on  the  stellar  mass.  The  mass  determines 
how  high  the  central  temperature  can  become  and 
the  degree  of  degeneracy  when  heavier  nuclear 
fuels  are  ignited. 

When  the  central  helium  supply  is  exhausted, 
helium  will  continue  to  burn  in  a  shell,  while 


the  hydrogen  burning  shell  is  extinguished.  In  the 
HR  diagram  the  star  will  move  towards  lower  ef¬ 
fective  temperature  and  higher  luminosity.  This 
phase  is  quite  similar  to  the  previous  red  giant 
phase  of  low-mass  stars,  although  the  temper¬ 
atures  are  slightly  hotter.  For  this  reason  it  is 
known  as  the  asymptotic  giant  branch,  AGB. 

After  the  early  phase,  when  the  helium  shell 
catches  up  with  the  extinguished  hydrogen  shell, 
the  AGB  star  enters  what  is  known  as  the  ther¬ 
mally  pulsing  phase ,  where  hydrogen  and  helium 
shell  burning  alternate.  A  configuration  with  two 
burning  shells  is  unstable,  and  in  this  phase  the 
stellar  material  may  become  mixed  or  matter  may 
be  ejected  into  space  in  a  shell,  like  that  of  a  plan¬ 
etary  nebula. 

The  thermally  pulsing  AGB  continues  until  ra¬ 
diation  pressure  has  led  to  the  complete  expul¬ 
sion  of  the  outer  layers  into  a  planetary  nebula. 
Low-  and  intermediate-mass  giants  (M  <  8  Mq) 
never  become  hot  enough  to  ignite  carbon  burn¬ 
ing  in  the  core,  which  remains  as  a  carbon- 
oxygen  white  dwarf  (Fig.  12.6). 

The  End  of  the  Giant  Phase  After  the  end  of 

helium  burning  the  evolution  of  a  star  changes 
character.  This  is  because  the  nuclear  time  scale 
at  the  centre  becomes  short  compared  to  the  ther¬ 
mal  time  scale  of  the  outer  layers.  Secondly, 
the  energy  released  in  nuclear  reactions  will  be 
carried  away  by  neutrinos,  instead  of  being  de¬ 
posited  at  the  centre.  In  consequence,  while  the 
thermonuclear  burning  follows  the  same  pattern 
as  hydrogen  and  helium  burning,  the  star  as  a 
whole  does  not  have  time  to  react  immediately. 

In  stars  with  masses  around  10  M©  either  car¬ 
bon  or  oxygen  may  be  ignited  explosively  just 
like  helium  in  low-mass  stars:  there  is  a  carbon 
or  oxygen  flash.  This  is  much  more  powerful  than 
the  helium  flash,  and  may  make  the  star  explode 
as  a  supernova  (Sects.  12.5  and  13.3). 

For  even  larger  masses  the  core  remains  non¬ 
degenerate  and  burning  will  start  non-catastro¬ 
phically  as  the  core  goes  on  contracting  and 
becoming  hotter.  First  carbon  burning  and  sub¬ 
sequently  oxygen  and  silicon  burning  (see 
Sect.  10.3)  will  be  ignited.  As  each  nuclear  fuel 
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Fig.  12.4  Energy  transport  in  the  main  sequence  phase, 
(a)  The  least  massive  stars  (M  <  0.26  M©)  are  convective 
throughout,  (b)  For  0.26  M©  <  M  <  1.5  M©  the  core  is 
radiative  and  the  envelope  convective,  (c)  Massive  stars 
(M  >1.5  Mq )  have  a  convective  core  and  a  radiative  en¬ 
velope 


is  exhausted  in  the  centre,  the  burning  will  con¬ 
tinue  in  a  shell.  The  star  will  thus  contain  sev¬ 
eral  nuclear  burning  shells.  At  the  end  the  star 
will  consist  of  a  sequence  of  layers  differing  in 
composition,  in  massive  stars  (more  massive  than 
15  M©)  all  the  way  up  to  iron. 

The  central  parts  of  the  most  massive  stars 
with  masses  larger  than  15  M©  bum  all  the  way 
to  iron  56Fe.  All  nuclear  sources  of  energy  will 
then  be  completely  exhausted.  The  structure  of 
a  30  solar  mass  star  at  this  stage  is  schemati¬ 
cally  shown  in  Fig.  12.4.  The  star  is  made  up 
of  a  nested  sequence  of  zones  bounded  by  shells 
burning  silicon  28  Si,  oxygen  16 O  and  carbon  12 C, 
helium  4He  and  hydrogen  1H.  However,  this  is 
not  a  stable  state,  since  the  end  of  nuclear  re¬ 
actions  in  the  core  means  that  the  central  pres¬ 
sure  will  fall,  and  the  core  will  collapse.  Some 
of  the  energy  released  in  the  collapse  goes  into 
dissociating  the  iron  nuclei  first  to  helium  and 
then  to  protons  and  neutrons.  This  will  further 
speed  up  the  collapse,  just  like  the  dissociation 
of  molecules  speeds  up  the  collapse  of  a  pro¬ 
tostar.  The  collapse  takes  place  on  a  dynamical 
time  scale,  which,  in  the  dense  stellar  core,  is 
only  a  fraction  of  a  second.  The  outer  parts  will 
also  collapse,  but  more  slowly.  In  consequence, 
the  temperature  will  increase  in  layers  contain¬ 
ing  unburnt  nuclear  fuel.  This  will  burn  explo¬ 
sively,  releasing  immense  amounts  of  energy  in 
a  few  seconds,  principally  in  the  form  of  neutri¬ 
nos. 

The  final  stages  of  stellar  evolution  may  be 
described  as  an  implosion  of  the  core,  which  is 


briefly  halted  every  time  a  new  source  of  nu¬ 
clear  fuel  becomes  available  for  burning  It  is  still 
an  open  problem  how  exactly  the  energy  released 
in  this  collapse  is  transformed  into  the  disrup¬ 
tion  of  the  entire  star  and  the  ejection  of  its  outer 
layers.  It  is  also  still  unclear  whether  in  a  given 
case  the  remnant  will  be  a  neutron  star  or  a  black 
hole. 

Although  the  exact  mechanism  is  not  yet  un¬ 
derstood,  the  end-point  of  the  evolution  of  stars 
more  massive  the  about  8  M©  is  that  the  outer 
layers  explode  as  a  supernova.  In  the  dense  cen¬ 
tral  core,  the  protons  and  electrons  combine  to 
form  neutrons.  The  core  will  finally  consist  al¬ 
most  entirely  of  neutrons,  which  become  degen¬ 
erate  because  of  the  high  density.  The  degeneracy 
pressure  of  the  neutrons  will  stop  the  collapse  of 
a  small  mass  core.  However,  if  the  mass  of  the 
core  is  large  enough,  a  black  hole  will  probably 
be  formed. 


1 2.5  The  Final  Stages  of  Evolution 

The  endpoints  of  stellar  evolution  can  be  seen 
from  Fig.  12.7.  This  shows  the  relation  between 
mass  and  central  density  for  a  body  at  zero  tem¬ 
perature,  i.e.  the  final  equilibrium  when  a  mas¬ 
sive  body  has  cooled.  There  are  two  maxima 
on  the  curve.  The  mass  corresponding  to  the 
left-hand  maximum  is  called  the  Chandrasekhar 
mass ,  Mch  ~  1.2-1.4M©,  and  that  correspond¬ 
ing  to  the  right-hand  one,  the  Oppenheimer- 
Volkoffmass ,  Mov  ~  1.5-2 Mq. 

Let  us  first  consider  a  star  with  mass  less  than 
M©h.  Suppose  the  mass  does  not  change.  When 
the  nuclear  fuel  is  exhausted,  the  star  will  become 
a  white  dwarf,  which  will  gradually  cool  down 
and  contract.  In  Fig.  12.5  it  moves  horizontally 
to  the  right.  Finally  it  will  reach  zero  tempera¬ 
ture  and  end  up  on  the  left-hand  rising  part  of  the 
equilibrium  curve.  Its  final  equilibrium  is  a  com¬ 
pletely  degenerate  black  dwarf. 

If  the  mass  of  the  star  is  larger  than  Mq h  but 
smaller  than  Mov,  it  can  continue  cooling  until  it 
reaches  the  right-hand  rising  section  of  the  curve. 
Again  there  is  a  stable  final  state,  this  time  corre¬ 
sponding  to  a  completely  degenerate  neutron  star. 

An  even  more  massive  star  with  mass  larger 
than  Mqv  will  go  on  contracting  past  the  den- 
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sity  corresponding  to  a  neutron  star.  There  is 
then  no  longer  any  known  possible  stable  equi¬ 
librium,  and  the  star  must  go  on  contracting  to 
form  a  black  hole. 

The  only  endpoints  of  stellar  evolution  pre¬ 
dicted  by  theory  are  the  two  stable  states  of 
Fig.  12.7  and  the  two  extreme  possibilities,  col¬ 
lapse  to  a  black  hole  or  explosive  disruption. 


Fig.  12.5  The  structure  of  a  massive  star  (30  M©)  at  a  late 
evolutionary  stage.  The  star  consists  of  layers  with  differ¬ 
ent  composition  separated  by  nuclear  burning  shells 


The  preceding  considerations  are  purely  theo¬ 
retical.  The  final  evolutionary  stages  of  real  stars 
involve  many  imperfectly  known  factors,  which 
may  affect  the  final  equilibrium.  Perhaps  most 
important  is  the  question  of  mass  loss,  which  is 
very  difficult  to  settle  either  observationally  or 
theoretically.  For  example,  in  a  supernova  explo¬ 
sion  the  whole  star  may  be  disrupted  and  it  is  very 
uncertain  whether  what  remains  will  be  a  neutron 
star,  a  black  hole  or  nothing  at  all.  (The  structure 
of  compact  stars  will  be  discussed  in  Chap.  15.) 

A  summary  of  the  various  evolutionary  paths 
is  given  in  Fig.  12.8. 


1 2.6  The  Evolution  of  Close  Binary 
Stars 

If  the  components  of  a  binary  star  are  well  sep¬ 
arated,  they  do  not  significantly  perturb  one  an¬ 
other.  When  studying  their  evolution,  one  can  re¬ 
gard  them  as  two  single  stars  evolving  indepen¬ 
dently,  as  described  above.  However,  in  close  bi¬ 
nary  pairs,  this  will  no  longer  be  the  case. 

Close  binary  stars  are  divided  into  three 
classes,  as  shown  in  Fig.  12.9:  detached ,  semide¬ 
tached  and  contact  binaries.  The  figure-eight 


Fig.  12.6  The  usual  endpoint  for  the  development  of 
a  star  with  a  mass  of  less  than  three  solar  masses, 
is  a  white  dwarf,  with  an  expanding  planetary  neb¬ 
ula  around  it.  On  the  left ,  the  planetary  nebula  NGC 
6369,  photographed  with  the  8 -meter  Gemini  South  tele¬ 


scope.  For  a  massive  star,  the  life  ends  with  a  super¬ 
nova  explosion.  On  the  right ,  the  supernova  remnant  Cas¬ 
siopeia  A  on  radio  wavelengths.  The  image  was  created 
by  the  VLA  telescope.  (Images  Gemini  Observatory/ Abu 
Team/N OAO/AURA/NSF  and  NRAO/AUI) 
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Fig.  12.7  The  evolutionary  end  points  of  stars  with  dif¬ 
ferent  masses  shown  as  a  function  of  central  density.  The 
curve  shows  the  behaviour  of  the  central  density  of  com¬ 
pletely  degenerate  (T  =0K)  bodies.  The  Chandrasekhar 
mass  Mch  and  the  Oppenheimer-Volkoff  mass  Mov  cor¬ 
respond  to  maxima  on  this  curve 

curve  drawn  in  the  figure  is  an  equipotential  sur¬ 
face  called  the  Roche  surface.  If  the  star  becomes 
larger  than  this  surface,  it  begins  to  lose  mass  to 
its  companion  through  the  waist  of  the  Roche  sur¬ 
face. 

During  the  main  sequence  phase  the  stellar  ra¬ 
dius  does  not  change  much,  and  each  component 
will  remain  within  its  own  Roche  lobe.  When 
the  hydrogen  is  exhausted,  the  stellar  core  will 
rapidly  shrink  and  the  outer  layers  expand,  as  we 
have  seen.  At  this  stage  a  star  may  exceed  its 
Roche  lobe  and  mass  transfer  may  set  in. 

Close  binary  stars  are  usually  seen  as  eclipsing 
binaries.  One  example  is  Algol  in  the  constella¬ 
tion  Perseus.  The  components  in  this  binary  sys¬ 
tem  are  a  normal  main  sequence  star  and  a  sub¬ 
giant,  which  is  much  less  massive  than  the  main 
sequence  star.  The  subgiant  has  a  high  luminos¬ 
ity  and  thus  has  apparently  already  left  the  main 
sequence.  This  is  unexpected,  since  the  compo¬ 
nents  were  presumably  formed  at  the  same  time, 
and  the  more  massive  star  should  evolve  more 
rapidly.  The  situation  is  known  as  the  Algol  para¬ 
dox:  for  some  reason,  the  less  massive  star  has 
evolved  more  rapidly. 

In  the  1950’s  a  solution  to  the  paradox  pro¬ 
posed  that  the  subgiant  was  originally  more  mas¬ 
sive,  but  that  it  had  lost  mass  to  its  companion 
during  its  evolution.  Since  the  1960’s  mass  trans¬ 
fer  in  close  binary  systems  has  been  much  stud¬ 


ied,  and  has  turned  out  be  a  very  significant  factor 
in  the  evolution  of  close  binaries. 

As  an  example,  let  us  consider  a  close  binary, 
where  the  initial  masses  of  the  components  are 
1  and  2  solar  masses  and  the  initial  orbital  pe¬ 
riod  1.4  days  (Fig.  12.10).  After  evolving  away 
from  the  main  sequence  the  more  massive  com¬ 
ponent  will  exceed  the  Roche  limit  and  begin  to 
lose  mass  to  its  companion.  Initially  the  mass  will 
be  transferred  on  the  thermal  time  scale,  and  after 
a  few  million  years  the  roles  of  the  components 
will  be  changed:  the  initially  more  massive  com¬ 
ponent  has  become  less  massive  than  its  compan¬ 
ion. 

The  binary  is  now  semidetached  and  can  be 
observed  as  an  Algol-type  eclipsing  binary.  The 
two  components  are  a  more  massive  main  se¬ 
quence  star  and  a  less  massive  subgiant  filling  its 
Roche  surface.  The  mass  transfer  will  continue, 
but  on  the  much  slower  nuclear  time  scale.  Fi¬ 
nally,  mass  transfer  will  cease  and  the  less  mas¬ 
sive  component  will  contract  to  a  0.6  M©  white 
dwarf. 

The  more  massive  2.4  M©  star  now  evolves 
and  begins  to  lose  mass,  which  will  accumulate 
on  the  surface  of  the  white  dwarf.  The  accumu¬ 
lated  mass  may  give  rise  to  nova  outbursts ,  where 
material  is  ejected  into  space  by  large  explosions. 
Despite  this,  the  mass  of  the  white  dwarf  will 
gradually  grow  and  may  eventually  exceed  the 
Chandrasekhar  mass.  The  white  dwarf  will  then 
collapse  and  explode  as  a  type  I  supernova. 

As  a  second  example,  we  can  take  a  massive 
binary  with  the  initial  masses  20  and  8  M©  and 
the  initial  period  4.7  days  (Fig.  12.11).  The  more 
massive  component  evolves  rapidly,  and  at  the 
end  of  the  main  sequence  phase,  it  will  transfer 
more  than  15  M©  of  its  material  to  the  secondary. 
The  mass  transfer  will  occur  on  the  thermal  time 
scale,  which,  in  this  case,  is  only  a  few  ten  thou¬ 
sand  years.  The  end  result  is  a  helium  star ,  having 
as  a  companion  an  unevolved  main  sequence  star. 
The  properties  of  the  helium  star  are  like  those  of 
a  Wolf-Ray et  star  (Fig.  12.12). 

Helium  continues  to  bum  to  carbon  in  the  core 
of  the  helium  star,  and  the  mass  of  the  carbon  core 
will  grow.  Eventually  the  carbon  will  be  explo¬ 
sively  ignited,  and  the  star  will  explode  as  a  su- 


274 


12  Stellar  Evolution 


3 


4 


Fig.  12.8  Evolution  schemes  for  stars  with  different 
masses.  The  radius  is  scaled  to  be  the  same  in  all  draw¬ 
ings.  In  reality,  there  are  vast  differences  in  the  sizes  of 
different  stars  and  different  phases  of  evolution.  In  the  be¬ 
ginning  (7)  a  gas  cloud  is  contracting  rapidly  in  free  fall. 
Because  the  gas  is  quite  rarefied,  radiation  escapes  easily 
from  the  cloud.  As  the  density  increases,  radiation  trans¬ 
port  becomes  more  difficult,  and  the  released  energy  tends 
to  warm  up  the  gas.  The  contraction  lasts  until  the  gas  is 
completely  ionised,  and  the  star,  which  has  become  a  pro¬ 
tostar,  is  in  hydrostatic  equilibrium  (2).  The  star  is  convec¬ 
tive  throughout  its  interior. 

Now  evolution  continues  on  a  thermal  time  scale.  The  con¬ 
traction  is  much  slower  than  in  the  free-fall  phase.  The 
phases  of  further  evolution  are  determined  by  the  mass  M 
of  the  star.  For  M  <  0.08  A7©  the  temperature  in  the  cen¬ 
tre  does  not  rise  high  enough  for  hydrogen  burning,  and 
these  stars  contract  to  planetlike  brown  dwarfs.  Stars  with 
M  >  0.08  Mq  start  hydrogen  burning  when  the  tempera¬ 
ture  has  reached  about  4  x  106  K.  This  is  the  beginning 
of  the  main  sequence  phase.  In  the  main  sequence,  the 
lowest-mass  stars  with  0.08  M©  <M<  0.26  M©  are  en¬ 
tirely  convective,  and  thus  they  remain  homogeneous  (2). 
Their  evolution  is  very  slow,  and  after  all  the  hydrogen  has 
been  burnt  to  helium,  they  contract  to  white  dwarfs  ( 4 ). 
The  increasing  temperature  makes  the  stars  with  M  > 
0.26  Mq  radiative  in  the  centre  as  the  opacity  de¬ 
creases  (5).  The  low-mass  stars  with  0.26  M©  <  M  < 
1.5  Mq  remain  radiative  in  the  centre  during  the  main 
sequence  phase  ( 6 )  as  they  burn  their  hydrogen  through 
the  pp  chain.  The  outer  part  is  convective.  At  the  end  of 
the  main  sequence  phase,  hydrogen  burning  continues  in 
a  shell  surrounding  the  helium  core  (7). 


The  outer  part  expands,  and  the  giant  phase  begins.  The 
contracting  helium  core  is  degenerate  and  warms  up.  At 
about  108  K,  the  triple  alpha  process  begins  and  leads  im¬ 
mediately  to  the  helium  flash  (8).  The  explosion  is  damped 
by  the  outer  parts,  and  helium  burning  goes  on  in  the 
core  (9).  Hydrogen  is  still  burning  in  an  outer  shell.  As  the 
central  helium  is  exhausted,  helium  burning  moves  over  to 
a  shell  (10).  At  the  same  time,  the  outer  part  expands  and 
the  star  loses  some  of  its  mass.  The  expanding  envelope 
forms  a  planetary  nebula  (77).  The  star  in  the  centre  of  the 
nebula  becomes  a  white  dwarf  (72). 

In  the  upper  main  sequence  with  M  >  1.5  M©  energy  is 
released  through  the  CNO  cycle,  and  the  core  becomes 
convective,  while  the  outer  part  is  radiative  (13).  The  main 
sequence  phase  ends  as  the  hydrogen  in  the  core  is  ex¬ 
hausted,  and  shell  burning  begins  (14).  The  helium  core 
remains  convective  and  nondegenerate,  and  helium  burn¬ 
ing  begins  without  perturbations  (75  and  19).  Afterwards, 
helium  burning  moves  over  to  a  shell  (16  and  20).  For 
stars  with  3  M©  <  M  <  15  M©  the  carbon  in  the  core  is 
degenerate,  and  a  carbon  flash  occurs  (77).  This  leads  to 
a  supernova  explosion  (18)  and  possibly  to  the  complete 
destruction  of  the  star. 

For  the  most  massive  stars  with  M  >  15  M©  the  carbon 
core  remains  convective,  and  carbon  burns  to  oxygen  and 
magnesium.  Finally,  the  star  consists  of  an  iron  core  sur¬ 
rounded  by  shells  with  silicon,  oxygen,  carbon,  helium 
and  hydrogen  (27).  The  nuclear  fuel  is  now  exhausted,  and 
the  star  collapses  on  a  dynamical  time  scale.  The  result  is 
a  supernova  (22).  The  outer  parts  explode,  but  the  remain¬ 
ing  core  continues  to  contract  to  a  neutron  star  or  a  black 
hole 
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Fig.  12.8  (Continued) 


Fig.  12.9  The  types  of  close  binary  systems:  (a)  de¬ 
tached,  (b)  semidetached  and  (c)  contact  binary 

pernova.  The  consequences  of  this  explosion  are 
not  known,  but  let  us  suppose  that  a  2  M©  com¬ 
pact  remnant  is  left.  As  the  more  massive  star  ex¬ 
pands,  its  stellar  wind  will  become  stronger,  giv¬ 
ing  rise  to  strong  X-ray  emission  as  it  hits  the 
compact  star.  This  X-ray  emission  will  only  cease 
when  the  more  massive  star  exceeds  its  Roche 
surface. 


The  system  will  now  rapidly  lose  mass  and 
angular  momentum.  A  steady  state  is  finally 
reached  when  the  system  contains  a  6  M©  helium 
star  in  addition  to  the  2  M©  compact  star.  The 
helium  star  is  seen  as  a  Wolf-Rayet  star,  which, 
after  about  a  million  years,  explodes  as  a  super¬ 
nova.  This  will  probably  lead  to  the  breakup  of 
the  binary  system.  However,  for  certain  values 
of  the  mass,  the  binary  may  remain  bound.  Thus 
a  binary  neutron  star  may  be  formed. 


1 2.7  Comparison  with  Observations 

The  most  important  direct  support  for  the  theo¬ 
retical  evolutionary  models  is  obtained  from  the 
properties  of  observed  HR  diagrams.  If  the  the¬ 
oretical  models  are  correct,  the  observed  number 
of  stars  should  reflect  the  duration  of  the  various 
evolutionary  phases.  These  are  given  for  stars  of 
different  masses  in  Table  12.1.  The  stars  are  most 
numerous  along  the  main  sequence.  Giants  are 
also  common  and,  in  addition  to  these,  there  are 
white  dwarfs,  subgiants,  etc.  The  sparsely  popu¬ 
lated  region  to  the  right  of  the  main  sequence,  the 
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Hertzsprung  gap ,  is  explained  by  the  rapid  tran¬ 
sition  from  the  main  sequence  to  the  giant  phase. 

The  cepheids  provide  an  important  test  for 
the  evolutionary  models.  The  pulsations  and  the 
relation  between  period  and  luminosity  for  the 
cepheids  can  be  understood  on  the  basis  of  the¬ 
oretical  stellar  models. 

The  evolutionary  models  can  also  explain  the 
HR  diagrams  of  star  clusters.  Let  us  assume  that 
all  the  stars  in  a  cluster  were  formed  at  the  same 
time.  In  the  youngest  systems,  the  associations, 
the  stars  will  mainly  be  found  on  the  upper  main 


Fig.  12.10  Evolution  of  a  low-mass  binary:  (a)  both 
components  on  the  main  sequence;  (b)  mass  transfer  from 
the  more  massive  component;  (c)  light  subgiant  and  mas¬ 
sive  main  sequence  star;  (d)  white  dwarf  and  main  se¬ 
quence  star;  (e)  mass  transferred  to  the  white  dwarf  from 
the  more  massive  component  leads  to  nova  outbursts; 
(f)  the  white  dwarf  mass  exceeds  the  Chandrasekhar  mass 
and  explodes  as  a  type  I  supernova 


sequence,  since  the  most  massive  stars  evolve 
most  rapidly.  To  the  right  of  the  main  sequence, 
there  will  be  less  massive  T  Tauri  stars,  which  are 
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Fig.  12.11  Evolution  of  a  massive  binary.  It  has  been 
assumed  that  the  supernova  explosion  of  a  5  M©  helium 
star  leaves  a  2  M©  compact  remnant  (neutron  star  or  black 
hole),  (a)  Main  sequence  phase;  (b)  beginning  of  the  first 
mass  transfer  phase;  (c)  end  of  the  first  mass  transfer 
phase;  the  first  Wolf-Ray et  phase  begins;  (d)  the  helium 
star  ( Wolf-Ray  et  star)  has  exploded  as  a  supernova;  (e)  the 
23  Mq  component  becomes  a  supergiant;  the  compact 
component  is  a  strong  X-ray  source;  (f)  beginning  of  the 
second  mass  transfer  phase;  the  X-ray  source  is  throttled 
and  large-scale  mass  loss  begins;  (g)  second  Wolf-Rayet 
phase;  (h)  the  6  M©  helium  star  has  exploded  as  a  super¬ 
nova;  the  binary  may  or  may  not  be  disrupted,  depending 
on  the  remaining  mass 
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Fig.  12.12  The  Wolf-Rayet  star  WR  104  photographed 
by  the  10-m  Keck  telescope.  The  spiral  is  dust  and  gas 
which  is  thrown  out  from  the  rotating  binary  system.  The 
spiral  “pinwheel”  is  seen  to  make  a  full  revolution  in  about 
220  days.  (Photo  U.C.  Berkeley  Space  Sciences  Labora¬ 
tory  AV.M  .  Keck  Observatory) 

still  contracting.  In  intermediate  age  open  clus¬ 
ters,  the  main  sequence  will  be  well  developed 
and  its  upper  end  should  bend  to  the  right,  since 
the  most  massive  stars  will  already  have  begun  to 
evolve  off  the  main  sequence.  In  the  old  globular 
clusters,  the  giant  branch  should  increase  in  im¬ 
portance  in  the  older  clusters.  These  predictions 
are  confirmed  by  the  observations,  which  will  be 
further  discussed  in  Chap.  16  on  star  clusters. 

Of  course,  the  most  detailed  observations  can 
be  made  of  the  Sun,  which  is  therefore  a  crucial 
point  of  comparison  for  the  theoretical  models. 
If  a  star  of  one  solar  mass  with  an  initial  com¬ 
position  of  71  %  hydrogen,  27  %  helium  and 
2  %  heavier  elements  is  allowed  to  evolve  for 
5000  million  years,  it  will  be  very  similar  to  our 
present  Sun.  In  particular,  it  will  have  the  same 
radius,  surface  temperature  and  luminosity.  Ac¬ 
cording  to  calculations,  about  half  of  the  Sun’s 
supply  of  hydrogen  fuel  has  been  consumed.  The 
Sun  will  go  on  shining  like  a  normal  main  se¬ 
quence  star  for  another  5000  million  years,  before 
there  will  be  any  dramatic  change. 

Some  problems  remain  in  regard  to  the  obser¬ 
vations.  One  is  the  solar  neutrino  problem.  The 


neutrinos  produced  by  solar  nuclear  reactions 
have  been  observed  since  the  beginning  of  the 
1970’s  by  the  techniques  described  in  Sect.  3.7. 
Only  the  neutrinos  formed  in  the  relatively  rare 
ppIII  reaction  are  energetic  enough  to  be  ob¬ 
served  in  this  way.  Their  observed  number  is  too 
small:  whereas  the  models  predict  about  5  units, 
the  observations  have  consistently  only  registered 
1-2. 

The  discrepancy  may  be  due  to  a  fault  in 
the  observational  technique  or  to  some  unknown 
properties  of  the  neutrinos.  However,  if  the  solar 
models  are  really  in  error,  the  central  temperature 
of  the  Sun  would  have  to  be  about  20  %  lower 
than  thought,  which  would  be  in  serious  conflict 
with  the  observed  solar  luminosity.  One  possibil¬ 
ity  is  that  some  of  the  electron  neutrinos  change 
to  other,  unobservable  particles  during  their  pas¬ 
sage  to  Earth.  (See  also  Sect.  12.1.) 

A  second  problem  is  the  observed  abundance 
of  lithium  and  beryllium.  The  solar  surface  con¬ 
tains  a  normal  abundance  of  beryllium,  but  very 
little  lithium.  This  should  mean  that  during  its 
contraction,  the  Sun  was  still  fully  convective 
when  the  central  temperature  was  high  enough 
to  destroy  lithium  (3  x  106  K),  but  not  beryl¬ 
lium  (4  x  106  K).  However,  according  to  the  stan¬ 
dard  solar  evolution  models,  convection  ceased  in 
the  centre  already  at  a  temperature  of  2  x  106  K. 
One  suggested  explanation  is  that  the  convection 
has  later  carried  down  lithium  to  layers  where  the 
temperature  is  high  enough  to  destroy  it. 


1 2.8  The  Origin  of  the  Elements 

There  are  just  under  a  hundred  naturally  occur¬ 
ring  elements,  and  about  300  isotopes  in  the  solar 
system  (Fig.  12.13).  In  Sect.  12.4,  we  have  seen 
how  the  elements  up  to  iron  are  produced  when 
hydrogen  burns  to  helium  and  helium  further  to 
carbon,  oxygen  and  heavier  elements. 

Almost  all  nuclei  heavier  than  helium  were 
produced  in  nuclear  reactions  in  stellar  interiors. 
In  the  oldest  stars,  the  mass  fraction  of  heavy 
elements  is  only  about  0.02  %,  whereas  in  the 
youngest  stars  it  is  a  few  per  cent.  Neverthe¬ 
less,  most  of  the  stellar  material  is  hydrogen  and 
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Fig.  12.13  Element 
abundances  in  the  solar 
system  as  a  function  of  the 
nuclear  mass  number.  The 
abundance  of  Si  has  been 
normalised  as  106 
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helium.  According  to  the  standard  cosmological 
model,  those  were  formed  in  the  early  stages  of 
the  Universe,  when  the  temperature  and  density 
were  suitable  for  nuclear  reactions.  (This  will  be 
discussed  in  Chap.  20.)  Although  helium  is  pro¬ 
duced  during  the  main  sequence  stellar  evolution, 
very  little  of  it  is  actually  returned  into  space  to  be 
incorporated  into  later  stellar  generations.  Most 
of  it  is  either  transformed  into  heavier  elements 
by  further  reactions,  or  else  remains  locked  up  in¬ 
side  white  dwarf  remnants.  Therefore  the  helium 
abundance  does  not  increase  by  much  due  to  stel¬ 
lar  processes. 

The  most  important  nuclear  reactions  leading 
to  the  build-up  of  the  heavy  nuclei  up  to  iron  were 
presented  in  Sect.  11.3.  The  probabilities  of  the 
various  reactions  are  determined  either  by  exper¬ 
iments  or  by  theoretical  calculations.  When  they 


are  known,  the  relative  abundances  of  the  various 
nuclei  produced  can  be  calculated. 

The  formation  of  elements  heavier  than  iron 
requires  an  input  of  energy,  and  thus  they  cannot 
be  explained  in  the  same  manner.  Still  heavy  nu¬ 
clei  are  continually  produced.  In  1952  technetium 
was  discovered  in  the  atmosphere  of  a  red  giant. 
The  half-life  of  the  most  longlived  isotope  98  Tc  is 
about  1.5  x  106  years,  so  that  the  observed  tech¬ 
netium  must  have  been  produced  in  the  star. 

Most  of  the  nuclei  more  massive  than  iron  are 
formed  by  neutron  capture  (Fig.  12.14).  Since 
the  neutron  does  not  have  an  electric  charge,  it 
can  easily  penetrate  into  the  nucleus.  The  prob¬ 
ability  for  neutron  capture  depends  both  on  the 
kinetic  energy  of  the  incoming  neutron  and  on 
the  mass  number  of  the  nucleus.  For  example,  in 
the  solar  system  the  abundances  of  isotopes  show 
maxima  at  the  mass  numbers  A  =  70-90,  130, 
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Fig.  12.14  Neutron  capture  paths  for  the  s-process  and 
r-process  (from  left  to  right).  The  s-process  follows  a  path 
along  the  line  of  beta  stability.  The  stable  r-process  nuclei 
(small  circles)  result  from  beta  decay  of  their  neutron  rich 
progenitors  on  the  shaded  path  shown  lower.  Beta  decay 
occurs  along  straight  lines  A  =  const.  The  closed  neutron 

138,  195  and  208.  These  mass  numbers  cor¬ 
respond  to  nuclei  with  closed  neutron  shells  at 
the  neutron  numbers  N  =  50,  82,  and  126.  The 
neutron  capture  probability  for  these  nuclei  is 
very  small.  The  closed  shell  nuclei  thus  react 
more  slowly  and  are  accumulated  in  greater  abun¬ 
dances. 

In  a  neutron  capture,  a  nucleus  with  mass 
number  A  is  transformed  into  a  more  massive  nu¬ 
cleus: 

(Z,  A)  +  n  — >  (Z,  A  +  1)  +  y. 

The  newly  formed  nucleus  may  be  unstable  to 
P  decay ,  where  one  neutron  is  transformed  into 
a  proton: 

(Z,  A  +  1)  — >  (Z  +  1,  A  +  1)  +  e  +  Vq. 

Two  kinds  of  neutron  capture  processes  are  en¬ 
countered,  depending  on  the  value  of  the  neutron 
flux.  In  the  slow  s-process ,  the  neutron  flux  is 


shells  in  nuclei  at  N  =  50,  82  and  126  correspond  to  abun¬ 
dance  peaks  in  s-process  nuclei  at  A  =  88,  138  and  208, 
and  in  r-process  nuclei  at  A  =  80,  130  and  195.  (Seeger, 
P.A.,  Fowler,  W.A.,  Clayton,  D.D.  (1965):  Astrophys.  J. 
Suppl.  11,  121) 


so  small  that  any  ft  decays  have  had  time  to  oc¬ 
cur  before  the  next  neutron  capture  reaction  takes 
place.  The  most  stable  nuclei  up  to  mass  num¬ 
ber  210  are  formed  by  the  s-process.  These  nu¬ 
clei  are  said  to  correspond  to  the  ft  stability  val¬ 
ley.  The  s-process  explains  the  abundance  peaks 
at  the  mass  numbers  88,  138  and  208. 

When  the  neutron  flux  is  large,  p  decays  do 
not  have  time  to  happen  before  the  next  neu¬ 
tron  capture.  One  then  speaks  of  the  rapid  r- 
process ,  which  gives  rise  to  more  neutron-rich 
isotopes.  The  abundance  maxima  produced  by 
the  r-process  lie  at  mass  numbers  about  ten  units 
smaller  than  those  of  the  s-process. 

A  neutron  flux  sufficient  for  the  s-process  is 
obtained  in  the  course  of  normal  stellar  evolu¬ 
tion.  For  example,  some  of  the  carbon  and  oxy¬ 
gen  burning  reactions  produce  free  neutrons.  If 
there  is  convection  between  the  hydrogen  and  he¬ 
lium  burning  shells,  free  protons  may  be  carried 
into  the  carbon-rich  layers.  Then  the  following 
neutron-producing  reaction  chain  becomes  im- 
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portant: 


12C  +  p  - 

»  13N  +  y, 

13n 

»  13C  +  e+  +  v, 

13C  +4  He  - 

*  160  +  n. 

The  convection  can  also  carry  the  reaction  prod¬ 
ucts  nearer  to  the  surface. 

The  neutron  flux  required  for  the  r-process 
is  about  1022  cm-3,  which  is  too  large  to  be 
produced  during  normal  stellar  evolution.  The 
only  presently  known  site  where  a  large  enough 
neutron  flux  is  expected  is  near  a  neutron  star 
forming  in  a  supernova  explosion.  In  this  case, 
the  rapid  neutron  capture  leads  to  nuclei  that 
cannot  capture  more  neutrons  without  becoming 
strongly  unstable.  After  one  or  more  rapid  /3  de¬ 
cays,  the  process  continues. 

The  r-process  stops  when  the  neutron  flux  de¬ 
creases.  The  nuclei  produced  then  gradually  de¬ 
cay  by  the  /3 -process  towards  more  stable  iso¬ 
topes.  Since  the  path  of  the  r-process  goes  about 
ten  mass  units  below  the  stability  valley,  the  abun¬ 
dance  peaks  produced  will  fall  about  ten  units 
below  those  of  the  s-process.  This  is  shown  in 
Fig.  12.11.  The  most  massive  naturally  occurring 
elements,  such  as  uranium,  thorium  and  pluto¬ 
nium,  are  formed  by  the  r-process. 

There  are  about  40  isotopes  on  the  proton-rich 
side  of  the  /3  stability  valley  that  cannot  be  pro¬ 
duced  by  neutron  capture  processes.  Their  abun¬ 
dances  are  very  small,  relative  to  the  neighbour¬ 
ing  isotopes.  They  are  formed  in  supernova  ex¬ 
plosions  at  temperatures  higher  than  109  K  by  re¬ 
actions  known  as  the  p-process.  At  this  tempera¬ 
ture,  pair  formation  can  take  place: 

y  -^e+  +  e_. 

The  positron  may  either  be  annihilated  immedi¬ 
ately  or  be  consumed  in  the  reaction 

e+  +  (Z,  A)  — >  (Z  +  1,  A)  +  ve. 


the  isotopes  184W,  190Pt  and  196Hg  formed  by  the 
fission  of  lead. 

All  the  preceding  reaction  products  are  ejected 
into  the  interstellar  medium  in  the  supernova 
explosion.  Collisions  between  cosmic  rays  and 
heavy  nuclei  then  finally  give  rise  to  the  light  el¬ 
ements  lithium,  beryllium  and  boron.  Thus  the 
abundances  of  essentially  all  naturally  occurring 
isotopes  can  be  explained. 

During  succeeding  generations  of  stars  the  rel¬ 
ative  abundance  of  heavy  elements  increases  in 
the  interstellar  medium.  They  can  then  be  incor¬ 
porated  into  new  stars,  planets — and  living  be¬ 
ings. 


12.9  Example 


Example  12.1  An  interstellar  cloud  has  a  mass 
of  one  solar  mass  and  density  of  1010  hydrogen 
atoms  per  cm3.  Its  rotation  period  is  1000  years. 
What  is  the  rotation  period  after  the  cloud  has 
condensed  into  a  star  of  solar  size? 

The  angular  momentum  is  L  =  loo,  where  co  is 
the  angular  velocity  and  I  is  the  moment  of  iner¬ 
tia.  For  a  homogeneous  sphere 

2  2 
I  =  -MR2, 

5 

where  M  is  the  mass  and  R  the  radius.  From  the 
conservation  of  the  angular  momentum  we  get 


L  =  I  \  CD  \  =  I 20)2 


I\2n  l22n 


Pi 


Pi 


h  \mr\ 

=►  P2  =  P\-r  =  P\1J — | 

h  1  MR} 


=  P\ 


R2 

Ri 


where  P\  and  P2  are  the  rotation  periods  before 
and  after  the  collapse.  The  mass  of  the  cloud  is 


Another  reaction  in  the  p-process  is 

(Z,  A)  +  p  —>  (Z  +  1,  A  +  1)  +  y. 

Finally,  the  fission  of  some  heavier  isotopes  may 
give  rise  to  p-process  nuclei.  Examples  of  this  are 


4  2 

M  =  -7T  R5  p 
3 

=  -71 R3  X  1016  X  1.6734  X  10“27  kg 
3  6 

=  1  AT©  =  1.989  x  1030  kg. 
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Solving  for  the  radius  we  get  R  =  3  x  1013  m. 
The  rotation  period  after  the  collapse  is 


P2  =  1000  a  x 


6.96  x  108  m\2 
3  x  1013  m 


=  5.4  x  10  7  a  =  17  s. 


This  is  several  orders  of  magnitude  shorter  than 
the  actual  period.  Somehow  the  star  has  to  get  rid 
of  most  of  its  angular  momentum  during  the  pro¬ 
cess. 


12.10  Exercises 

Exercise  12.1  Find  the  free  fall  time  scale  for 
a  hydrogen  cloud,  if  the  density  of  H2  molecules 
is  3000  cm-3.  Assume  that  stars  condense  from 
such  clouds,  there  are  100  clouds  in  the  Galaxy, 


the  mass  of  each  cloud  is  5  x  104Mo,  and  10  % 
of  the  mass  is  converted  into  stars.  Also  assume 
that  the  average  mass  of  a  star  is  1  M©.  How 
many  stars  are  born  in  one  year? 

Exercise  12.2  The  mass  of  Vega  (spectral  class 
A0V)  is  2  Mq,  radius  3  Rq,  and  luminosity 
60  L©.  Find  its  thermal  and  nuclear  time  scales. 

Exercise  12.3  Assume  that  a  star  remains  109 
years  in  the  main  sequence  and  burns  10  %  of  its 
hydrogen.  Then  the  star  will  expand  into  a  red  gi¬ 
ant,  and  its  luminosity  will  increase  by  a  factor 
of  100.  How  long  is  the  red  giant  stage,  if  we  as¬ 
sume  that  the  energy  is  produced  only  by  burning 
the  remaining  hydrogen? 
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The  Sun  is  our  nearest  star.  It  is  important  for 

Surface 

X 

— 

0.71 

astronomy  because  many  phenomena  which  can 

chemical 

composition 

Y 

— 

0.27 

only  be  studied  indirectly  in  other  stars  can  be  di- 

Z 

— 

0.02 

rectly  observed  in  the  Sun  (e.g.  stellar  rotation, 
starspots,  the  structure  of  the  stellar  surface).  Our 
present  picture  of  the  Sun  is  based  both  on  ob¬ 
servations  and  on  theoretical  calculations.  Some 
observations  of  the  Sun  disagree  with  the  theoret¬ 
ical  solar  models.  The  details  of  the  models  will 
have  to  be  changed,  but  the  general  picture  should 
remain  valid. 

1 3.1  Internal  Structure 

The  Sun  is  a  typical  main  sequence  star.  Its  prin¬ 
cipal  properties  are: 


Rotational  period 


Mass 

o 

§ 

II 

S 

— 

1.989  x  1030  kg 

Radius 

o 

II 

— 

6.960  x  10s  m 

Mean  density 

P 

— 

1409  kg/m3 

Central  density 

Pc 

— 

1.6  x  105  kg/m3 

Luminosity 

O 

II 

*4 

— 

3.9  x  1026  W 

Effective 

temperature 

Te 

— 

5785  K 

Central  temperature 

Tc 

— 

1.5  x  107  K 

Absolute 

bolometric 

magnitude 

M\)0\ 

4.72 

Absolute  visual 
magnitude 

My 

— 

4.79 

Spectral  class 

G2V 

Colour  indices 

B-V 

— 

0.62 

U  -B 

— 

0.10 

at  the  equator 

25  d 

at  latitude  60° 

29  d 

Fig.  13.1  The  distribution  of  temperature,  pressure,  en¬ 
ergy  production  and  mass  as  functions  of  radius  in  the  Sun 


On  the  basis  of  these  data,  the  solar  model 
shown  in  Fig.  13.1  has  been  calculated.  The  en¬ 
ergy  is  produced  by  the  pp  chain  in  a  small  cen¬ 
tral  region.  99  %  of  the  solar  energy  is  produced 
within  a  quarter  of  the  solar  radius. 

The  Sun  produces  energy  at  the  rate  of  4  x 
1026  W,  which  is  equivalent  to  changing  about 
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Fig.  13.2  The  interior  and 
surface  of  the  Sun.  The 
various  kinds  of  solar 
phenomena  are 
schematically  indicated. 
(Based  on  Van  Zandt,  R.P. 
(1977):  Astronomy  for  the 
Amateur,  Planetary 
Astronomy,  Vol.  1,  3rd  edn. 
(published  by  the  author, 
Peoria,  III.)) 


four  million  tonnes  of  mass  into  energy  every 
second.  The  mass  of  the  Sun  is  so  large,  about 
330,000  times  that  of  the  Earth,  that  during  the 
whole  main  sequence  lifetime  of  the  Sun  less  than 
0. 1  %  of  its  mass  is  turned  into  energy. 

When  the  Sun  formed  about  5000  million 
years  ago,  its  composition  was  the  same  every¬ 
where  as  its  present  surface  composition.  Since 
energy  production  is  concentrated  at  the  very  cen¬ 
tre,  hydrogen  is  consumed  most  rapidly  there.  At 
about  a  quarter  of  the  radius  the  hydrogen  abun¬ 
dance  is  still  the  same  as  in  the  surface  layers, 
but  going  inwards  from  that  point  it  rapidly  de¬ 
creases.  In  the  central  core  only  40  %  of  the  ma¬ 
terial  is  hydrogen.  About  5  %  of  the  hydrogen  in 
the  Sun  has  been  turned  into  helium. 

The  radiative  central  part  of  the  Sun  extends 
to  about  70  %  of  the  radius.  At  that  radius  the 
temperature  has  dropped  so  much  that  the  gas 
is  no  longer  completely  ionised.  The  opacity  of 
the  solar  material  then  strongly  increases,  inhibit¬ 


ing  the  propagation  of  radiation.  In  consequence, 
convection  becomes  a  more  efficient  means  of  en¬ 
ergy  transport.  Thus  the  Sun  has  a  convective  en¬ 
velope  (Fig.  13.2). 

The  Solar  Neutrino  Problem  The  central  nu¬ 
clear  reactions  produce  neutrinos  at  several  of  the 
steps  in  the  pp  chain  (see  Fig.  1 1.5).  These  neutri¬ 
nos  can  propagate  freely  through  the  outer  layers, 
and  thus  give  direct  information  about  conditions 
near  the  centre  of  the  Sun.  When  neutrinos  from 
the  Sun  were  first  observed  in  the  1970’s,  their 
number  was  found  to  be  only  about  a  third  of 
what  was  predicted.  This  disagreement  is  called 
the  solar  neutrino  problem. 

In  the  first  experiments  only  neutrinos  from 
the  ppll  and  ppIII  branches  were  observed 
(Sect.  11.3).  Since  only  a  small  fraction  of  the 
solar  luminosity  is  produced  in  these  reactions, 
it  was  not  clear  what  were  the  consequences 
of  these  results  for  solar  models.  In  the  1990’s 
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neutrinos  produced  in  the  ppl  branch,  the  main 
branch  of  the  pp  chain,  were  observed.  Although 
the  disagreement  with  the  standard  models  was 
slightly  smaller  in  these  observations  (about  60  % 
of  the  predicted  flux  was  observed),  the  neutrino 
problem  still  remained. 

Perhaps  the  most  popular  explanation  for  the 
solar  neutrino  problem  was  based  on  neutrino 
oscillations.  According  to  this  explanation,  if 
neutrinos  have  a  small  mass  (about  10-2  eV), 
an  electron  neutrino  could  change  into  a  /x  or 
a  r  neutrino  as  it  passed  through  the  outer  parts 
of  the  Sun.  In  the  early  experiments  only  electron 
neutrinos  were  observed,  representing  only  part 
of  the  total  number  of  neutrinos  produced. 

In  2001  the  Canadian  Sudbury  neutrino  obser¬ 
vatory  (SNO)  announced  results  that  seemed  to 
solve  the  problem.  The  SNO  can  detect  the  flux  of 
all  different  neutrinos  and  the  fraction  of  the  elec¬ 
tron  neutrinos.  The  measurements  showed  that 
the  total  neutrino  flux  was  indeed  consistent  with 
the  predictions  of  the  solar  models,  but  only  35  % 
of  the  flux  consisted  of  electron  neutrinos.  Thus 
65  %  of  the  solar  electron  neutrinos  had  changed 
to  /x  or  r  neutrinos  while  travelling  from  the  Sun 
to  the  Earth.  Similar  observations  were  also  made 
in  a  Japanese  neutrino  observatory.  Among  the 
pioneers  of  the  neutrino  astronomy  are  Raymond 
Davis  and  Masatoshi  Koshiba  who  shared  the  No¬ 
bel  prize  in  physics  in  2002. 

The  solar  neutrino  problem  can  now  be  con¬ 
sidered  to  be  solved.  The  solution  is  a  great  suc¬ 
cess  for  the  standard  solar  model.  But  it  has  also 
revealed  the  existence  of  neutrino  oscillations, 
proving  that  neutrinos  have  a  small  but  non-zero 
rest  mass.  This  shows  that  the  standard  model  of 
particle  physics  needs  to  be  revised  in  some  re¬ 
spects. 

The  Solar  Rotation  As  soon  as  telescopes 
were  introduced,  it  was  observed  from  the  mo¬ 
tions  of  sunspots  that  the  Sun  is  rotating  with 
a  rotational  period  of  about  27  days.  As  early  as 
1630  Christoph  Scheiner  showed  that  there  was 
differential  rotation :  the  rotational  period  near  the 
poles  was  more  than  30  days,  while  it  was  only 
25  days  at  the  equator.  The  rotational  axis  of  the 


Sun  is  inclined  at  7°  with  respect  to  the  plane  of 
the  ecliptic,  so  that  the  North  Pole  of  the  Sun  is 
best  visible  from  the  Earth  in  September. 

The  motions  of  sunspots  still  give  the  best  in¬ 
formation  on  the  rotation  near  the  surface  of  the 
Sun.  Other  surface  features  also  have  been  used 
for  this  purpose.  The  rotational  velocity  has  also 
been  measured  directly  from  the  Doppler  effect. 
The  angular  velocity  is  usually  written 

£2  =  A-  Bsin2ir,  (13.1) 

where  if  is  the  latitude  with  respect  to  the  equa¬ 
tor.  The  measured  values  of  the  coefficients  are 
A  =  14.5  and  B  =  2.9  degrees/day. 

The  rotational  velocity  deeper  down  in  the 
Sun  cannot  be  directly  observed.  In  the  1980’s 
a  method  to  estimate  the  rotation  in  the  interior 
became  available,  when  it  became  possible  to 
measure  the  frequencies  of  solar  oscillations  from 
the  variations  in  spectral  lines.  These  oscillations 
are  essentially  sound  waves  produced  by  turbu¬ 
lent  gas  motions  in  the  convection  zone.  These 
sound  waves  have  calculable  oscillation  periods 
(about  3-12  minutes),  which  depend  on  the  con¬ 
ditions  in  the  solar  interior.  By  comparing  the  ob¬ 
served  and  theoretical  values  one  can  get  infor¬ 
mation  about  the  conditions  deep  inside  the  Sun. 
The  idea  of  the  method  is  the  same  as  that  used 
when  studying  the  interior  of  the  Earth  by  means 
of  waves  from  earthquakes,  and  it  is  therefore 
called  helioseismology. 

Using  helioseismology,  models  for  the  so¬ 
lar  rotation  throughout  the  convection  zone  have 
been  deduced.  It  appears  that  the  angular  velocity 
in  the  whole  convection  zone  is  almost  the  same 
as  at  the  surface,  although  it  decreases  slightly 
with  radius  near  the  equator,  and  increases  near 
the  poles.  The  angular  velocity  of  the  radiative 
core  is  still  uncertain,  but  there  are  indications 
that  the  core  is  rotating  as  a  solid  body  with  ap¬ 
proximately  the  average  surface  angular  velocity. 
At  the  bottom  of  the  convection  zone  there  is  a 
thin  layer  known  as  the  tachocline ,  where  the  an¬ 
gular  velocity  changes  rapidly  with  radius.  The 
internal  solar  rotation  according  to  the  helioseis- 
mological  studies  is  shown  in  Fig.  13.3. 
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The  solar  differential  rotation  is  maintained  by 
gas  motions  in  the  convection  zone.  Explaining 
the  observed  behaviour  is  a  difficult  problem  that 
is  not  yet  completely  understood. 


1 3.2  The  Atmosphere 

The  solar  atmosphere  is  divided  into  the  photo¬ 
sphere  and  the  chromosphere.  Outside  the  actual 
atmosphere,  the  corona  extends  much  further  out¬ 
wards. 

The  Photosphere  The  innermost  layer  of  the 
atmosphere  is  the  photosphere,  which  is  only 
about  300-500  km  thick.  The  photosphere  is 
the  visible  surface  of  the  Sun,  where  the  den¬ 
sity  rapidly  increases  inwards,  hiding  the  interior 
from  sight.  The  temperature  at  the  inner  bound¬ 
ary  of  the  photosphere  is  8000  K  and  at  the  outer 


Fig.  13.3  The  rotation  rate  of  the  Sun  inferred  from  he- 
lioseismological  observations.  The  equator  is  at  the  hori¬ 
zontal  axis  and  the  pole  is  at  the  vertical  axis,  both  axes 
being  labelled  by  fractional  radius.  Some  contours  are  la¬ 
belled  in  nHz,  and,  for  clarity,  selected  contours  are  shown 
as  bold.  (430  nHz  is  about  26.9  days.)  The  dashed  circle  is 
at  the  base  of  the  convection  zone  and  the  tick  marks  at  the 
edge  of  the  outer  circle  are  at  latitudes  15°,  30°,  45°,  60°, 
75° .  The  shaded  area  indicates  the  region  in  the  Sun  where 
no  reliable  inference  can  be  made  with  present  data.  The 
slanted  dotted  lines  are  at  an  angle  of  27°  with  the  rotation 
axis.  (Adapted  from  Schou  et  al.  1998.)  (J.  Christensen- 
Dalsgaard  2007,  astro-ph/06 10942,  Fig.  2) 


boundary  4500  K.  Near  the  edge  of  the  solar 
disk,  the  line  of  sight  enters  the  photosphere  at 
a  very  small  angle  and  never  penetrates  to  large 
depths.  Near  the  edges  one  therefore  only  sees 
light  from  the  cooler,  higher  layers.  For  this  rea¬ 
son,  the  edges  appear  darker;  this  phenomenon 
is  known  as  limb  darkening.  Both  the  continuous 
spectrum  and  the  absorption  lines  are  formed  in 
the  photosphere,  but  the  light  in  the  absorption 
lines  comes  from  higher  layers  and  therefore  the 
lines  appear  dark. 

The  solar  convection  is  visible  on  the  surface 
as  the  granulation  (Fig.  13.4),  an  uneven,  con¬ 
stantly  changing  granular  pattern.  At  the  bright 
centre  of  each  granule,  gas  is  rising  upward,  and 
at  the  darker  granule  boundaries,  it  is  sinking 
down  again.  The  size  of  a  granule  seen  from 
the  Earth  is  typically  1",  corresponding  to  about 
1000  km  on  the  solar  surface.  There  is  also 
a  larger  scale  convection  called  super  granulation 
in  the  photosphere.  The  cells  of  the  supergranu¬ 
lation  may  be  about  V  in  diameter.  The  observed 
velocities  in  the  supergranulation  are  mainly  di¬ 
rected  along  the  solar  surface. 

The  Chromosphere  Outside  the  photosphere 
there  is  a  layer,  perhaps  about  500  km  thick, 
where  the  temperature  increases  from  4500  K  to 
about  6000  K,  the  chromosphere.  Outside  this 
layer,  there  is  a  transition  region  of  a  few  thou¬ 
sand  kilometres,  where  the  chromosphere  gradu¬ 
ally  goes  over  into  the  corona.  In  the  outer  parts 
of  the  transition  region,  the  kinetic  temperature  is 
already  about  106  K. 

Normally  the  chromosphere  is  not  visible,  be¬ 
cause  its  radiation  is  so  much  weaker  than  that 
of  the  photosphere.  However,  during  total  solar 
eclipses,  the  chromosphere  shines  into  view  for 
a  few  seconds  at  both  ends  of  the  total  phase, 
when  the  Moon  hides  the  photosphere  com¬ 
pletely.  The  chromosphere  then  appears  as  a  thin 
reddish  sickle  or  ring. 

During  eclipses  the  chromospheric  spectrum, 
called  the  flash  spectrum ,  can  be  observed 
(Fig.  13.5).  It  is  an  emission  line  spectrum  with 
more  than  3000  identified  lines.  Brightest  among 
these  are  the  lines  of  hydrogen,  helium  and  cer¬ 
tain  metals. 
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Fig.  13.4  The  granulation 
of  the  solar  surface.  The 
granules  are  produced  by 
streaming  gas.  Their 
typical  diameter  is 
1000  km.  The  picture  was 
taken  in  May  13,  2005  with 
the  Swedish  one  metre 
solar  telescope  on  La 
Palma.  (Photograph  Tom 
Berger,  Royal  Swedish 
Academy  of  Sciences, 
ISP/RSAS) 
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Fig.  13.5  Flash  spectrum 
of  the  solar  chromosphere, 
showing  bright  emission 
lines 


One  of  the  strongest  chromospheric  emission 
lines  is  the  hydrogen  B aimer  a  line  (Fig.  13.6) 
at  a  wavelength  of  656.3  nm.  Since  the  Ha  line 
in  the  normal  solar  spectrum  is  a  very  dark  ab¬ 
sorption  line,  a  photograph  taken  at  this  wave¬ 
length  will  show  the  solar  chromosphere.  For 
this  purpose,  one  uses  narrow-band  filters  letting 
through  only  the  light  in  the  Ha  line.  The  re¬ 
sulting  pictures  show  the  solar  surface  as  a  mot¬ 
tled,  wavy  disk.  The  bright  regions  are  usually 
the  size  of  a  supergranule,  and  are  bounded  by 
spicules  (Fig.  13.7).  These  are  flamelike  struc¬ 
tures  rising  up  to  10,000  km  above  the  chromo¬ 
sphere,  and  lasting  for  a  few  minutes.  Against 


the  bright  surface  of  the  Sun,  they  look  like 
dark  streaks;  at  the  edges,  they  look  like  bright 
flames. 

The  Corona  The  chromosphere  gradually  goes 
over  into  the  corona.  The  corona  is  also  best  seen 
during  total  solar  eclipses  (Fig.  13.8).  It  then  ap¬ 
pears  as  a  halo  of  light  extending  out  to  a  few 
solar  radii.  The  surface  brightness  of  the  corona 
is  about  that  of  the  full  moon,  and  it  is  therefore 
difficult  to  see  next  to  the  bright  photosphere. 

The  inner  part  of  the  corona,  the  K  corona, 
has  a  continuous  spectrum  formed  by  the  scatter¬ 
ing  of  the  photospheric  light  by  electrons.  Fur- 
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Fig.  13.6  The  solar 
surface  in  the  hydrogen  Hq, 
line.  Active  regions  appear 
bright;  the  dark  filaments 
are  prominences.  Limb 
darkening  has  been 
removed  artificially,  which 
brings  to  light  spicules  and 
prominences  above  the 
limb.  The  photograph  was 
taken  in  October  1997. 
(Photograph  Big  Bear 
Solar  Observatory/NJIT) 


Fig.  13.7  Spicules, 
flamelike  uprisings  near  the 
edge  of  the  solar  disc. 
(Photograph  Big  Bear 
Solar  Observatory) 


ther  out,  a  few  solar  radii  from  the  surface,  is  the 
F  corona,  which  has  a  spectrum  showing  Fraun¬ 
hofer  absorption  lines.  The  light  of  the  F  corona 
is  sunlight  scattered  by  dust. 

In  the  latter  part  of  the  19th  century  strong 
emission  lines,  which  did  not  correspond  to  those 
of  any  known  element,  were  discovered  in  the 


corona  (Fig.  13.9).  It  was  thought  that  a  new  ele¬ 
ment,  called  coronium,  had  been  found — a  little 
earlier,  helium  had  been  discovered  in  the  Sun 
before  it  was  known  on  Earth.  About  1940,  it 
was  established  that  the  coronal  lines  were  due  to 
highly  ionised  atoms,  e.g.  thirteen  times  ionised 
iron.  Much  energy  is  needed  to  remove  so  many 
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Fig.  13.8  Previously,  the 
corona  could  be  studied 
only  during  total  solar 
eclipses.  The  picture  is 
from  the  eclipse  on 
March  7,  1970.  Nowadays 
the  corona  can  be  studied 
continuously  using 
a  device  called  the 
coronagraph 


Fig.  13.9  The  presence  of 
lines  from  highly  ionised 
atoms  in  the  coronal 
spectrum  shows  that  the 
temperature  of  the  corona 
has  to  be  very  high 
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electrons  from  the  atoms.  The  entire  corona  has 
to  have  a  temperature  of  about  a  million  de¬ 
grees. 

A  continuous  supply  of  energy  is  needed  in 
order  to  maintain  the  high  temperature  of  the 
corona.  According  to  earlier  theories,  the  energy 
came  in  the  form  of  acoustic  or  magnetohydrody¬ 
namic  shock  waves  generated  at  the  solar  surface 
by  the  convection.  Most  recently,  heating  by  elec¬ 


tric  currents  induced  by  changing  magnetic  fields 
has  been  suggested.  Heat  would  then  be  gener¬ 
ated  in  the  corona  almost  like  in  an  ordinary  light 
bulb. 

In  spite  of  its  high  temperature  the  coronal  gas 
is  so  diffuse  that  the  total  energy  stored  in  it  is 
small.  It  is  constantly  streaming  outwards,  gradu¬ 
ally  becoming  a  solar  wind ,  which  carries  a  flux 
of  particles  away  from  the  Sun.  The  gas  lost  in 


290 


13  The  Sun 


Fig.  13.10  The  sunspots 
are  the  form  of  solar 
activity  that  has  been 
known  for  the  longest  time. 
The  photograph  was  taken 
with  the  Swedish  1 -meter 
Solar  Telescope  in  July 
2002.  (Photograph  Royal 
Swedish  Academy  of 
Sciences) 


this  way  is  replaced  with  new  material  from  the 
chromosphere. 


13.3  Solar  Activity 

Sunspots  The  clearest  visible  sign  of  solar  ac¬ 
tivity  are  the  sunspots .  The  existence  of  sunspots 
has  been  known  for  long  (Fig.  13.10),  since  the 
largest  ones  can  be  seen  with  the  naked  eye  by 
looking  at  the  Sun  through  a  suitably  dense  layer 
of  fog.  More  precise  observations  became  avail¬ 
able  beginning  in  the  17th  century,  when  Galilei 
started  to  use  the  telescope  for  astronomical  ob¬ 
servations. 

A  sunspot  looks  like  a  ragged  hole  in  the  solar 
surface.  In  the  interior  of  the  spot  there  is  a  dark 
umbra  and  around  it,  a  less  dark  penumbra.  By 
looking  at  spots  near  the  edge  of  the  solar  disk,  it 
can  be  seen  that  the  spots  are  slightly  depressed 
with  respect  to  the  rest  of  the  surface.  The  surface 
temperature  in  a  sunspot  is  about  1500  K  below 


that  of  its  surroundings,  which  explains  the  dark 
colour  of  the  spots. 

The  diameter  of  a  typical  sunspot  is  about 
10,000  km  and  its  lifetime  is  from  a  few  days  to 
several  months,  depending  on  its  size.  The  larger 
spots  are  more  likely  to  be  long-lived.  Sunspots 
often  occur  in  pairs  or  in  larger  groups.  By  fol¬ 
lowing  the  motions  of  the  spots,  the  period  of  ro¬ 
tation  of  the  Sun  can  be  determined. 

The  variations  in  the  number  of  sunspots 
have  been  followed  for  almost  250  years.  The 
frequency  of  spots  is  described  by  the  Zurich 
sunspot  number  Z : 

Z  =  C(S+  10G),  (13.2) 

where  S  is  the  number  of  spots  and  G  the  num¬ 
ber  of  spot  groups  visible  at  a  particular  time.  C  is 
a  constant  depending  on  the  observer  and  the  con¬ 
ditions  of  observation. 

In  Fig.  13.11,  the  variations  in  the  Zurich 
sunspot  number  between  the  18th  century  and 
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Fig.  13.11  The  Zurich  sunspot  number  from  1700  to  2001.  Prior  to  1700  there  are  only  occasional  observations.  The 
number  of  sunspots  and  spot  groups  varies  with  a  period  of  about  1 1  years 


Fig.  1 3.1 2  Left :  During  a  sunspot  maximum  (Sept  27,  2001)  the  Sun  is  dotted  by  numerous  spots.  Right.  Exactly  seven 
years  later  no  spots  are  seen,  although  minimum  phase  should  already  be  over.  (Photos  SOHO/MIDI) 


the  present  are  shown.  Evidently  the  number  of 
spots  varies  with  an  average  period  of  about 
11  years.  The  actual  period  may  be  between  7 
and  17  years.  In  the  past  decades,  it  has  been 
about  10.5  years.  Usually  the  activity  rises  to  its 
maximum  in  about  3-4  years,  and  then  falls  off 
slightly  more  slowly  (Fig.  13.12).  The  period  was 
first  noted  by  Samuel  Heinrich  Schwabe  in  1843. 

The  variations  in  the  number  of  sunspots  have 
been  fairly  regular  since  the  beginning  of  the 
18th  century.  However,  in  the  17th  century  there 
were  long  intervals  when  there  were  essentially 
no  spots  at  all.  This  quiescent  period  is  called  the 
Maunder  minimum.  The  similar  Sporer  minimum 
occurred  in  the  15th  century,  and  other  quiet  in¬ 
tervals  have  been  inferred  at  earlier  epochs.  The 
mechanism  behind  these  irregular  variations  in 
solar  activity  is  not  yet  understood. 


The  magnetic  fields  in  sunspots  are  measured 
on  the  basis  of  the  Zeeman  effect,  and  may  be 
as  large  as  0.45  tesla.  (The  magnetic  field  of 
the  Earth  at  the  equator  is  0.03  mT.)  The  strong 
magnetic  field  inhibits  convective  energy  trans¬ 
port,  which  explains  the  lower  temperature  of  the 
spots. 

Sunspots  often  occur  in  pairs  where  the  com¬ 
ponents  have  opposite  polarity.  The  structure  of 
such  bipolar  groups  can  be  understood  if  the  field 
rises  into  a  loop  above  the  solar  surface,  connect¬ 
ing  the  components  of  the  pair.  If  gas  is  streaming 
along  such  a  loop,  it  becomes  visible  as  a  loop 
prominence  (Fig.  13.13). 

The  periodic  variation  in  the  number  of  sun¬ 
spots  reflects  a  variation  in  the  general  solar  mag¬ 
netic  field.  At  the  beginning  of  a  new  activity  cy¬ 
cle  spots  first  begin  to  appear  at  latitudes  of  about 
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Fig.  13.13  In  pairs  of 
sunspots  the  magnetic  field 
lines  form  a  loop  outside 
the  solar  surface.  Material 
streaming  along  the  held 
lines  may  form  loop 
prominences.  Loops  of 
different  size  can  be  seen 
in  this  image,  which  the 
Trace  satellite  took  in 
1999.  (Photo  Trace) 


Fig.  13.14  At  the 

beginning  of  an  activity 
cycle,  sunspots  appear  at 
high  latitudes.  As  the  cycle 
advances  the  spots  move 
towards  the  equator. 
(Diagram  by  H.  Virtanen, 
based  on  Greenwich 
Observatory  observations) 


±40°.  As  the  cycle  advances,  the  spots  move 
closer  to  the  equator.  The  characteristic  pattern 
in  which  spots  appear,  shown  in  Fig.  13.14,  is 
known  as  the  butterfly  diagram.  Spots  of  the  next 
cycle  begin  to  appear  while  those  of  the  old  one 
are  still  present  near  the  equator.  Spots  belonging 
to  the  new  cycle  have  a  polarity  opposite  to  that 
of  the  old  ones.  (Spots  in  opposite  hemispheres 
also  have  opposite  polarity.)  Since  the  field  is 
thus  reversed  between  consecutive  1 1  year  cycles 
the  complete  period  of  solar  magnetic  activity  is 
22  years. 

The  following  general  qualitative  description 
of  the  mechanism  of  the  solar  cycle  was  pro¬ 
posed  by  Horace  W.  Babcock.  Starting  at  a  so¬ 
lar  minimum,  the  field  will  be  of  a  generally 
dipolar  character.  Because  a  conducting  medium, 
such  as  the  outer  layers  of  the  Sun,  cannot  move 
across  the  field  lines,  these  will  be  frozen  into  the 


plasma  and  carried  along  by  it.  Thus  the  differ¬ 
ential  rotation  will  draw  the  field  into  a  tight  spi¬ 
ral  (Fig.  13.15).  In  the  process  the  field  becomes 
stronger,  and  this  amplification  will  be  a  function 
of  latitude. 

When  the  subsurface  field  becomes  strong 
enough,  it  gives  rise  to  a  “magnetic  buoyancy” 
that  lifts  ropes  of  magnetic  flux  above  the  sur¬ 
face.  This  happens  first  at  a  latitude  about  40°, 
and  later  at  lower  latitudes.  These  protruding  flux 
ropes  expand  into  loops  forming  bipolar  groups 
of  spots.  As  the  loops  continue  expanding  they 
make  contact  with  the  general  dipolar  field,  which 
still  remains  in  the  polar  regions.  This  leads  to 
a  rapid  reconnection  of  the  field  lines  neutralis¬ 
ing  the  general  field.  The  final  result  when  ac¬ 
tivity  subsides  is  a  dipolar  field  with  a  polarity 
opposite  the  initial  one. 
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Fig.  13.15  Because  the  Sun  rotates  faster  at  the  equator  than  at  the  poles,  the  held  lines  of  the  solar  magnetic  held  are 
drawn  out  into  a  tight  spiral 


Fig.  13.16  A  quiet  Sun  in  August  2006  around  the 
last  sunspot  minimum.  Both  pictures  were  taken  by  the 
Michelson  Doppler  Imager  on  the  SOHO  satellite.  On  the 

Thus  the  Babcock  model  accounts  for  the  but¬ 
terfly  diagram,  the  formation  of  bipolar  magnetic 
regions  and  the  general  field  reversal  between  ac¬ 
tivity  maxima.  Nevertheless,  it  remains  an  essen¬ 
tially  phenomenological  model,  and  alternative 
scenarios  have  been  proposed.  In  dynamo  the¬ 
ory  quantitative  models  for  the  origin  of  mag¬ 
netic  fields  in  the  Sun  and  other  celestial  bodies 
are  studied.  In  these  models  the  field  is  produced 
by  convection  and  differential  rotation  of  the  gas. 
A  completely  satisfactory  dynamo  model  for  the 
solar  magnetic  cycle  has  not  yet  been  found.  For 
example,  it  is  not  yet  known  whether  the  field  is 
produced  everywhere  in  the  convection  zone,  or 
just  in  the  boundary  layer  between  the  convective 


left  the  Sun  in  visible  light,  on  the  right  a  magnetogram, 
which  shows  the  opposite  polarities  of  the  magnetic  fields 
as  black  and  white.  (Photo  SOHO/NASA/ESA) 

and  radiative  regions,  as  some  indications  sug¬ 
gest. 

Other  Activity  The  Sun  shows  several  other 
types  of  surface  activity:  faculae  and  plages ; 
prominences ;  flares. 

The  faculae  and  plages  are  local  bright  regions 
in  the  photosphere  and  chromosphere,  respec¬ 
tively.  Observations  of  the  plages  are  made  in  the 
hydrogen  or  the  calcium  K  lines  (Fig.  13.16). 
The  plages  usually  occur  where  new  sunspots 
are  forming,  and  disappear  when  the  spots  disap¬ 
pear.  Apparently  they  are  caused  by  the  enhanced 
heating  of  the  chromosphere  in  strong  magnetic 
fields. 
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Fig.  13.17  (a)  Quiescent  “hedgerow”  prominence.  (Photograph  Sacramento  Peak  Observatory.)  (b)  Larger  eruptive 
prominence.  (Photograph  Big  Bear  Solar  Observatory) 


The  prominences  are  among  the  most  spec¬ 
tacular  solar  phenomena.  They  are  glowing  gas 
masses  in  the  corona,  easily  observed  near  the 
edge  of  the  Sun.  There  are  several  types  of  promi¬ 
nences  (Fig.  13.17):  the  quiescent  prominences, 
where  the  gas  is  slowly  sinking  along  the  mag¬ 
netic  field  lines;  loop  prominences,  connected 
with  magnetic  field  loops  in  sunspots;  and  the 


rarer  eruptive  prominences,  where  gas  is  violently 
thrown  outwards. 

The  temperature  of  prominences  is  about 
10,000-20,000  K.  In  Ha  photographs  of  the  chro¬ 
mosphere,  the  prominences  appear  as  dark  fila¬ 
ments  against  the  solar  surface  (Fig.  13.6). 

The  flare  outbursts  are  among  the  most  violent 
forms  of  solar  activity  (Fig.  13.18).  They  appear 
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Fig.  13.18  A  violent  flare 
near  some  small  sunspots. 
(Photograph  Sacramento 
Peak  Observatory) 


as  bright  flashes,  lasting  from  one  second  to  just 
under  an  hour.  In  the  flares  a  large  amount  of  en¬ 
ergy  stored  in  the  magnetic  field  is  suddenly  re¬ 
leased.  The  detailed  mechanism  is  not  yet  known. 

Flares  can  be  observed  at  all  wavelengths.  The 
hard  X-ray  emission  of  the  Sun  may  increase  hun¬ 
dredfold  during  a  flare.  Several  different  types 
of  flares  are  observed  at  radio  wavelengths.  The 
emission  of  solar  cosmic  ray  particles  also  rises. 

Prominences  and  flares  are  often  accompa¬ 
nied  with  coronal  mass  ejections  Fast  moving 
(500-2000  km/s)  clouds  cause  shock  waves  ac¬ 
celerating  particles  to  very  high  velocities.  Parti¬ 
cles  in  flares  are  the  fastest  (v  ~  0.3c).  Particles  in 
a  magnetic  cloud  travelling  at  the  velocity  of  the 
mass  ejection  reach  the  Earth  in  a  couple  of  days. 
Particles  activated  by  the  shock  wave  arrive  con¬ 
tinuously  affecting  the  space  weather  and  causing 
magnetic  storms. 

The  flares  and  coronal  mass  ejections  give  rise 
to  disturbances  on  the  Earth.  The  X-rays  cause 
changes  in  the  ionosphere,  which  affect  short¬ 
wave  radio  communications.  The  particles  give 
rise  to  strong  auroras  when  they  enter  the  Earth’s 
magnetic  field  a  few  days  after  the  outburst. 

Solar  Radio  Emission  The  Sun  is  the  strongest 
radio  source  in  the  sky  and  has  been  observed 


Fig.  13.19  An  X-ray  picture  of  the  active  Sun,  taken  by 
the  Japanese  Yohkoh  satellite  in  1999,  around  the  last 
maximum  of  sunspot  activity.  (Photo  JAXA) 


since  the  1940’s.  In  contrast  to  optical  emis¬ 
sion  the  radio  picture  of  the  Sun  shows  a  strong 
limb  brightening.  This  is  because  the  radio  ra¬ 
diation  comes  from  the  upper  layers  of  the  at¬ 
mosphere.  Since  the  propagation  of  radio  waves 
is  obstructed  by  free  electrons,  the  high  electron 
density  near  the  surface  prevents  radio  radiation 
from  getting  out.  Shorter  wavelengths  can  propa- 
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Fig.  13.20  The  SOHO 
(Solar  and  Heliospheric 
Observatory)  satellite 
keeps  a  constant  watch  on 
the  Sun  and  its 
surroundings  in  many 
wavelengths.  Here  the 
LASCO  (Large  Angle  and 
Spectrometric 
Coronagraph)  instrument 
sees  a  large  Coronal  Mass 
Ejection  erupting  from  the 
Sun.  The  surface  of  the 
Sun  is  covered  by  a  disk, 
and  the  size  and  position  of 
the  Sun  is  indicated  by  the 
white  circle.  (Photo 
SOHO/NASA/ESA) 
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gate  more  easily,  and  thus  millimetre- wavelength 
observations  give  a  picture  of  deeper  layers  in  the 
atmosphere,  whereas  the  long  wavelengths  show 
the  upper  layers.  (The  10  cm  emission  originates 
in  the  upper  layers  of  the  chromosphere  and  the 
1  m  emission,  in  the  corona.) 

The  Sun  looks  different  at  different  wave¬ 
lengths.  At  long  wavelengths  the  radiation  is 
coming  from  the  largest  area,  and  its  electron 
temperature  is  about  106  K,  since  it  originates 
in  the  corona. 

The  radio  emission  of  the  Sun  is  constantly 
changing  according  to  solar  activity.  During  large 
storms  the  total  emission  may  be  100,000  times 
higher  than  normal.  Especially  the  motion  of 
shock  waves  can  be  followed  by  the  radio  emis¬ 
sion,  since  the  electrons  accelerated  by  the  shock 
generate  radio  emission  (type  II  radio  bursts). 

X-ray  and  UV  Radiation  The  X-ray  emis¬ 
sion  of  the  Sun  is  also  related  to  active  regions 
(Fig.  13.19).  Signs  of  activity  are  bright  X-ray  re¬ 
gions  and  smaller  X-ray  bright  points ,  which  last 
for  around  ten  hours.  The  inner  solar  corona  also 


emits  X-rays.  Near  the  solar  poles  there  are  coro¬ 
nal  holes ,  where  the  X-ray  emission  is  weak. 

Ultraviolet  pictures  of  the  solar  surface  show  it 
as  much  more  irregular  than  it  appears  in  visible 
light.  Most  of  the  surface  does  not  emit  much  UV 
radiation,  but  there  are  large  active  regions  that 
are  very  bright  in  the  ultraviolet. 

Several  satellites  have  made  observations  of 
the  Sun  at  UV  and  X-ray  wavelengths,  for  ex¬ 
ample  Soho  (Solar  and  Heliospheric  Observa¬ 
tory,  1995-,  Fig.  13.20).  These  observations  have 
made  possible  detailed  studies  of  the  outer  lay¬ 
ers  of  the  Sun.  Observations  of  other  stars  have 
revealed  coronae,  chromospheres  and  magnetic 
variations  similar  to  those  in  the  Sun.  Thus  the 
new  observational  techniques  have  brought  the 
physics  of  the  Sun  and  the  stars  nearer  to  each 
other. 


1 3.4  Solar  Wind  and  Space  Weather 

A  continuous  stream  of  charged  particles  is  com¬ 
ing  from  the  Sun  as  the  solar  wind,  varying 
with  the  solar  activity.  The  coronal  mass  ejec- 
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tions  are  considered  as  “disturbances”.  Around  a 
minimum  mass  ejections  may  occur  a  few  times 
a  week;  around  a  maximum  there  may  be  sev¬ 
eral  ejections  every  day.  The  solar  wind  consists 
mainly  of  electrons  and  protons.  At  the  distance 
of  the  Earth  there  are  typically  5-10  particles  per 
cubic  centimetre.  There  are  also  some  nuclei  of 
helium  atoms.  The  velocity  is  the  solar  wind  close 
to  the  poles  of  the  Sun  is  about  800  km/s  but 
around  the  equator  only  about  300  km/s.  At  the 
distance  of  the  Earth  the  velocity  of  the  particles 
is  about  500  km/s  on  the  average.  The  Sun  loses 
2  —  3  x  10 ~14Mq  of  its  mass  every  year  as  solar 
wind. 

Magnetic  fields  of  planets  direct  the  motions 
of  the  particles  of  the  solar  wind  (Sect.  7.7).  The 
auroras  are  the  most  outstanding  phenomenon  on 
the  Earth.  Currents  induced  by  the  charged  parti¬ 
cles  can  also  have  considerable  negative  effects. 
They  can  damage  satellites  and  even  electric  net¬ 
works.  The  most  serious  accident  this  far  hap¬ 
pened  in  1989  in  Quebec  when  a  high  voltage  net¬ 
work  was  damaged  and  millions  of  people  had  to 
survive  without  electricity  for  several  hours. 

Space  weather  mean  the  interaction  of  the  so¬ 
lar  wind  and  the  magnetic  environment  of  the 
Earth.  Because  of  its  effects  it  is  now  followed 
actively.  The  space  weather  affects  the  upper  at¬ 
mosphere  but  it  is  still  debated  whether  it  is  con¬ 
nected  to  variations  of  the  weather  and  climate. 


13.5  Example 

Example  13.1  Assume  that  the  Sun  converts 
0.8  %  of  its  mass  into  energy.  Find  an  upper  limit 


for  the  age  of  the  Sun,  assuming  that  its  luminos¬ 
ity  has  remained  constant. 

The  total  amount  of  energy  released  is 

E  =  me2  =  0.008  Mqc2 
=  0.008  x  2  x  1030  kg  x  (3  x  108  ms-1)2 
=  1.4  x  1045  J. 

The  time  needed  to  radiate  this  energy  is 

^  _  E  _  1.4  x  1045  J 
1  ~  ~  3.9  x  1026  W 

=  3.6  x  1018  s  ~  10u  years. 

1 3.6  Exercises 

Exercise  13.1  The  solar  constant,  i.e.  the  flux 
density  of  the  solar  radiation  at  the  distance  of 
the  Earth  is  1370  W  m-2. 

(a)  Find  the  flux  density  on  the  surface  of  the 
Sun,  when  the  apparent  diameter  of  the  Sun 
is  32'. 

(b)  How  many  square  metres  of  solar  surface  is 
needed  to  produce  1000  megawatts? 

Exercise  13.2  Some  theories  have  assumed  that 
the  effective  temperature  of  the  Sun  4.5  billion 
years  ago  was  5000  K  and  radius  1.02  times 
the  current  radius.  What  was  the  solar  constant 
then?  Assume  that  the  orbit  of  the  Earth  has  not 
changed. 
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Stars  with  changing  magnitudes  are  called  vari¬ 
ables  (Fig.  14.1).  Variations  in  the  brightness  of 
stars  were  first  noted  in  Europe  at  the  end  of  the 
16th  century,  when  Tycho  Brahe’s  supernova  lit 
up  (1572)  and  the  regular  light  variation  of  the 
star  o  Ceti  (Mira)  was  observed  (1596).  The  num¬ 
ber  of  known  variables  has  grown  steadily  as  ob¬ 
servational  precision  has  improved  (Fig.  14.2). 
The  most  recent  catalogues  contain  about  40,000 
stars  known  or  suspected  to  be  variable. 

Strictly  speaking,  all  stars  are  variable.  As 
was  seen  in  Chap.  12,  the  structure  and  bright¬ 
ness  of  a  star  change  as  it  evolves.  Although 
these  changes  are  usually  slow,  some  evolution¬ 
ary  phases  may  be  extremely  rapid.  In  certain 
evolutionary  stages,  there  will  also  be  periodic 
variations,  for  example  pulsations  of  the  outer 
layers  of  a  star. 

Small  variations  in  stellar  brightness  are  also 
caused  by  hot  and  cool  spots  on  a  star’s  surface, 
appearing  and  disappearing  as  it  rotates  about  its 
axis.  The  luminosity  of  the  Sun  changes  slightly 
because  of  the  sunspots.  Probably  there  are  simi¬ 
lar  spots  on  almost  all  stars. 

Initially  stellar  brightnesses  were  determined 
visually  by  comparing  stars  near  each  other.  Later 
on,  comparisons  were  made  on  photographic 
plates.  At  present  the  most  accurate  observations 
are  made  photoelectrically  or  using  a  CCD  cam¬ 
era.  The  magnitude  variation  as  a  function  of  time 
is  called  the  lightcurve  of  a  star  (Fig.  14.3).  From 
it  one  obtains  the  amplitude  of  the  magnitude 
variation  and  its  period ,  if  the  variation  is  peri¬ 
odic. 


The  basic  reference  catalogue  of  variable  stars 
is  the  General  Catalogue  of  Variable  Stars  by  the 
Soviet  astronomer  Boris  Vasilyevich  Kukarkin. 
New,  supplemented  editions  appear  at  times;  the 
fourth  edition  published  in  1985-1987,  edited  by 
P.N.  Kholopov,  contains  about  32,000  variables 
of  the  Milky  Way  galaxy. 


14.1  Classification 

When  a  new  variable  is  discovered,  it  is  given 
a  name  according  to  the  constellation  in  which 
it  is  located.  The  name  of  the  first  variable  in 
a  given  constellation  is  R,  followed  by  the  name 
of  the  constellation  (in  the  genitive  case).  The 
symbol  for  the  second  variable  is  S,  and  so  on, 
to  Z.  After  these,  the  two-letter  symbols  RR, 
RS, . . .  to  ZZ  are  used,  and  then  A  A  to  QZ  (omit¬ 
ting  I).  This  is  only  enough  for  334  variables, 
a  number  that  has  long  been  exceeded  in  most 
constellations.  The  numbering  therefore  contin¬ 
ues:  V335,  V336,  etc.  (V  stands  for  variable). 
For  some  stars  the  established  Greek  letter  sym¬ 
bol  has  been  kept,  although  they  have  later  been 
found  to  be  variable  (e.g.  8  Cephei). 

The  classification  of  variables  is  based  on  the 
shape  of  the  lightcurve,  and  on  the  spectral  class 
and  observed  radial  motions.  The  spectrum  may 
also  contain  dark  absorption  lines  from  material 
around  the  star.  Observations  can  be  made  out¬ 
side  the  optical  region  as  well.  Thus  the  radio 
emission  of  some  variables  (e.g.  flare  stars)  in¬ 
creases  strongly,  simultaneously  with  their  opti¬ 
cal  brightness.  Examples  of  radio  and  X-ray  vari- 
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Fig.  14.1  The  variables 
are  stars  changing  in 
brightness.  Two  variables 
in  Scorpius,  R  and  S  Sco. 
(Photograph  Yerkes 
Observatory) 


Fig.  14.2  The  location  of 
variables  in  the  HR 
diagram 


-6  V 


-4 


-2  V 


0  f 


+2  - 


+4 


+6  - 


+8 


+10  - 


/ 3  Cephei 
stars 


Dwarf 

novae 


Classical 

cepheids^^  RVTauri  stars 

Jwm  I 


RR  Lyrae 
stars 


W  Virginis 
stars  Long 

period 

variables 


Magnetic  variables 


T  Tauri  stars 


Dwarf 
cepheids  Sun 


Semi¬ 

regular 

variables 


Flare 

stars 


BO 


AO 


F0 


GO 


K0 


MO 


ables  are  the  radio  and  X-ray  pulsars,  and  the  Variables  are  usually  divided  into  three  main 
X-ray  bursters.  types:  pulsating ,  eruptive  and  eclipsing  variables. 
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Fig.  14.3  The  variation  of  brightness,  colour  and  size  of 
a  cepheid  during  its  pulsation 


Table  14.1  The  main  properties  of  pulsating  variables 
(N,  number  of  stars  of  the  given  type  in  Kukarkin’s  cat¬ 
alogue,  P,  pulsation  period  in  days,  Am,  pulsation  ampli¬ 
tude  in  magnitudes) 


Variable 

N 

P 

Spectrum 

Am 

Classical  cepheids 
(S  Cep,  W  Vir) 

800 

1-135 

F-K I 

<2 

rsj 

RR  Lyrae 

6100 

<1 

A-F8 

<2 

Dwarf  cepheids 
(5  Scuti) 

200 

0.05-7 

A-F 

<1 

P  Cephei 

90 

0. 1-0.6 

B1-B3  III 

>0.3 

Mira  variables 

5800 

80-1000 

M-C 

>2.5 

RV  Tauri 

120 

30-150 

G-M 

<4 

Semiregular 

3400 

30-1000 

K-C 

<4.5 

Irregular 

2300 

— 

K-M 

<2 

variables  range  from  about  1  day  to  25  d,  and  the 
amplitudes  are  less  than  0.1  mag. 
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The  eclipsing  variables  are  binary  systems  in 
which  the  components  periodically  pass  in  front 
of  each  other.  In  these  variables  the  light  varia¬ 
tions  do  not  correspond  to  any  physical  change 
in  the  stars.  They  have  been  treated  in  connection 
with  the  binary  stars.  In  the  other  variables  the 
brightness  variations  are  intrinsic  to  the  stars.  In 
the  pulsating  variables  the  variations  are  due  to 
the  expansion  and  contraction  of  the  outer  lay¬ 
ers.  These  variables  are  giants  and  supergiants 
that  have  reached  an  unstable  stage  in  their  evolu¬ 
tion.  The  eruptive  variables  are  usually  faint  stars 
ejecting  mass.  They  are  mostly  members  of  close 
binary  systems  in  which  mass  is  transferred  from 
one  component  to  the  other. 

In  addition  a  few  rotating  variables  are  known, 
where  the  brightness  variations  are  due  to  an 
uneven  temperature  distribution  on  the  surface, 
starspots  coming  into  sight  when  the  star  rotates. 
Such  stars  may  be  quite  common — after  all,  our 
Sun  is  a  weak  rotating  variable.  The  most  promi¬ 
nent  group  of  rotating  variables  are  the  magnetic 
A  stars  (e.g.  the  a2  Canum  Venaticorum  stars). 
These  stars  have  strong  magnetic  fields  that  may 
be  giving  rise  to  starspots.  The  periods  of  rotating 


The  wavelengths  of  the  spectral  lines  of  the  pul¬ 
sating  variables  change  along  with  the  brightness 
variations  (Table  14.1).  These  changes  are  due  to 
the  Doppler  effect,  showing  that  the  outer  layers 
of  the  star  are  indeed  pulsating.  The  observed  gas 
velocities  are  in  the  range  of  40-200  km/s. 

The  period  of  pulsation  corresponds  to  a  pro¬ 
per  frequency  of  the  star.  Just  like  a  tuning  fork 
vibrates  with  a  characteristic  frequency  when  hit, 
a  star  has  a  fundamental  frequency  of  vibration. 
In  addition  to  the  fundamental  frequency  other 
frequencies,  “overtones”,  are  possible.  The  ob¬ 
served  brightness  variation  can  be  understood 
as  a  superposition  of  all  these  modes  of  vibra¬ 
tion.  Around  1920,  the  English  astrophysicist  Sir 
Arthur  Eddington  showed  that  the  period  of  pul¬ 
sation  P  is  inversely  proportional  to  the  square 
root  of  the  mean  density, 


1 

P  a - . 

\/~p 


(14.1) 


The  diameter  of  the  star  may  double  during  the 
pulsation,  but  usually  the  changes  in  size  are  mi¬ 
nor.  The  main  cause  of  the  light  variation  is  the 
periodic  variation  of  the  surface  temperature.  We 
have  seen  in  Sect.  5.7  that  the  luminosity  of  a  star 
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depends  sensitively  on  its  effective  temperature, 
L  (x  Te4.  Thus  a  small  change  in  effective  temper¬ 
ature  leads  to  a  large  brightness  variation. 

Normally  a  star  is  in  stable  hydrostatic  equi¬ 
librium.  If  its  outer  layers  expand,  the  density  and 
temperature  decrease.  The  pressure  then  becomes 
smaller  and  the  force  of  gravity  compresses  the 
gas  again.  However,  unless  energy  can  be  trans¬ 
ferred  to  the  gas  motions,  these  oscillations  will 
be  damped. 

The  flux  of  radiative  energy  from  the  stellar 
interior  could  provide  a  source  of  energy  for  the 
stellar  oscillations,  if  it  were  preferentially  ab¬ 
sorbed  in  regions  of  higher  gas  density.  Usually 
this  is  not  the  case  but  in  the  ionisation  zones , 
where  hydrogen  and  helium  are  partially  ionised, 
the  opacity  in  fact  becomes  larger  when  the  gas  is 
compressed.  If  the  ionisation  zones  are  at  a  suit¬ 
able  depth  in  the  atmosphere,  the  energy  absorbed 
during  compression  and  released  during  expan¬ 
sion  of  an  ionisation  zone  can  drive  an  oscillation. 
Stars  with  surface  temperatures  of  6000-9000  K 
are  liable  to  this  instability.  The  corresponding 
section  of  the  HR  diagram  is  called  the  cepheid 
instability  strip. 

Cepheids  Among  the  most  important  pulsating 
variables  are  the  cepheids,  named  after  8  Cephei 
(Fig.  14.3).  They  are  population  I  supergiants 
(stellar  populations  are  discussed  in  Sect.  18.2) 
of  spectral  class  F-K.  Their  periods  are  1-50 
days  and  their  amplitudes,  0. 1-2.5  magnitudes. 
The  shape  of  the  light  curve  is  regular,  showing 
a  fairly  rapid  brightening,  followed  by  a  slower 
fall  off.  There  is  a  relationship  between  the  period 
of  a  cepheid  and  its  absolute  magnitude  (i.e.  lu¬ 
minosity),  discovered  in  1912  by  Henrietta  Leav¬ 
itt  from  cepheids  in  the  Small  Magellanic  Cloud. 
This  period-luminosity  relation  (Fig.  14.4)  can 
be  used  to  measure  distances  of  stars  and  nearby 
galaxies. 

We  have  already  noted  that  the  pulsation  pe¬ 
riod  is  related  to  the  mean  density.  On  the  other 
hand  the  size  of  a  star,  and  hence  its  mean  density, 
is  related  to  its  total  luminosity.  Thus  one  can  un¬ 
derstand  why  there  should  be  a  relation  between 
the  period  and  the  luminosity  of  a  pulsating  star. 


Fig.  14.4  The  period-luminosity  relation  for  cepheids. 
The  black  points  and  squares  are  theoretically  calculated 
values,  the  crosses  and  the  straight  line  represent  the  ob¬ 
served  relation.  (Drawing  from  Novotny,  E.  (1973):  Intro¬ 
duction  to  Stellar  Atmospheres  and  Interiors  (Oxford  Uni¬ 
versity  Press,  New  York)  p.  359) 

The  magnitudes  M  and  periods  P  of  classical 
cepheids  are  shown  in  Fig.  14.4.  The  relation  be¬ 
tween  M  and  log  P  is  linear.  However,  to  some 
extent,  the  cepheid  luminosities  also  depend  on 
colour:  bluer  stars  are  brighter.  For  accurate  dis¬ 
tance  determinations,  this  effect  needs  to  be  taken 
into  consideration. 

W  Virginis  Stars  In  1952  Walter  Baade  noted 
that  there  are  in  fact  two  types  of  cepheids:  the 
classical  cepheids  and  the  W  Virginis  stars.  Both 
types  obey  a  period-luminosity  relation,  but  the 
W  Vir  stars  of  a  given  period  are  1.5  magnitudes 
fainter  than  the  corresponding  classical  cepheids. 
This  difference  is  due  to  the  fact  that  the  classical 
cepheids  are  young  population  I  objects,  whereas 
the  WVir  stars  are  old  stars  of  population  II. 
Otherwise,  the  two  classes  of  variables  are  sim¬ 
ilar. 

Earlier,  the  WVir  period-luminosity  relation 
had  been  used  for  both  types  of  cepheids.  Con¬ 
sequently  the  calculated  distances  to  classical 
cepheids  were  too  small.  For  example,  the  dis¬ 
tance  to  the  Andromeda  Galaxy  had  been  based 
on  classical  cepheids,  since  only  these  were 
bright  enough  to  be  visible  at  that  distance.  When 
the  correct  period-luminosity  relation  was  used, 
all  extragalactic  distances  had  to  be  doubled.  Dis¬ 
tances  within  the  Milky  Way  did  not  have  to  be 
changed,  since  their  measurements  were  based  on 
other  methods. 
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Fig.  14.5  The  lightcurve  of  a  long  period  Mira  variable 

RR  Lyrae  Stars  The  third  important  class  of 
pulsating  variables  are  the  RR  Lyrae  stars.  Their 
brightness  variations  are  smaller  than  those  of  the 
cepheids,  usually  less  than  a  magnitude.  Their  pe¬ 
riods  are  also  shorter,  less  than  a  day.  Like  the 
WVir  stars,  the  RR  Lyrae  stars  are  old  popula¬ 
tion  II  stars.  They  are  very  common  in  the  glob¬ 
ular  star  clusters  and  were  therefore  previously 
called  cluster  variables. 

The  absolute  magnitudes  of  the  RR  Lyrae  stars 
are  about  My  =  0.6  db  0.3.  They  are  all  of  roughly 
the  same  age  and  mass,  and  thus  represent  the 
same  evolutionary  phase,  where  helium  is  just  be¬ 
ginning  to  bum  in  the  core.  Since  the  absolute 
magnitudes  of  the  RR  Lyrae  variables  are  known, 
they  can  be  used  to  determine  distances  to  the 
globular  clusters. 

Mira  Variables  (See  Fig.  14.5.)  The  Mira  vari¬ 
ables  (named  after  Mira  Ceti)  are  supergiants  of 
spectral  classes  M,  S  or  C,  usually  with  emis¬ 
sion  lines  in  their  spectrum.  They  are  losing  gas 
in  a  steady  stellar  wind.  Their  periods  are  nor¬ 
mally  100-500  days,  and  for  this  reason,  they 
are  also  sometimes  called  long  period  variables. 
The  amplitude  of  the  light  variations  is  typically 
about  6  magnitudes  in  the  visual  region.  The  pe¬ 
riod  of  Mira  itself  is  about  330  days  and  its  di¬ 
ameter  is  about  2  au.  At  its  brightest,  Mira  has 
the  magnitude  2-4,  but  at  light  minimum,  it  may 
be  down  to  12.  The  effective  temperature  of  the 
Mira  variables  is  only  about  2000  K.  Thus  95  % 
of  their  radiation  is  in  the  infrared,  which  means 
that  a  very  small  change  in  temperature  can  cause 
a  very  large  change  in  visual  brightness. 

Other  Pulsating  Variables  One  additional  lar¬ 
ge  group  of  pulsating  stars  are  the  semiregular 


and  irregular  variables.  They  are  supergiants,  of¬ 
ten  very  massive  young  stars  with  unsteady  pul¬ 
sations  in  their  extended  outer  layers.  If  there 
is  some  periodicity  in  the  pulsations,  these  vari¬ 
ables  are  called  semiregular;  otherwise  they  are 
irregular.  An  example  of  a  semiregular  variable 
is  Betelgeuse  ( a  Orionis).  The  pulsation  mecha¬ 
nism  of  these  stars  is  not  well  understood,  since 
their  outer  layers  are  convective,  and  the  theory 
of  stellar  convection  is  still  poorly  developed. 

In  addition  to  the  main  types  of  pulsating  vari¬ 
ables,  there  are  some  smaller  separate  classes, 
shown  in  Fig.  14.2. 

The  dwarf  cepheid  and  the  8  Scuti  stars,  which 
are  sometimes  counted  as  a  separate  type,  are  lo¬ 
cated  below  the  RR  Lyrae  stars  in  the  cepheid 
instability  strip  in  the  HR  diagram.  The  dwarf 
cepheids  are  fainter  and  more  rapidly  varying 
than  the  classical  cepheids.  Their  light  curves  of¬ 
ten  show  a  beating  due  to  interference  between 
the  fundamental  frequency  and  the  first  overtone. 

The  p  Cephei  stars  are  located  in  a  different 
part  of  the  HR  diagram  than  the  other  variables. 
They  are  hot  massive  stars,  radiating  mainly  in 
the  ultraviolet.  The  variations  are  rapid  and  of 
small  amplitude.  The  pulsation  mechanism  of  the 
P  Cephei  stars  is  unknown. 

The  RV  Tauri  stars  lie  between  the  cepheids 
and  the  Mira  variables  in  the  HR  diagram.  Their 
period  depends  slightly  on  the  luminosity.  There 
are  some  unexplained  features  in  the  light  curves 
of  the  RV  Tauri  stars,  e.g.  the  minima  are  alter¬ 
nately  deep  and  shallow. 


1 4.3  Eruptive  Variables 

In  the  eruptive  variables  there  are  no  regular 
pulsations.  Instead  sudden  outbursts  occur  in 
which  material  is  ejected  into  space.  Nowadays 
such  stars  are  divided  into  two  main  categories, 
eruptive  and  cataclysmic  variables.  Brightness 
changes  of  eruptive  variables  are  caused  by  sud¬ 
den  eruptions  in  the  chromosphere  or  corona,  the 
contributions  of  which  are,  however,  rather  small 
in  the  stellar  scale.  These  stars  are  usually  sur¬ 
rounded  by  a  gas  shell  or  interstellar  matter  par¬ 
ticipating  in  the  eruption.  This  group  includes  e.g. 
flare  stars ,  various  kinds  of  nebular  variables , 
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Table  14.2  Main  properties  of  eruptive  variables  ( N , 
number  of  stars  of  the  given  type  in  Kukarkin’s  catalogue, 
Am,  change  in  brightness  in  magnitudes.  The  velocity  is 
the  expansion  velocity  in  km/s,  based  on  the  Doppler  shifts 
of  the  spectral  lines) 


Variable 

N 

Am 

Velocity 

Supernovae 

1 

>20 

4000-10,000 

Ordinary  novae 

210 

7-18 

200-3500 

Recurrent  novae 

<10 

600 

Nova-like  stars 
(P  Cygni,  symbiotic) 

80 

<2 

30-100 

Dwarf  novae 
(SS  Cyg  =  U  Gem, 

ZZ  Cam) 

330 

2-6 

(700) 

R  Coronae  Borealis 

40 

1-9 

— 

Irregular 

(nebular  variables, 

T  Tau,  RW  Aur) 

1450 

<4 

(300) 

Flare  stars 
(UV  Ceti) 

750 

<6 

2000 

and  R  Coronae  Borealis  stars.  Eruptions  of  the 
cataclysmic  variables  are  due  to  nuclear  reactions 
on  the  stellar  surface  or  interior.  Explosions  are  so 
violent  that  they  can  even  destroy  the  whole  star. 
This  group  includes  novae  and  nova-like  stars, 
dwarf  novae  and  supernovae  (Table  14.2). 

Flare  Stars  The  flare  or  UV  Ceti  stars  are 
dwarf  stars  of  spectral  class  M.  They  are  young 
stars,  mostly  found  in  young  star  clusters  and  as¬ 
sociations.  At  irregular  intervals  there  are  flare 
outbursts  on  the  surface  of  the  stars  similar  to 
those  on  the  Sun.  The  flares  are  related  to  dis¬ 
turbances  in  the  surface  magnetic  fields.  The  en¬ 
ergy  of  the  outbursts  of  the  flare  stars  is  appar¬ 
ently  about  the  same  as  in  solar  flares,  but  because 
the  stars  are  much  fainter  than  the  Sun,  a  flare 
can  cause  a  brightening  by  up  to  4-5  magnitudes. 
A  flare  lights  up  in  a  few  seconds  and  then  fades 
away  in  a  few  minutes  (Fig.  14.6).  The  same  star 
may  flare  several  times  in  one  day.  The  optical 
flare  is  accompanied  by  a  radio  outburst,  like  in 
the  Sun.  In  fact,  the  flare  stars  were  the  first  stars 
to  be  detected  as  radio  sources. 

Nebular  Variables  In  connection  with  bright 
and  dark  interstellar  clouds  e.g.  in  the  constel¬ 
lations  of  Orion,  Taurus  and  Auriga,  there  are 


Fig.  14.6  The  outbursts  of  typical  flare  stars  are  of  short 
duration 


Fig.  14.7  Light  curve  of  a  T  Tauri  variable 

variable  stars.  The  T  Tauri  stars  are  the  most  in¬ 
teresting  of  them.  These  stars  are  newly  formed 
or  just  contracting  towards  the  main  sequence. 
The  brightness  variations  of  the  T  Tauri  stars 
are  irregular  (Fig.  14.7).  Their  spectra  contain 
bright  emission  lines,  formed  in  the  stellar  chro¬ 
mosphere,  and  forbidden  lines,  which  can  only  be 
formed  at  extremely  low  densities.  The  spectral 
lines  also  show  that  matter  is  streaming  out  from 
the  stars. 

Since  the  T  Tauri  stars  are  situated  inside 
dense  gas  clouds,  they  are  difficult  to  observe. 
However,  this  situation  has  improved  with  the  de¬ 
velopment  of  radio  and  infrared  techniques. 

Stars  in  the  process  of  formation  may  change 
in  brightness  very  rapidly.  For  example,  in  1937, 
FU  Orionis  brightened  by  6  magnitudes.  This  star 
is  a  strong  source  of  infrared  radiation,  which 
shows  that  it  is  still  enveloped  by  large  quantities 
of  interstellar  dust  and  gas.  A  similar  brighten¬ 
ing  by  six  magnitudes  was  observed  in  1969  in 
V1057Cygni  (Fig.  14.8).  Before  its  brightening, 
it  was  an  irregular  T  Tauri  variable;  since  then, 
it  has  remained  a  fairly  constant  tenth-magnitude 
AB  star. 
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Stars  of  the  R  Coronae  Borealis  type  have  “in¬ 
verse  nova”  light  curves.  Their  brightness  may 
drop  by  almost  ten  magnitudes  and  stay  low  for 
years,  before  the  star  brightens  to  its  normal  lu¬ 
minosity.  For  example,  R  CrB  itself  is  of  magni¬ 
tude  5.8,  but  may  fade  to  14.8  magnitudes.  Fig¬ 
ure  14.9  shows  its  recent  decline,  based  on  ob¬ 
servations  by  Finnish  and  French  amateurs.  The 
R  CrB  stars  are  rich  in  carbon  and  the  decline  is 
produced  when  the  carbon  condenses  into  a  cir- 
cumstellar  dust  shell. 

One  very  interesting  variable  is  77  Carinae 
(Fig.  14.10).  At  present  it  is  a  six  magnitude 
star  surrounded  by  a  thick,  extensive  envelope  of 
dust  and  gas.  In  the  early  19th  century  77  Carinae 
was  the  second  brightest  star  in  the  sky  after  Sir¬ 
ius.  Around  the  middle  of  the  century  it  rapidly 
dimmed  to  magnitude  8,  but  during  the  20th  cen¬ 
tury  it  has  brightened  somewhat.  r\  Carinae  is  a 
so  called  bright  blue  variable.  Its  mass  is  esti¬ 
mated  to  be  of  the  order  of  100  solar  masses.  The 
circumstellar  dust  cloud  is  the  brightest  infrared 
source  in  the  sky  outside  the  solar  system.  The  en- 


Fig.  14.8  In  1969-1970,  the  star  V1057  Cygni  bright¬ 
ened  by  almost  6  magnitudes 


ergy  radiated  by  77  Carinae  is  absorbed  by  the  neb¬ 
ula  and  re-radiated  at  infrared  wavelengths.  The 
exact  reason  for  the  enormous  brightness  varia¬ 
tions  is  not  known.  It  is  believed  that  the  stability 
of  the  star  is  disturbed  by  the  radiation  pressure 
of  the  huge  energy  produced  by  a  very  massive 
star.  When  the  nucleus  will  collapse  at  the  end  of 
the  lifespan  of  the  star,  77  Carinae  will  explode  as 
a  supernova. 

Novae  One  of  the  best  known  types  of  erup¬ 
tive  variables  are  the  novae.  They  are  classified 
into  several  subtypes:  ordinary  novae ,  recurrent 
novae  and  nova-like  variables.  The  dwarf  novae 
(Fig.  14.11)  are  nova-like  rather  frequently  erup¬ 
tive  stars;  although  the  lightcurves  are  similar  to 
those  of  novae,  the  mechanism  is  different. 

The  outbursts  of  all  novae  are  rapid.  Within 
a  day  or  two  the  brightness  rises  to  a  maximum, 
which  may  be  7-16  magnitudes  brighter  than  the 
normal  luminosity.  This  is  followed  by  a  gradual 
decline,  which  may  go  on  for  months  or  years. 
The  light  curve  of  a  typical  nova  is  shown  in 
Fig.  14.13.  This  light  curve  of  Nova  Cygni  1975 
has  been  composed  from  hundreds  of  observa¬ 
tions,  mostly  by  amateurs. 

In  recurrent  novae,  the  brightening  is  some¬ 
what  less  than  10  magnitudes  and  in  dwarf  novae, 
2-6  magnitudes.  In  both  types  there  are  repeated 
outbursts.  For  recurrent  novae  the  time  between 
outbursts  is  a  few  decades  and  for  the  dwarf  no¬ 
vae  20-600  days.  The  interval  depends  on  the 
strength  of  the  outburst:  the  stronger  the  outburst, 
the  longer  the  time  until  the  next  one.  The  bright- 
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Fig.  14.10  In  the  19th 

century,  r]  Carinae  was  one 
of  the  brightest  stars  in  the 
sky;  since  then  it  has 
dimmed  considerably.  In 
an  outburst  in  1843  the  star 
ejected  an  expanding 
nebula,  which  has  been 
called  “Homunculus”.  The 
Hubble  photograph  has 
been  printed  negative,  to 
show  finer  details  in  the 
bipolar  outflow. 
(Photograph 

NASA/HST/University  of 
Minnesota) 


Fig.  14.11  The  light 
curve  of  the  dwarf  nova  SS 
Cygni  in  the  beginning  of 
1966.  (Drawing  by  Martti 
Perala  is  based  on 
observations  by  Nordic 
amateurs) 


ening  in  magnitudes  is  roughly  proportional  to  sible  that  ordinary  novae  obey  the  same  relation- 
the  logarithm  of  the  recharging  interval.  It  is  pos-  ship.  However,  their  amplitude  is  so  much  larger 
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Fig.  14.12  The  novae  are 
thought  to  be  white  dwarfs 
accreting  matter  from 
a  nearby  companion  star. 
At  times,  nuclear  reactions 
burning  the  accreted 
hydrogen  are  ignited,  and 
this  is  seen  as  the  flare-up 
of  a  nova 


Nova 

outbursts 


Mass  flow 


White 

dwarf 


Normal  star 


that  the  time  between  outbursts  should  be  thou¬ 
sands  or  millions  of  years. 

Observations  have  shown  all  novae  and  dwarf 
novae  to  be  members  of  close  binary  systems. 
One  component  of  the  system  is  a  normal  star  and 
the  other  is  a  white  dwarf  surrounded  by  a  gas 
ring.  (The  evolution  of  close  binary  systems  was 
considered  in  Sect.  12.6,  where  it  was  seen  how 
this  kind  of  system  might  have  been  formed.) 
The  normal  star  fills  its  Roche  surface,  and  ma¬ 
terial  from  it  streams  over  to  the  white  dwarf. 
When  enough  mass  has  collected  on  the  surface 
of  the  white  dwarf,  the  hydrogen  is  explosively 
ignited  and  the  outer  shell  is  ejected.  The  bright¬ 
ness  of  the  star  grows  rapidly.  As  the  ejected 
shell  expands,  the  temperature  of  the  star  drops 
and  the  luminosity  gradually  decreases.  How¬ 
ever,  the  outburst  does  not  stop  the  mass  transfer 
from  the  companion  star,  and  gradually  the  white 
dwarf  accretes  new  material  for  the  next  explo¬ 
sion  (Fig.  14.12). 

The  emission  and  absorption  lines  from  the  ex¬ 
panding  gas  shell  can  be  observed  in  the  spec¬ 
trum  of  a  nova.  The  Doppler  shifts  correspond 
to  an  expansion  velocity  of  about  1000  km/s.  As 
the  gas  shell  disperses,  its  spectrum  becomes  that 
of  a  typical  diffuse  emission  nebula.  The  expand¬ 
ing  shell  around  a  nova  can  also  sometimes  be 
directly  seen  in  photographs. 

In  dwarf  novae  the  energy  does  not  come  from 
nuclear  reactions  but  mainly  from  the  potential 
energy  of  the  matter  falling  into  the  white  dwarf. 
An  eruption  occurs  when  the  density  of  the  accre¬ 


tion  disk  formed  by  the  matter  flowing  from  the 
companion  star  exceeds  a  certain  critical  limit, 
becomes  unstable  and  is  strongly  heated. 

A  considerable  fraction  of  the  novae  in  our 
Galaxy  are  hidden  by  interstellar  clouds  and  their 
number  is  therefore  difficult  to  estimate.  In  the 
Andromeda  Galaxy,  observations  indicate  25- 
30  nova  explosions  per  year.  The  number  of 
dwarf  novae  is  much  larger.  In  addition  there 
are  nova-like  variables,  which  share  many  of  the 
properties  of  novae,  such  as  emission  lines  from 
circumstellar  gas  and  rapid  brightness  variations. 
These  variables,  some  of  which  are  called  symbi¬ 
otic  stars ,  are  close  binaries  with  mass  transfer. 
Gas  streaming  from  the  primary  hits  a  gas  disk 
around  the  secondary  in  a  hot  spot,  but  there  are 
no  nova  outbursts. 

The  supersoft  stars  (SSS)  are  a  somewhat  pe¬ 
culiar  subclass  of  cataclysmic  variables.  Their  X- 
ray  radiation  is  much  stronger  and  much  softer, 
i.e.  has  longer  wavelength,  than  that  of  the  ordi¬ 
nary  cataclysmic  variables.  In  the  SSS  objects  the 
component  losing  mass  is  more  massive  than  the 
compact  star  (white  dwarf),  in  contrast  with  or¬ 
dinary  interacting  binaries.  Therefore  the  mass 
transfer  is  an  unstable  self  sustained  process 
transferring  more  material  to  the  surface  of  the 
white  dwarf  than  in  the  normal  cataclysmic  vari¬ 
ables.  On  the  surface  of  the  white  dwarf  a  contin¬ 
uous  fusion  reaction  resembling  nova  eruptions 
is  producing  a  lot  of  soft  X-ray  radiation.  Due  to 
their  unstable  character  the  SSS  objects  are  short 
lived  and  therefore  relatively  rare. 
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Fig.  14.13  In  1975  a  new 
variable,  Nova  Cygni  or 
V1500Cygni,  was 
discovered  in  Cygnus.  In 
the  upper  photograph  the 
nova  is  at  its  brightest 
(about  2  magnitudes),  and 
in  the  lower  photograph  it 
has  faded  to  magnitude  15. 
(Photographs  Lick 
Observatory) 
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14.4  Supernovae 

Explosions  of  stars  as  supernovae  are  among 
the  most  energetic  phenomena  of  the  universe. 
Within  a  couple  of  weeks  the  star  becomes  so 
bright  that  its  luminosity  corresponds  to  over  a 
billion  Suns  or  a  whole  small  galaxy.  Thus  they 
can  be  observed  even  at  cosmological  distances, 
and  they  are  useful  standard  candles  for  measur¬ 


ing  the  dimensions  of  the  universe  and  cosmo¬ 
logical  quantities.  Supernovae  are  also  important 
sources  of  heavy  elements.  Most  elements  heav¬ 
ier  than  iron  have  originated  in  supernova  explo¬ 
sions. 

The  maximum  is  followed  by  a  slow  decay, 
and  the  nearest  supernovae  can  be  observed  still 
years  after  the  explosion.  In  the  explosion  a  gas 
shell  expanding  with  a  velocity  of  thousands  of 
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Fig.  14.14  The  type  of  a  supernova  can  be  deduced  from 
different  spectral  lines  in  its  spectrum  and  the  shape  of  its 
lightcurve 


kilometres  per  second  is  ejected.  Over  200  of 
such  supernova  remnants  have  been  discovered  in 
the  Milky  Way.  Their  ages  vary  from  a  few  hun¬ 
dred  to  tens  of  thousands  of  years. 

At  least  six  supernova  explosions  have  been 
observed  in  the  Milky  Way.  Best  known  are  the 
“guest  star”  seen  in  China  in  1054  (whose  rem¬ 
nant  is  the  Crab  nebula),  Tycho  Brahe’s  super¬ 
nova  in  1572  and  Kepler’s  supernova  in  1604.  On 
the  basis  of  observations  of  other  Sb-Sc-type  spi¬ 
ral  galaxies,  the  interval  between  supernova  ex¬ 
plosions  in  the  Milky  Way  is  predicted  to  be  about 
50  years.  Some  will  be  hidden  by  obscuring  ma¬ 
terial,  especially  near  the  centre  of  the  galaxy.  In 
particularly  dusty  starburst  galaxies  almost  all  su¬ 
pernovae  remain  undetected  due  to  the  extinction 
of  the  dust.  Still  the  400  years’  interval  since  the 
last  observed  supernova  in  the  Milky  Way  is  un¬ 
usually  long. 

Supernovae  in  other  galaxies  are  detected 
quite  frequently.  For  example,  both  in  2006  and 
2007  over  500  new  supernovae  were  found.  In 
the  near  future  the  number  will  increase  rapidly 
thanks  to  new  all  sky  surveys. 

Earlier  supernovae  were  divided  just  to  two 
main  categories,  I  and  II,  based  on  whether  their 
spectra  showed  evidence  of  hydrogen.  Later  these 
main  classes  have  been  divided  into  several  sub- 


Fig.  14.15  Typical  shapes  of  the  lightcurves  of  different 
supemovae 

classes  depending  on  other  spectral  features  and 
their  lightcurves  (Figs.  14.14,  14.15  and  14.16). 

Type  la  supernovae  are  recognised  by  the  sil¬ 
icon  absorption  line  (615  nm),  which  is  strong 
in  the  spectra  of  young  novae.  It  is  thought  that 
these  supemovae  originate  in  binaries,  at  least  a 
billion  years  old,  consisting  of  a  white  dwarf  and 
its  companion.  In  such  a  close  binary  material  can 
flow  from  the  companion  to  the  white  dwarf  un¬ 
til  it  mass  exceeds  the  limit  (about  1.4M©).  Then 
the  pressure  of  the  degenerate  of  the  electron  gas 
inside  the  white  dwarf  cannot  overcome  the  grav¬ 
itation  and  star  will  collapse.  Increase  of  the  den¬ 
sity  and  temperature  will  ignite  an  explosive  fu¬ 
sion  reaction  that  can  destroy  the  white  dwarf. 
The  kinetic  energy  released  in  the  process  is  of 
the  order  of  1044  J  causing  part  of  the  gas  to  ex¬ 
pand  even  at  a  velocity  of  0.1c.  The  origin  of  the 
radiation  energy  (around  1042  J)  is  the  fission  of 
the  radioactive  nickel  isotope  56  created  in  the 
explosion  (typically  about  0.5 M©)  into  radioac¬ 
tive  cobalt  and  further  into  stable  iron.  The  more 
nickel  is  produced  in  the  explosion  the  brighter 
the  supernova  shines  at  its  maximum;  the  abso¬ 
lute  magnitude  will  reach  —19 — 20.  The  shape 
of  the  lightcurve  depends  on  the  brightness  of  the 
supernova:  the  brighter  the  supernova  the  broader 
the  top  of  the  lightcurve.  Therefore  the  maximum 
brightness  of  a  typer  la  supernova  can  be  de¬ 
termined  precisely  from  its  lightcurve.  This  way 
they  can  be  used  as  standard  candles  to  determine 
dimensions  of  the  universe  and  cosmological  pa¬ 
rameters  (Chap.  20). 

In  Chap.  12  we  discussed  the  final  stages  of 
stellar  evolution  and  how  the  collapse  of  a  mas¬ 
sive  star  will  lead  to  an  explosion.  It  is  believed 
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Fig.  1 4.1 6  Spectra  of 
different  types  of 
supernovae  at  the 
maximum,  three  weeks 
after  the  maximum  and 
after  one  year.  (Turatto: 
Supemovae  and 
Gamma-Ray  Bursters,  ed. 
K.  Weiler,  Lecture  Notes  in 
Physics,  vol.  598, 
pp.  21-36) 
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that  all  supernovae  except  type  la  are  caused  by 
the  collapse  of  massive  and  short  lived  stars.  At 
the  end  of  its  lifespan  such  a  star  has  an  iron 
core  that  will  collapse  under  its  own  gravity  after 
exceeding  the  Chandrasekhar  limit.  In  the  stellar 
core  the  matter  will  reach  the  density  of  atomic 
nuclei  and  the  outer  layers  bounce  back  send¬ 
ing  an  outbound  shock  wave.  The  process  cre¬ 
ates  a  huge  number  of  neutrinos.  Since  they  in¬ 
teract  only  weakly  with  the  outer  layers  of  the 
star  they  rush  to  the  surrounding  space.  The  re¬ 
maining  core  may  become  a  neutron  star  or,  in 
the  case  of  a  very  massive  star,  a  black  hole.  Most 
of  the  released  energy  (about  99  %)  escapes  with 
the  neutrinos.  The  outer  layers  of  the  star  receive 
a  kinetic  energy  of  about  1044  J,  part  of  which 
may  be  released  as  radiation  when  the  expanding 
matter  hits  the  material  surrounding  the  star  and 
later  the  interstellar  matter.  Typically  the  super¬ 
nova  itself  emits  radiation  about  1042  J  within  a 
few  months  after  the  explosion,  reaching  an  ab¬ 
solute  magnitude  between  —16  and  —20  at  the 
maximum.  Later  the  brightness  is  sustained  by 
the  fission  of  the  radioactive  nickel  56,  born  in  the 
explosion,  to  other  elements  just  as  in  the  case  of 
the  type  la  supernovae. 

If  the  spectrum  of  a  supernova  does  not  con¬ 
tain  strong  hydrogen  or  silicon  lines  its  type  is 


either  lb  or  Ic.  The  type  lb  supernovae  are  recog¬ 
nised  by  strong  helium  lines  that  are  not  present 
in  the  spectra  of  type  Ic.  These  supernovae  orig¬ 
inate  in  stars  that  don’t  have  any  more  hydrogen 
in  their  outer  layer.  The  stars  exploding  as  type  Ic 
supernovae  have  also  lost  their  outer  layer  of  he¬ 
lium.  Such  stars  that  have  lost  their  outer  layers 
are  called  Wolf-Ray et  stars. 

All  type  II  supernovae  have  hydrogen  lines  in 
their  spectra.  They  are  divided  into  main  types 
II-P  and  II-L  based  on  the  flat  (plateau)  and  lin¬ 
ear  shape  of  the  lightcurve.  Recently  several  stars 
that  later  exploded  as  supernovae  have  been  iden¬ 
tified  in  high  resolution  pictures  taken  e.g.  by  the 
Hubble  space  telescope  and  the  VLT  telescope 
before  the  explosion  (Fig.  14.17).  These  obser¬ 
vations  have  shown  that  the  more  common  type 
II-P  supernovae  originate  in  red  supergiant  stars 
as  predicted  by  theories  of  stellar  evolution.  Ini¬ 
tially  the  masses  of  such  stars  have  been  at  least  8 
solar  masses.  Smaller  stars  are  not  believed  to  ex¬ 
plode  in  a  collapse.  They  end  up  as  white  dwarfs 
after  more  quiescent  evolution. 

Type  Iln  supernovae  are  recognised  by  their 
hydrogen  emission  lines  that  are  much  narrower 
than  in  the  spectra  of  other  supernovae.  The  lines 
originate  in  the  matter  around  the  star.  The  spec¬ 
tra  of  young  type  Iln  supernovae  have  hydrogen 
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lines,  which,  however,  disappear  in  a  few  weeks, 
and  afterwards  the  spectrum  resembles  the  type 
lb  spectra.  It  is  assumed  that  these  supernovae 
were  originally  massive  stars  that  still  had  a  small 
amount  of  hydrogen  in  their  outer  layer  before  the 
explosion.  Thus  they  are  intermedia  between  the 
types  II  and  lb.  The  observed  properties  and  types 
of  supernovae  depend  on  the  properties  of  their 
progenitors,  and  hence  individual  supernovae  can 
of  intermediate  types.  Hence  it  makes  more  sense 


to  classify  supernovae  according  to  the  explosion 
mechanism  simply  to  two  different  types,  ther¬ 
monuclear  and  core  collapse  supernovae.  About 
30  %  of  the  supernovae  found  in  nearby  galaxies 
are  thermonuclear  and  the  remaining  70  %  core 
collapse  supernovae. 

On  February  23,  1987  the  first  burst  of  light 
from  a  supernova  in  the  Large  Magellanic  Cloud, 
the  small  companion  galaxy  of  the  Milky  Way, 
reached  the  Earth  (Fig.  14.18).  This  supernova, 


Fig.  14.17  Left:  The 
supernova  2003 gd  of  type 
II-P  after  the  explosion. 
Right:  The  same  region 
before  the  explosion.  The 
exploded  red  supergiant 
(progenitor)  was  located  in 
the  image  taken  before  the 
explosion.  Both  images 
were  taken  with  the  Hubble 
space  telescope.  (Smartt  et 
al.,  2004,  Science, 
vol.  303,  5657,  499) 
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Fig.  1 4.1 8  Supernova  1987A  in  the  Large  Magellanic  Cloud  before  and  after  the  explosion.  (Photographs  ESO) 
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SN  1987 A,  was  of  type  II,  and  was  the  bright¬ 
est  supernova  for  383  years.  After  its  first  detec¬ 
tion,  SN  19 87 A  was  studied  in  great  detail  by  all 
available  means.  Although  the  general  ideas  of 
Sects.  11.4  and  11.5  on  the  final  stages  of  stellar 
evolution  have  been  confirmed,  there  are  compli¬ 
cations.  Thus  e.g.  the  progenitor  star  was  a  blue 
rather  than  a  red  giant  as  expected,  perhaps  be¬ 
cause  of  the  lower  abundance  of  heavy  elements 
in  the  Large  Magellanic  Cloud  compared  to  that 
in  the  Milky  Way.  The  collapse  of  its  core  re¬ 
leased  a  vast  amount  of  energy  as  a  pulse  of  neu¬ 
trinos,  which  was  detected  in  Japan  and  the  USA. 
The  amount  of  energy  released  indicates  that  the 
remnant  is  a  neutron  star. 
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14.6  Exercises 

Exercise  14.1  The  absolute  visual  magnitude  of 
RRLyrae  variables  is  0.6  db  0.3.  What  is  the  rela¬ 
tive  error  of  distances  due  to  the  deviation  in  the 
magnitude? 


Example  14.1  The  observed  period  of  a  cepheid 
is  20  days  and  its  mean  apparent  magnitude  m  = 
20.  From  Fig.  14.4,  its  absolute  magnitude  is 
M  ~  —5.  According  to  (4.12),  the  distance  of  the 
cepheid  is 

r  =  10  x  l0(m“M)/5  =  10  x  10(20  +  5)/5 
=  106  pc  =  1  Mpc. 


Example  14.2  The  brightness  of  a  cepheid 
varies  2  mag.  If  the  effective  temperature  is 
6000  K  at  the  maximum  and  5000  K  at  the  mini¬ 
mum,  how  much  does  the  radius  change? 

The  luminosity  varies  between 
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Exercise  14.2  The  bolometric  magnitude  of 
a  long  period  variable  varies  by  one  magnitude. 
The  effective  temperature  at  the  maximum  is 
4500  K. 

(a)  What  is  the  temperature  at  the  minimum,  if 
the  variation  is  due  to  temperature  change 
only? 

(b)  If  the  temperature  remains  constant,  what  is 
the  relative  variation  in  the  radius? 

Exercise  14.3  In  1983  the  radius  of  the  Crab 
nebula  was  about  3'.  It  is  expanding  0.21"  a  year. 
Radial  velocities  of  1300  kms-1  with  respect  to 
the  central  star  have  been  observed  in  the  nebula. 

(a)  What  is  the  distance  of  the  nebula,  assuming 
its  expansion  is  symmetric? 

(b)  A  supernova  explosion  has  been  observed  in 
the  direction  of  the  nebula.  Estimate,  how 
long  time  ago? 

(c)  What  was  the  apparent  magnitude  of  the  su¬ 
pernova,  if  the  absolute  magnitude  was  a  typ¬ 
ical  -18? 
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In  astrophysics  those  stars  in  which  the  density 
of  matter  is  much  larger  than  in  ordinary  stars  are 
known  as  compact  objects.  These  include  white 
dwarfs,  neutron  stars,  and  black  holes.  In  addi¬ 
tion  to  a  very  high  density,  the  compact  objects 
are  characterised  by  the  fact  that  nuclear  reactions 
have  completely  ceased  in  their  interiors.  Conse¬ 
quently  they  cannot  support  themselves  against 
gravity  by  thermal  gas  pressure.  In  the  white 
dwarfs  and  neutron  stars,  gravity  is  resisted  by  the 
pressure  of  a  degenerate  gas.  In  the  black  holes 
the  force  of  gravity  is  completely  dominant  and 
compresses  the  stellar  material  to  infinite  density. 

Compact  stars  in  binary  systems  give  rise  to  a 
variety  of  striking  new  phenomena.  If  the  com¬ 
panion  star  is  losing  mass  by  a  stellar  wind  or 
a  Roche  lobe  overflow,  the  gas  that  is  shed  may 
be  accreted  by  the  compact  object.  This  will  re¬ 
lease  gravitational  energy  that  can  be  observable 
in  the  form  of  X-ray  emission  and  strong  and 
rapid  brightness  variations. 


15.1  White  Dwarfs 

As  was  mentioned  in  Sect.  11.2,  in  ordinary  stars 
the  pressure  of  the  gas  obeys  the  equation  of  state 
of  an  ideal  gas.  In  stellar  interiors  the  gas  is  fully 
ionised,  i.e.  it  is  plasma  consisting  of  ions  and 
free  electrons.  The  partial  pressures  of  the  ions 
and  electrons  together  with  the  radiation  pressure 
important  in  hot  stars  comprise  the  total  pres¬ 
sure  balancing  gravitation.  When  the  star  runs 
out  of  its  nuclear  fuel,  the  density  in  the  interior 
increases,  but  the  temperature  does  not  change 


much.  The  electrons  become  degenerate,  and  the 
pressure  is  mainly  due  to  the  pressure  of  the  de¬ 
generate  electron  gas,  the  pressure  due  to  the  ions 
and  radiation  being  negligible.  The  star  becomes 
a  white  dwarf. 

As  will  be  explained  in  Box  15.1  the  radius  of 
a  degenerate  star  is  inversely  proportional  to  the 
cubic  root  of  the  mass.  Unlike  in  a  normal  star  the 
radius  decreases  as  the  mass  increases. 

The  first  white  dwarf  to  be  discovered  was  Sir¬ 
ius  B,  the  companion  of  Sirius  (Fig.  15.1).  Its  ex¬ 
ceptional  nature  was  realised  in  1915,  when  it 
was  discovered  that  its  effective  temperature  was 
very  high.  Since  it  is  faint,  this  meant  that  its  ra¬ 
dius  had  to  be  very  small,  slightly  smaller  than 
that  of  the  Earth.  The  mass  of  Sirius  B  was  known 
to  be  about  equal  to  that  of  the  Sun,  so  its  density 
had  to  be  extremely  large. 

The  high  density  of  Sirius  B  was  confirmed  in 
1925,  when  the  gravitational  redshift  of  its  spec¬ 
tral  lines  was  measured.  This  measurement  also 
provided  early  observational  support  to  Einstein’s 
general  theory  of  relativity. 

White  dwarfs  occur  both  as  single  stars  and 
in  binary  systems.  Their  spectral  lines  are  broad¬ 
ened  by  the  strong  gravitational  field  at  the  sur¬ 
face.  In  some  white  dwarfs  the  spectral  lines  are 
further  broadened  by  rapid  rotation.  Strong  mag¬ 
netic  fields  have  also  been  observed. 

White  dwarfs  have  no  internal  sources  of  en¬ 
ergy,  but  further  gravitational  contraction  is  pre¬ 
vented  by  the  pressure  of  the  degenerate  electron 
gas.  Radiating  away  the  remaining  heat,  white 
dwarfs  will  slowly  cool,  changing  in  colour  from 
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Fig.  15.1  Two  views  of  the  best-known  white  dwarf  Sir¬ 
ius  B,  the  small  companion  to  Sirius.  On  the  left ,  a  picture 
in  visible  light  by  the  Hubble  Space  Telescope.  Sirius  B 
is  the  tiny  white  dot  on  lower  left  from  the  overexposed 
image  of  Sirius.  On  the  right ,  an  X-ray  picture  of  the  pair 

white  to  red  and  finally  to  black.  The  cooling  time 
is  comparable  to  the  age  of  the  Universe,  and  even 
the  oldest  white  dwarfs  should  still  be  observable. 
Looking  for  the  faintest  white  dwarfs  has  been 
used  as  a  way  to  set  a  lower  limit  on  the  age  of 
the  Universe. 

Cataclysmic  Variables  When  a  white  dwarf  is 
a  member  of  a  close  binary  system,  it  can  ac¬ 
crete  mass  from  its  companion  star.  The  most  in¬ 
teresting  case  is  where  a  main  sequence  star  is 
filling  its  Roche  lobe ,  the  largest  volume  it  can 
have  without  spilling  over  to  the  white  dwarf.  As 
the  secondary  evolves,  it  expands  and  begins  to 
lose  mass,  which  is  eventually  accreted  by  the 
primary.  Binary  stars  of  this  kind  are  known  as 
cataclysmic  variables. 

The  present  definition  of  the  class  of  cata¬ 
clysmic  variables  has  gradually  evolved,  and  in 
consequence  many  types  of  systems  that  were 
earlier  viewed  as  separate  are  now  collected  un¬ 
der  this  heading.  In  principle,  even  type  la  super¬ 
novae  should  be  included.  The  classical  novae , 
whose  eruptions  are  caused  by  the  sudden  igni¬ 
tion  of  hydrogen  that  has  collected  on  the  sur¬ 
face  of  the  white  dwarf,  have  been  described  in 
Sect.  14.3.  In  the  eruptions  most  of  the  accreted 


taken  by  the  Chandra  X-ray  observatory.  Sirius  B  is  now 
the  brighter  source,  and  Sirius  is  weaker,  because  its  sur¬ 
face  is  much  cooler  than  the  surface  of  the  white  dwarf. 
(Photos  NASA  /  HST  and  Chandra) 


gas  is  expelled  in  a  shell,  but  if  the  mass  transfer 
continues  in  the  system  further  eruptions  may  oc¬ 
cur,  giving  rise  to  recurrent  novae.  Finally,  cata¬ 
clysmic  variables  without  eruptions,  for  example 
pre-novae  or  post-novae,  are  classified  as  nova¬ 
like  variables. 

The  dwarf  novae,  also  described  in  Sect.  14.3, 
are  produced  by  a  quite  different  mechanism.  In 
their  case  the  outbursts  are  not  caused  by  ther¬ 
monuclear  reactions,  but  by  instabilities  in  the 
accretion  flow  around  the  white  dwarf.  Although 
the  details  of  the  outburst  mechanism  are  still  not 
completely  clear,  the  basic  picture  is  that  the  disk 
has  two  possible  states,  a  hot  and  a  cool  one, 
available.  Under  some  conditions  the  disk  cannot 
remain  permanently  in  either  of  these  states,  and 
has  to  jump  repeatedly  between  the  hot  outburst 
state  and  the  cool  quiescent  state. 

A  special  type  of  nova-like  variables  are  the 
magnetic  cataclysmic  variables.  In  the  polars  the 
magnetic  field  is  so  strong  that  the  accreted  gas 
cannot  settle  into  an  accretion  disk.  Instead  it  is 
forced  to  follow  the  magnetic  field  lines,  form¬ 
ing  an  accretion  column.  As  the  gas  hits  the  sur¬ 
face  of  the  white  dwarf  it  is  strongly  heated  giving 
rise  to  bright  X-ray  emission,  which  is  a  charac¬ 
teristic  feature  of  polars.  Systems  with  a  slightly 
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Fig.  15.2  The  structure  of 
a  neutron  star.  The  crust  is 
rigid  solid  material  and  the 
mantle  a  freely  streaming 
superfluid 


weaker  magnetic  field  are  called  intermediate  po- 
lars.  These  systems  exhibit  both  X-ray  emission 
and  variations  due  to  an  accretion  disk. 


15.2  Neutron  Stars 

If  the  mass  of  a  star  is  large  enough,  the  density  of 
matter  may  grow  even  larger  than  in  normal  white 
dwarfs.  The  equation  of  state  of  a  classical  degen¬ 
erate  electron  gas  then  has  to  be  replaced  with  the 
corresponding  relativistic  formula.  In  this  case 
decreasing  the  radius  of  the  star  no  longer  helps 
in  resisting  the  gravitational  attraction.  Equilib¬ 
rium  is  possible  only  for  one  particular  value  of 
the  mass,  the  Chandrasekhar  mass  Mch,  already 
introduced  in  Sect.  12.5.  The  value  of  Mch  is 
about  1 .4  M©  ,  which  is  thus  the  upper  limit  to  the 
mass  of  a  white  dwarf.  If  the  mass  of  the  star  is 
larger  than  M©h,  gravity  overwhelms  the  pressure 
and  the  star  will  rapidly  contract  towards  higher 
densities.  The  final  stable  state  reached  after  this 
collapse  will  be  a  neutron  star  (Fig.  15.2).  On  the 
other  hand,  if  the  mass  is  smaller  than  M©h,  the 
pressure  dominates.  The  star  will  then  expand  un¬ 
til  the  density  is  small  enough  to  allow  an  equilib¬ 
rium  state  with  a  less  relativistic  equation  of  state. 

When  a  massive  star  reaches  the  end  of  its  evo¬ 
lution  and  explodes  as  a  supernova,  the  simulta¬ 
neous  collapse  of  its  core  will  not  necessarily  stop 
at  the  density  of  a  white  dwarf.  If  the  mass  of  the 
collapsing  core  is  larger  than  the  Chandrasekhar 
mass  (>1.4  Mq),  the  collapse  continues  to  a  neu¬ 
tron  star. 

An  important  particle  reaction  during  the  fi¬ 
nal  stages  of  stellar  evolution  is  the  URCA  pro¬ 
cess,  which  was  put  forward  by  Schonberg  and 
Gamow  in  the  1940’s  and  which  produces  a  large 


neutrino  emission  without  otherwise  affecting  the 
composition  of  matter.  (The  URCA  process  was 
invented  in  Rio  de  Janeiro  and  named  after  a  local 
casino.  Apparently  money  disappeared  at  URCA 
just  as  energy  disappeared  from  stellar  interiors  in 
the  form  of  neutrinos.  It  is  claimed  that  the  casino 
was  closed  by  the  authorities  when  this  similarity 
became  known.)  The  URCA  process  consists  of 
the  reactions 

(Z,  A)  +  e  — >  (Z  —  1,  A)  +  ve, 

(Z  —  1,  A)  — >  (Z,  A)  +  e  +  ve, 

where  Z  is  the  number  of  protons  in  a  nucleus; 
A  the  mass  number;  e~  an  electron;  and  ve  and 
ve  the  electron  neutrino  and  antineutrino.  When 
the  electron  gas  is  degenerate,  the  latter  reaction 
is  suppressed  by  the  Pauli  exclusion  principle.  In 
consequence  the  protons  in  the  nuclei  are  trans¬ 
formed  into  neutrons.  As  the  number  of  neutrons 
in  the  nuclei  grows,  their  binding  energies  de¬ 
crease.  At  densities  of  about  4  x  1014  kg/m3  the 
neutrons  begin  to  leak  out  of  the  nucleus,  and  at 
1017  kg/m3  the  nuclei  disappear  altogether.  Mat¬ 
ter  then  consists  of  a  neutron  “porridge”,  mixed 
with  about  0.5  %  electrons  and  protons. 

Neutron  stars  are  supported  against  gravity  by 
the  pressure  of  the  degenerate  neutron  gas,  just 
as  white  dwarfs  are  supported  by  electron  pres¬ 
sure.  The  equation  of  state  is  the  same,  except 
that  the  electron  mass  is  replaced  by  the  neu¬ 
tron  mass,  and  that  the  mean  molecular  weight  is 
defined  with  respect  to  the  number  of  free  neu¬ 
trons.  Since  the  gas  consists  almost  entirely  of 
neutrons,  the  mean  molecular  weight  is  approx¬ 
imately  one. 

The  typical  diameters  of  neutron  stars  are 
about  10  km.  Unlike  ordinary  stars  they  have 
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Fig.  15.3  A  rotating  neutron  star  is  surrounded  by  a 
strong  magnetic  field  which  drags  electrons  from  the  sur¬ 
face  and  accelerates  them  to  relativistic  speeds  over  the 
magnetic  poles.  When  the  electrons  are  accelerated  along 
the  magnetic  field  lines,  they  radiate  so  called  curvature 
radiation  in  a  narrow  beam.  Since  the  magnetic  axis  is  mis¬ 
aligned  with  the  rotation  axis,  the  beams  sweep  around  the 
sky  like  in  a  lighthouse.  (Lorimer-Kramer  2005,  Hand¬ 
book  of  Pulsar  Astronomy,  Cambridge  University  Press, 
p.  55) 

a  well-defined  solid  surface.  The  atmosphere 
above  it  is  a  few  centimetres  thick.  The  upper 
crust  is  a  metallic  solid  with  the  density  grow¬ 
ing  rapidly  inwards.  Most  of  the  star  is  a  neutron 
superfluid,  and  in  the  centre,  where  the  density 
exceeds  1018  kg/m3,  there  may  be  a  solid  nu¬ 
cleus  of  heavier  particles  (hyperons),  or  of  quark 
matter,  where  the  quarks  that  normally  constitute 
neutrons  have  become  unconfined. 

A  neutron  star  formed  in  the  explosion  and 
collapse  of  a  supernova  will  initially  rotate  ra¬ 
pidly,  because  its  angular  momentum  is  un¬ 
changed  while  its  radius  is  much  smaller  than 
before.  In  a  few  hours  the  star  will  settle  in 
a  flattened  equilibrium,  rotating  several  hundred 
times  per  second.  The  initial  magnetic  field  of  the 
neutron  star  will  also  be  compressed  in  the  col¬ 
lapse,  so  that  there  will  be  a  strong  field  coupling 
the  star  to  the  surrounding  material.  The  angu¬ 
lar  momentum  of  the  neutron  star  is  steadily  de- 


Fig.  15.4  Consecutive  radio  pulses  at  408  MHz  from 
two  pulsars.  To  the  left  PSR  1642-03  and  to  the  right 
PSR  1133+16.  Observations  made  at  Jodrell  Bank.  (Pic¬ 
ture  from  Smith,  F.G.  (1977):  Pulsars  (Cambridge  Univer¬ 
sity  Press,  Cambridge)  pp.  93,  95) 

creased  by  the  emission  of  electromagnetic  radia¬ 
tion,  neutrinos,  cosmic  ray  particles  and  possibly 
gravitational  radiation.  Thus  the  angular  veloc¬ 
ity  decreases.  The  rotation  can  also  break  the  star 
into  several  separate  objects.  They  will  eventu¬ 
ally  recombine  when  the  energy  of  the  system  is 
reduced.  In  some  cases  the  stars  can  remain  sep¬ 
arated,  resulting  e.g.  in  a  binary  neutron  star. 

The  theory  of  neutron  stars  was  developed  in 
the  1930’s,  but  the  first  observations  were  not 
made  until  the  1960’s.  At  that  time  the  pulsars , 
a  new  type  of  rapidly  pulsating  radio  sources, 
were  discovered  and  identified  as  neutron  stars. 
In  the  1970’s  neutron  stars  were  also  seen  as  X- 
ray  pulsars ,  X-ray  bursters  and  magnetars. 

Pulsars  The  pulsars  were  discovered  in  1967, 
when  Anthony  Hewish  and  Jocelyn  Bell  in  Cam¬ 
bridge,  England,  detected  sharp,  regular  radio 
pulses  coming  from  the  sky.  Since  then  about 
1500  pulsars  have  been  discovered  (Fig.  15.4). 
Their  periods  range  from  0.0016  s  (for  the  pul¬ 
sar  1937  +  214)  up  to  20  minutes. 

In  addition  to  the  steady  slowing  down  of  the 
rotation,  sometimes  small  sudden  jumps  in  the 
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Fig.  15.5  A  time- 
sequence  of  the  pulsation 
of  the  Crab  pulsar  in  visible 
light.  The  pictures  were 
taken  once  about  every 
millisecond;  the  period  of 
the  pulsar  is  about  33 
milliseconds.  (Photos  N.A. 
Sharp/N  O  AO/AUR  A/N  S  F) 


period  are  observed.  These  might  be  a  sign  of 
rapid  mass  movements  in  the  neutron  star  crust 
(“starquakes”)  or  in  its  surroundings. 

The  origin  of  the  radio  pulses  can  be  under¬ 
stood  if  the  magnetic  field  is  tilted  at  an  angle 
of  45° -90°  with  respect  to  the  rotation  axis.  The 
field  is  so  strong  that  it  drags  electrons  from  the 
surface  and  accelerates  them  to  relativistic  speeds 
over  the  magnetic  poles.  When  the  electrons  are 
accelerated  along  the  magnetic  field  lines,  they 
radiate  so  called  curvature  radiation  which  is  re¬ 
lated  to  synchrotron  radiation  (Fig.  15.3).  In  the 
direction  of  the  magnetic  poles  two  thin  beams 
of  radio  radiation  are  emitted.  The  beams  sweep 
around  the  sky,  and  if  the  Earth  happens  to  be  in 
the  path  of  the  beam  a  pulsar  is  seen. 

The  best-known  pulsar  is  located  in  the  Crab 
nebula  (Figs.  15.5  and  15.7).  This  small  neb¬ 
ula  in  the  constellation  Taurus  was  noted  by  the 
French  astronomer  Charles  Messier  in  the  middle 
of  the  18th  century  and  became  the  first  object  in 
the  Messier  catalogue,  Ml.  The  Crab  nebula  was 
found  to  be  a  strong  radio  source  in  1948  and  an 
X-ray  source  in  1964.  The  pulsar  was  discovered 
in  1968.  In  the  following  year  it  was  observed  op¬ 
tically  and  was  also  found  to  be  an  X-ray  emitter. 

Neutron  stars  are  difficult  to  study  optically, 
since  their  luminosity  in  the  visible  region  is  very 


small  (typically  about  10-6  L©).  For  instance  the 
Vela  pulsar  has  been  observed  at  a  visual  magni¬ 
tude  of  about  25.  In  the  radio  region,  it  is  a  very 
strong  pulsating  source. 

A  few  pulsars  have  been  discovered  in  binary 
systems;  the  first  one,  PSR  1913+16,  in  1974. 
In  1993  Joseph  Taylor  and  Russell  Hulse  were 
awarded  the  Nobel  prize  for  the  detection  and 
studies  of  this  pulsar.  The  pulsar  orbits  about 
a  companion,  presumably  another  neutron  star, 
with  the  orbital  eccentricity  0.6  and  the  period 
8  hours.  The  observed  period  of  the  pulses  is  al¬ 
tered  by  the  Doppler  effect,  and  this  allows  one  to 
determine  the  velocity  curve  of  the  pulsar.  These 
observations  can  be  made  very  accurately,  and  it 
has  therefore  been  possible  to  follow  the  changes 
in  the  orbital  elements  of  the  system  over  a  pe¬ 
riod  of  several  years.  For  example,  the  periastron 
direction  of  the  binary  pulsar  has  been  found  to 
rotate  about  4°  per  year.  This  phenomenon  can 
be  explained  by  means  of  the  general  theory  of 
relativity;  in  the  solar  system,  the  corresponding 
rotation  (the  minor  fraction  of  the  rotation  not 
explained  by  the  Newtonian  mechanics)  of  the 
perihelion  of  Mercury  is  43  arc  seconds  per  cen¬ 
tury. 

The  binary  pulsar  PSR  1913+16  has  also  pro¬ 
vided  the  first  strong  evidence  for  the  existence  of 
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gravitational  waves.  During  the  time  of  observa¬ 
tion  the  orbital  period  of  the  system  has  steadily 
decreased.  This  shows  that  the  system  is  losing 
orbital  energy  at  a  rate  that  agrees  exactly  with 
that  predicted  by  the  general  theory  of  relativity. 
The  lost  energy  is  radiated  as  gravitational  waves. 

Magnetars  The  energy  emitted  by  common 
pulsars  has  its  origin  in  the  slowing  down  of  their 
rotation.  In  some  neutron  stars,  the  magnetars, 
the  magnetic  field  is  so  strong  that  the  energy  re¬ 
leased  in  the  decay  of  the  field  is  the  main  source 
of  energy.  Whereas  in  ordinary  pulsars  the  mag¬ 
netic  field  is  typically  109  T,  in  magnetars  a  typi¬ 
cal  value  may  be  109— 101 1  T. 

Magnetars  were  first  invoked  as  an  explana¬ 
tion  of  the  soft  gamma  repeaters  (SGR),  X-ray 
stars  that  irregularly  emit  bright,  short  (0.1  s)  re¬ 
peating  flashes  of  low-energy  gamma  rays.  Later 
a  second  class  of  mysterious  objects,  the  anoma¬ 
lous  X-ray  pulsars  (AXP),  were  identified  as 
magnetars.  AXP  are  slowly  rotating  pulsars,  with 
a  rotation  period  of  6  to  12  seconds.  Despite  this 
they  are  bright  X-ray  sources,  which  can  be  un¬ 
derstood  if  their  energy  is  of  magnetic  origin. 

It  is  thought  that  magnetars  are  the  remnants 
of  stars  that  were  more  massive  and  rapidly  ro¬ 
tating  than  those  giving  rise  to  ordinary  pulsars, 
although  the  details  are  still  subject  to  debate. 
A  magnetar  first  appears  as  a  SGR.  During  this 
phase,  which  only  lasts  about  10,000  years,  the 
very  strong  magnetic  field  is  slowing  down  the 
rate  of  rotation.  At  the  same  time  the  field  is 
drifting  with  respect  to  the  neutron  star  crust. 
This  causes  shifts  in  the  crust  structure,  leading 
to  powerful  magnetic  flares  and  the  observed  out¬ 
bursts.  After  about  10,000  years  the  rotation  has 
slowed  down  so  much  that  the  outbursts  cease, 
leaving  the  neutron  star  observable  as  an  AXP. 

Gamma  Ray  Bursts  For  a  long  time  the  gam¬ 
ma  ray  bursts  (GRB),  very  short  and  sharp 
gamma  ray  pulses  first  discovered  in  1973,  re¬ 
mained  a  mystery.  Unlike  the  much  less  common 
SGR,  the  GRB  never  recurred,  and  they  had  no 
optical  or  X-ray  counterparts.  A  first  major  ad¬ 
vance  was  made  when  satellite  observations  with 
the  Compton  Gamma  Ray  Observatory  showed 


Fig.  15.6  The  location  of  the  peculiar  type  Ibc  supernova 
SN  1998bw  at  redshift  z  =  0.0085  is  in  the  circle  to  the 
lower  left.  This  was  also  the  position  of  the  faint  gam¬ 
ma-ray  burst  GRB  980425,  the  first  GRB  to  be  connected 
with  a  supernova.  The  circle  on  the  upper  right  marks  an 
ultraluminous  X-ray  source.  (C.  Kouveliotou  et  al.  2004, 
ApJ  608,  872,  Fig.  1) 

that  the  gamma  ray  bursts  are  almost  uniformly 
distributed  in  the  sky,  unlike  the  known  neutron 
stars. 

The  nature  of  the  gamma  ray  bursts  is  now 
becoming  clear  thanks  to  dedicated  observing 
programmes  that  have  used  burst  detections  by 
gamma  and  X-ray  satellites  such  as  Beppo-SAX 
and,  in  particular,  Swift  rapidly  to  look  for  after¬ 
glows  of  the  GRB  at  optical  wavelengths.  The  de¬ 
tection  of  these  afterglows  has  made  it  possible  to 
determine  distances  to  the  bursts  and  their  loca¬ 
tion  in  their  host  galaxies  (see  Fig.  15.6). 

It  has  become  clear  that  there  are  at  least  two 
kinds  of  bursts,  with  the  self-descriptive  names 
long  soft  bursts,  and  short  hard  bursts.  The  long 
soft  gamma  ray  bursts,  lasting  longer  than  2  sec¬ 
onds,  have  now  been  convincingly  shown  to  be 
produced  in  the  explosions  of  massive  stars  at  the 
end  of  their  life,  specifically  supernovae  of  types 
lb  and  Ic  (Sect.  14.3).  Only  a  small  fraction  of 
all  type  Ibc  supernovae  give  rise  to  a  GRB.  The 
explosions  that  produce  GRB  have  been  called 
hypernovae,  and  are  among  the  brightest  objects 
in  the  Universe.  A  gamma  ray  burst  observed  in 
late  2005  took  place  when  the  Universe  was  only 
900  million  years  old,  making  it  one  of  the  most 
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Fig.  15.7  Some  pulsars  shine  brightly  in  gamma-rays.  In 
the  center  the  Crab  pulsar  and  on  the  upper  left  the  gamma 
source  Geminga,  which  was  identified  in  1992  to  be  the 
nearest  pulsar  with  a  distance  of  about  100  pc  from  the 
Sun.  (Photo  by  Compton  Gamma  Ray  Observatory) 

distant  objects  ever  observed.  The  conditions  re¬ 
quired  for  hypernova  explosions  are  still  not  cer¬ 
tain. 

The  nature  of  the  systems  giving  rise  to  short 
gamma  ray  bursts,  lasting  less  than  2  seconds, 
have  been  more  difficult  to  ascertain.  The  most 
popular  theory  has  been  that  they  are  produced  in 
compact  binary  systems  consisting  of  two  neu¬ 
tron  stars  or  a  neutron  star  and  a  black  hole. 
These  systems  lose  energy  by  gravitational  radi¬ 
ation,  and  eventually  the  two  components  should 
merge,  producing  a  burst  of  gamma  radiation. 
This  theory  has  now  received  strong  support 
when  the  afterglow  of  a  few  short  bursts  has  been 
detected  in  the  outer  parts  of  their  host  galaxies. 
Since  the  stars  in  these  regions  are  all  old  and 
no  longer  give  rise  to  core-collapse  supemovae, 
the  neutron  star  merger  hypothesis  appears  most 
likely.  However,  it  is  still  also  possible  that  some 
of  the  short  bursts  are  exceptionally  bright  mag- 
netar  flares. 


Box  15.1  (The  Radius  of  White  Dwarfs  and 
Neutron  Stars)  The  mass  of  a  white  dwarf  or 
a  neutron  star  determines  its  radius.  This  fol¬ 
lows  from  the  equation  of  hydrostatic  equilib¬ 
rium  and  from  the  pressure-density  relation  for 


a  degenerate  gas.  Using  the  hydrostatic  equilib¬ 
rium  equation  (11.1) 

d  P  GMrp 

dr  r 2 

one  can  estimate  the  average  pressure  P : 


d  P 
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Here  we  have  used  p  oc  M/R3 .  Thus  the  pres¬ 
sure  obeys 

P(xM2/Ra.  (1) 

In  the  nonrelativistic  case,  the  pressure  of  a  de¬ 
generate  electron  gas  is  given  by  (11.16): 

P  ss  (h2 / me)(tiemu)~5/3 p5/3 


and  hence 


P  oc 


p5/3 


5/3  ' 
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By  combining  (1)  and  (2)  we  obtain 


M2  M5C 

or 

R  oc - - — -=-pr  oc  M_1//3 . 

Ml/3mQpt5J3 

Thus  the  smaller  the  radius  of  a  white  dwarf  is, 
the  larger  its  mass  will  be.  If  the  density  be¬ 
comes  so  large  that  the  relativistic  equation  of 
state  (11.17)  has  to  be  used,  the  expression  for 
the  pressure  is 

4/3  M4/3 

Pcxp  °< 


As  the  star  contracts,  the  pressure  grows  at  the 
same  rate  as  demanded  by  the  condition  for  hy¬ 
drostatic  support  (1).  Once  contraction  has  be¬ 
gun,  it  can  only  stop  when  the  state  of  mat¬ 
ter  changes:  the  electrons  and  protons  com¬ 
bine  into  neutrons.  Only  a  star  that  is  massive 
enough  can  give  rise  to  a  relativistic  degenerate 
pressure. 

The  neutrons  are  fermions,  just  like  the  elec¬ 
trons.  They  obey  the  Pauli  exclusion  principle, 
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and  the  degenerate  neutron  gas  pressure  is  ob¬ 
tained  from  an  expression  analogous  to  (2): 

p5/3 

Pn  OC  - jjj. 


where  mn  is  the  neutron  mass  and  /xn,  the 
molecular  weight  per  free  neutron.  Corre¬ 
spondingly,  the  radius  of  a  neutron  star  is  given 
by 


Rns  oc 


1 


5/3 


If  a  white  dwarf  consists  purely  of  helium, 
lie  =  2;  for  a  neutron  star,  /xn  ~  1.  If  a  white 
dwarf  and  a  neutron  star  have  the  same  mass, 
the  ratio  of  their  radii  is 


flwd  /Mnsy/3/Mn\5/3mn 

^ns  \^wd  /  \Me/ 


%  1  X 


5/3 

x  1840  ~  600. 


Thus  the  radius  of  a  neutron  star  is  about  1/600 
of  that  of  a  white  dwarf.  Typically  Rns  is  about 
10  km. 


15.3  Black  Holes 


If  the  mass  of  a  star  exceeds  Mqv  (Sect.  12.5), 
and  if  it  does  not  lose  mass  during  its  evolu¬ 
tion  it  can  no  longer  reach  any  stable  final  state. 
The  force  of  gravity  will  dominate  over  all  other 
forces,  and  the  star  will  collapse  to  a  black  hole. 
A  black  hole  is  black  because  not  even  light  can 
escape  from  it.  Already  at  the  end  of  the  18th 
century  Laplace  showed  that  a  sufficiently  mas¬ 
sive  body  would  prevent  the  escape  of  light  from 
its  surface.  According  to  classical  mechanics,  the 
escape  velocity  from  a  body  of  radius  R  and 
mass  M  is 


vQ 


2  GM 


R 


This  is  greater  than  the  speed  of  light,  if  the  radius 
is  smaller  than  the  critical  radius 


Rs=2GM/c2.  (15.1) 


Axis  of  rotation 


Fig.  15.8  A  black  hole  is  surrounded  by  a  spherical  event 
horizon.  In  addition  to  this  a  rotating  black  hole  is  sur¬ 
rounded  by  a  flattened  surface  inside  which  no  matter  can 
remain  stationary.  This  region  is  called  the  ergosphere 


The  same  value  for  the  critical  radius,  the  Schwa- 
rzschild  radius ,  is  obtained  from  the  general  the¬ 
ory  of  relativity.  For  example,  for  the  Sun,  Rs 
is  about  3  km;  however,  the  Sun’s  mass  is  so 
small  that  it  cannot  become  a  black  hole  by  nor¬ 
mal  stellar  evolution.  Because  the  mass  of  a  black 
hole  formed  by  stellar  collapse  has  to  be  larger 
than  Mov  the  radius  of  the  smallest  black  holes 
formed  in  this  way  is  about  5-10  km. 

The  properties  of  black  holes  have  to  be  stud¬ 
ied  on  the  basis  of  the  general  theory  of  relativity, 
which  is  beyond  the  scope  of  this  book.  Thus  only 
some  basic  properties  are  discussed  qualitatively. 

An  event  horizon  is  a  surface  through  which 
no  information  can  be  sent  out,  even  in  principle. 
A  black  hole  is  surrounded  by  an  event  horizon  at 
the  Schwarzschild  radius  (Fig.  15.8).  In  the  the¬ 
ory  of  relativity  each  observer  carries  with  him 
his  own  local  measure  of  time.  If  two  observers 
are  at  rest  with  respect  to  each  other  at  the  same 
point  their  clocks  go  at  the  same  rate.  Otherwise 
their  clock  rates  are  different,  and  the  apparent 
course  of  events  differs,  too. 

Near  the  event  horizon  the  different  time  def¬ 
initions  become  significant.  An  observer  falling 
into  a  black  hole  reaches  the  centre  in  a  finite 
time,  according  to  his  own  clock,  and  does  not 
notice  anything  special  as  he  passes  through  the 
event  horizon.  However,  to  a  distant  observer  he 
never  seems  to  reach  the  event  horizon;  his  ve- 
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locity  of  fall  seems  to  decrease  towards  zero  as 
he  approaches  the  horizon. 

The  slowing  down  of  time  also  appears  as 
a  decrease  in  the  frequency  of  light  signals.  The 
formula  for  the  gravitational  redshift  can  be  writ¬ 
ten  in  terms  of  the  Schwarzschild  radius  as  (Ap¬ 
pendix  B) 

A  ^  nc. 

Voo  =  vJ  1 - — =  vJ  1  -  — .  (15.2) 

Here,  v  is  the  frequency  of  radiation  emitted  at 
a  distance  r  from  the  black  hole  and  Voo  the  fre¬ 
quency  observed  by  an  infinitely  distant  observer. 
It  can  be  seen  that  the  frequency  at  infinity  ap¬ 
proaches  zero  for  radiation  emitted  near  the  event 
horizon. 

Since  the  gravitational  force  is  directed  to¬ 
wards  the  centre  of  the  hole  and  depends  on  the 
distance,  different  parts  of  a  falling  body  feel 
a  gravitational  pull  that  is  different  in  magnitude 
and  direction.  The  tidal  forces  become  extremely 
large  near  a  black  hole  so  that  any  material  falling 
into  the  hole  will  be  torn  apart.  All  atoms  and  el¬ 
ementary  particles  are  destroyed  near  the  central 
point,  and  the  final  state  of  matter  is  unknown  to 
present-day  physics.  The  observable  properties  of 
a  black  hole  do  not  depend  on  how  it  was  made. 

Not  only  all  information  on  the  material  com¬ 
position  disappears  as  a  star  collapses  into  a  black 
hole;  any  magnetic  field,  for  example,  also  disap¬ 
pears  behind  the  event  horizon.  A  black  hole  can 
only  have  three  observable  properties:  mass,  an¬ 
gular  momentum  and  electric  charge. 

It  is  improbable  that  a  black  hole  could  have 
a  significant  net  charge.  An  electrically  charged 
black  hole  would  attract  particles  with  opposite 
charge  until  it  became  neutral.  Rotation,  on  the 
other  hand,  is  typical  to  stars,  and  thus  black 
holes,  too,  must  rotate.  Since  the  angular  momen¬ 
tum  is  conserved,  stars  collapsed  to  black  holes 
must  rotate  very  fast. 

In  1963  Roy  Kerr  managed  to  find  a  solution 
of  the  field  equations  for  a  rotating  black  hole.  In 
addition  to  the  event  horizon  a  rotating  hole  has 
another  limiting  surface,  an  ellipsoidal  static  limit 
(Fig.  15.8).  Objects  inside  the  static  limit  cannot 
be  kept  stationary  by  any  force,  but  they  must 
orbit  the  hole.  However,  it  is  possible  to  escape 


from  the  region  between  the  static  limit  and  the 
event  horizon,  called  the  ergo  sphere.  In  fact  it  is 
possible  to  utilise  the  rotational  energy  of  a  black 
hole  by  dropping  an  object  to  the  ergosphere  in 
such  a  way  that  part  of  the  object  falls  into  the 
hole  and  another  part  is  slung  out.  The  outcom- 
ing  part  may  then  have  considerably  more  kinetic 
energy  than  the  original  object. 

At  present  the  only  known  way  in  which 
a  black  hole  could  be  directly  observed  is  by 
means  of  the  radiation  from  gas  falling  into  it. 
For  example,  if  a  black  hole  is  part  of  a  binary 
system,  gas  streaming  from  the  companion  will 
settle  into  a  disk  around  the  hole.  Matter  at  the 
inner  edge  of  the  disk  will  fall  into  the  hole.  The 
accreting  gas  will  lose  a  considerable  part  of  its 
energy  (up  to  40  %  of  the  rest  mass)  as  radiation, 
which  should  be  observable  in  the  X-ray  region. 

Some  rapidly  and  irregularly  varying  X-ray 
sources  of  the  right  kind  have  been  discovered. 
The  first  strong  evidence  for  black  hole  in  an 
X-ray  binary  was  for  CygnusX-1  (Fig.  15.9).  Its 
luminosity  varies  on  the  time  scale  of  0.001  s, 
which  means  that  the  emitting  region  must  be 
only  0.001  light-seconds  or  a  few  hundred  kilo¬ 
metres  in  size.  Only  neutron  stars  and  black  holes 
are  small  and  dense  enough  to  give  rise  to  such 
high-energy  processes.  CygnusX-1  is  the  smaller 
component  of  the  double  system  HDE  226868. 
The  larger  component  is  an  optically  visible  su¬ 
pergiant  with  a  mass  20-25  M©.  The  mass  of 
the  unseen  component  has  been  calculated  to  be 
10-15  AT©.  If  this  is  correct,  the  mass  of  the  sec¬ 
ondary  component  is  much  larger  than  the  up¬ 
per  limit  for  a  neutron  star,  and  thus  it  has  to  be 
a  black  hole. 

Today  20  such  systems  are  known,  where  the 
compact  component  has  a  mass  larger  than  3  Af©, 
and  therefore  is  probably  a  black  hole.  As  shown 
in  Fig.  15.10  these  can  be  of  very  different  sizes. 
Nearly  all  of  them  have  been  discovered  as  X-ray 
novae. 

Many  frightening  stories  about  black  holes 
have  been  invented.  It  should  therefore  be  stressed 
that  they  obey  the  same  dynamical  laws  as  other 
stars — they  are  not  lurking  in  the  darkness  of 
space  to  attack  innocent  passers-by.  If  the  Sun 
became  a  black  hole,  the  planets  would  continue 
in  their  orbits  as  if  nothing  had  happened. 


322 


15  Compact  Stars 


Fig.  15.9  The  arrow  shows  the  variable  star  VI 357  Cyg. 
Its  companion  is  the  X-ray  source  Cygnus  X-l,  suspected 
to  be  a  black  hole.  Cyg  X-l  itself  is  so  faint  that  it  can  be 
observed  by  its  X-ray  radiation  only.  The  bright  star  to  the 
lower  right  of  V 1357  is  r/  Cygni,  one  of  the  brightest  stars 
in  the  constellation  Cygnus 

So  far  we  have  discussed  only  black  holes  with 
masses  in  the  range  of  stellar  masses.  There  is 
however  no  upper  limit  to  the  mass  of  a  black 
hole.  Many  active  phenomena  in  the  nuclei  of 
galaxies  can  be  explained  with  supermassive 
black  holes  with  masses  of  millions  or  thousands 
of  millions  solar  masses  (see  Sect.  19.7). 


15.4  X-ray  Binaries 

Close  binaries  where  a  neutron  star  or  a  black 
hole  is  accreting  matter  from  its  companion,  usu¬ 
ally  a  main  sequence  star,  will  be  visible  as 
strong  X-ray  sources.  They  are  generally  classi¬ 
fied  as  high-mass  X-ray  binaries  (HMXB),  when 
the  companion  has  a  mass  larger  than  about 
10  M0,  and  low-mass  X-ray  binaries  (LMXB) 
with  a  companion  mass  smaller  than  1.2  M©. 
In  HMXBs  the  source  of  the  accreted  material 


is  a  strong  stellar  wind.  LMXBs  are  produced 
by  Roche-lobe  overflow  of  the  companion  star, 
either  because  the  major  axis  of  the  binary  de¬ 
creases  due  to  angular  momentum  loss  from  the 
system,  or  else  because  the  radius  of  the  compan¬ 
ion  is  increasing  as  it  evolves. 

Because  of  the  rapid  evolution  of  the  massive 
component  in  HMXBs  these  systems  are  young 
and  short-lived,  105-107  a.  In  LMXBs  the  life¬ 
time  is  determined  by  the  mass-transfer  process, 
and  may  be  longer,  107— 1010  a.  In  many  respects 
they  are  similar  to  cataclysmic  variables,  and  may 
give  rise  to  analogous  phenomena. 

Many  kinds  of  variable  X-ray  sources  have 
been  discovered  since  they  were  first  observed  in 
the  1970’s.  Among  these,  the  X-ray  pulsars  and 
the  X-ray  bursters  can  only  be  neutron  stars.  In 
other  types  of  X-ray  binaries  it  can  be  difficult  to 
determine  whether  the  primary  is  a  neutron  star 
or  a  black  hole. 

Neutron  stars  and  black  holes  are  formed  in 
supernova  explosions,  and  in  a  binary  system  the 
explosion  would  normally  be  expected  to  disrupt 
the  binary.  An  X-ray  binary  will  only  form  under 
special  conditions.  Some  examples  are  shown  in 
Sect.  12.6. 

X-ray  Pulsars  X-ray  pulsars  always  belong  to 
binary  systems  and  may  be  either  HMXBs  or 
LMXBs.  The  pulse  periods  of  X-ray  pulsars  in 
high-mass  systems  are  significantly  longer  than 
those  of  radio  pulsars,  from  a  few  seconds  to  tens 
of  minutes.  In  contrast  to  radio  pulsars,  the  period 
of  the  pulsed  emission  of  these  pulsars  decreases 
with  time. 

The  characteristic  properties  of  X-ray  pul¬ 
sars  can  be  understood  from  their  binary  nature. 
A  neutron  star  formed  in  a  binary  system  is  first 
seen  as  a  normal  radio  pulsar.  Initially,  the  strong 
radiation  of  the  pulsar  prevents  gas  from  falling 
onto  it.  However,  as  it  slows  down,  its  energy  de¬ 
creases,  and  eventually  the  stellar  wind  from  the 
companion  can  reach  its  surface.  The  incoming 
gas  is  channelled  to  the  magnetic  polar  caps  of 
the  neutron  star,  where  it  emits  strong  X-ray  ra¬ 
diation  as  it  hits  the  surface.  This  produces  the 
observed  pulsed  emission. 

In  low-mass  systems,  the  angular  momentum 
of  the  incoming  gas  speeds  up  the  rotation  of 
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Fig.  15.10  Scale  drawings 
of  16  black-hole  binaries  in 
the  Milky  Way  (courtesy  of 
J.  Orosz).  The 
Sun-Mercury  distance 
(0.4  AU)  is  shown  at  the 
top.  The  estimated  binary 
inclination  is  indicated  by 
the  tilt  of  the  accretion 
disk.  (R.A.  Remillard, 

J.E.  McClintock  2006, 
ARAA  44,  54) 
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the  pulsar.  The  maximum  possible  rotation  rate 
of  a  neutron  star  before  centrifugal  forces  start  to 
break  it  up  corresponds  to  a  period  of  about  a  mil¬ 
lisecond.  A  few  millisecond  pulsars  with  periods 
of  this  order  are  known,  both  in  the  radio  and  in 
the  X-ray  region.  It  is  thought  that  these  are  (or, 
in  the  radio  case,  have  once  been)  members  of  bi¬ 
nary  systems. 

The  emission  curve  of  a  typical  fast  X-ray  pul¬ 
sar,  Hercules  XI,  is  shown  in  Fig.  15.11.  The  pe¬ 
riod  of  the  pulses  is  1.24  s.  This  neutron  star  is 
part  of  an  eclipsing  binary  system,  known  from 
optical  observations  as  HZHerculis.  The  orbital 
properties  of  the  system  can  therefore  be  deter¬ 
mined.  Thus  e.g.  the  mass  of  the  pulsar  is  about 
one  solar  mass,  reasonable  for  a  neutron  star. 

X-ray  Bursters  X-ray  bursters  are  irregular 
variables,  showing  sudden  brightenings,  known 
as  type  I  X-ray  bursts,  at  random  times 
(Fig.  15.12).  The  typical  interval  between  out¬ 
bursts  is  a  few  hours  or  days,  but  more  rapid 


bursters  are  also  known.  The  strength  of  the  out¬ 
burst  seems  to  be  related  to  the  recharging  time. 

Type  I  X-ray  bursts  are  analogous  to  the  erup¬ 
tions  of  classical  novae.  However,  the  source  of 
radiation  in  X-ray  bursters  cannot  be  the  igni¬ 
tion  of  hydrogen,  since  the  maximum  emission 
is  in  the  X-ray  region.  Instead,  gas  from  the  com¬ 
panion  settles  on  the  surface  of  the  neutron  star, 
where  hydrogen  burns  steadily  to  helium.  Then, 
when  the  growing  shell  of  helium  reaches  a  criti¬ 
cal  temperature,  it  bums  to  carbon  in  a  rapid  he¬ 
lium  flash.  Since,  in  this  case,  there  are  no  thick 
damping  outer  layers,  the  flash  appears  as  a  burst 
of  X-ray  radiation. 

X-ray  Novae  The  X-ray  pulsars  and  bursters 
have  to  be  neutron  stars.  Other  X-ray  binaries 
may  be  either  neutron  stars  or  black  holes.  All 
compact  X-ray  sources  are  variable  to  some  ex¬ 
tent.  In  the  persistent  sources  the  variations  are 
moderate,  an  the  sources  always  visible.  The  ma¬ 
jority  of  sources  are  transient. 
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Fig.  1 5.1 1  The  pulses  of 
the  X-ray  pulsar 
Hercules  XI  have  the 
period  1.24  s.  The 
best-fitting  curve  has  been 
superimposed  on  the 
observations.  (Tananbaum, 
H.  et  al.  (1972):  Astrophys. 
J.  (Lett.)  174,  L143) 


Fig.  15.12  The  variations 
of  the  rapid  X-ray  burster 
MXB  1730-335.  An 
100  second  interval  is 
marked  in  the  diagram. 
(Lewin,  W.H.G.  (1977): 
Ann.  N.Y.  Acad.  Sci.  302, 
310) 


If  the  X-ray  bursters  correspond  to  classical 
novae,  the  counterparts  of  dwarf  novae  are  the 
X-ray  novae ,  also  known  as  soft  X-ray  transients 
(SXT).  Quantitatively  there  are  large  differences 
between  these  types  of  systems.  Dwarf  novae 
have  outbursts  lasting  for  a  few  days  at  intervals 
of  a  few  months,  for  SXTs  the  outbursts  hap¬ 
pen  at  decade-long  intervals  and  last  for  months. 
A  dwarf  nova  brightens  by  a  factor  about  100  dur¬ 
ing  outbursts,  a  SXT  by  a  factor  of  106.  The  light- 
curves  of  neutron- star  and  black-hole  SXTs  are 
compared  in  Fig.  15.13. 


The  SXTs  are  alternating  between  (at  least) 
two  states:  During  the  high  state  thermal  radia¬ 
tion  from  the  accretion  disk  dominates,  whereas 
in  the  low  state  the  X-ray  have  a  higher  energy, 
and  are  produced  by  Compton  scattering  by  hot 
electrons  in  a  disk  corona  or  a  jet. 

Microquasars  One  interesting  aspect  of  X-ray 
binaries  is  their  connection  to  models  of  active 
galactic  nuclei  (AGN,  Sect.  19.7).  In  both  sys¬ 
tems  a  black  hole,  which  in  the  case  of  AGN  may 
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have  a  mass  in  the  range  106-1010MQ,  is  sur¬ 
rounded  by  an  accretion  disk. 

In  an  X-ray  binary  there  is  similarly  an  accre¬ 
tion  disk  surrounding  a  compact  object,  a  stellar- 
mass  black  hole.  It  will  exhibit  phenomena  in 
many  respects  similar  to  those  in  AGN.  Since 
the  galactic  sources  are  much  nearer,  and  vary  on 
much  shorter  time-scales,  they  may  allow  more 
detailed  observations  of  these  phenomena. 


Fig.  15.1 3  Light  -curves  of  a  neutron-star  (Aql  X-l)  and  a 
black-hole  (GRO  J1655-40)  transient  source,  as  observed 
by  the  All  Sky  Monitor  on  RXTE.  (D.  Psaltis  2006,  in 
Compact  Stellar  X-ray  Sources,  ed.  Lewin,  vdKlis,  CUP, 
p.  16,  Fig.  1.9) 


For  example,  relativistic  jets  perpendicular  to 
the  disk  are  common  in  AGN,  and  they  can  also 
be  expected  in  X-ray  binaries.  A  few  examples 
of  such  microquasars  have  been  discovered,  see 
Fig.  15.14. 

Furthermore,  in  AGN  the  jet  may  sometimes 
be  pointing  straight  at  us.  Relativistic  effects  will 
then  lead  to  a  brightening  of  the  source.  In  a  mi¬ 
croquasar  there  might  be  a  similar  effect,  which 
would  provide  one  explanation  for  the  ultralu- 
minous  X-ray  sources  (ULX),  sources  which  ap¬ 
pear  to  be  too  luminous  to  be  produced  by  or¬ 
dinary  stellar-mass  black  holes.  This  is  impor¬ 
tant,  because  according  to  an  alternative  model 
ULXs  contain  an  intermediate  mass  black  hole 
with  a  mass  about  103  M©.  The  origin  of  such  in¬ 
termediate  mass  black  holes,  if  they  exist,  is  an 
intriguing  problem. 


15.5  Examples 

Example  15.1  Assume  that  the  Sun  collapses 
into  a  neutron  star  with  a  radius  of  20  km. 
(a)  What  will  be  the  mean  density  of  the  neutron 
star?  (b)  What  would  be  it  rotation  period? 


Fig.  15.14  Observed 
outburst  of  the  microquasar 
GRS  1915+105  on 
9  September,  1997.  The 
disappearance  of  the 
internal  part  of  the 
accretion  disc  (decrease  in 
the  X-ray  flux)  is  followed 
by  an  ejection  of 
relativistic  plasma  clouds 
(oscillation  in  the  infrared 
and  radio).  (S.  Chaty, 
astro-ph/0607 668) 
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(a)  The  mean  density  is 

M©  2  x  1030  kg 

^  ^7rR3  ^7r(20  x  103)3  m3 

~6x  1016  kg/m3. 


For  the  Sun  we  have 


2x6.67x10  11m3s  2  kg  1  x  1.989  x  1030  kg 
(2.998 x  108  ms"1)2 


=  2950  m. 


One  cubic  millimetre  of  this  substance  would 
weigh  60  million  kilos. 

(b)  To  obtain  an  exact  value,  we  should  take 
into  account  the  mass  distributions  of  the  Sun  and 
the  resulting  neutron  star.  Very  rough  estimates 
can  be  found  assuming  that  both  are  homoge- 

0  o 

neous.  Then  the  moment  of  inertia  is  I  = 
and  the  angular  momentum  is  L  =  loo.  The  rota¬ 
tion  period  is  then  obtained  as  in  Example  12.1: 


2 


=  25  d 


20  x  103  m 
6.96  x  108  m 


2 

=  2.064  x  1CT8  d 


0.0018  s. 


The  Sun  would  make  over  550  revolutions  per 
second. 


Example  15.2  What  should  be  the  radius  of  the 
Sun  if  the  escape  velocity  from  the  surface  were 
to  exceed  the  speed  of  light? 

The  escape  velocity  exceeds  the  speed  of  light 
if 


or 


2  GM 


R 


>  c 


2  GM 

R  <  — =  *s. 


15.6  Exercises 

Exercise  15.1  The  mass  of  a  pulsar  is  1.5  M©, 
radius  10  km,  and  rotation  period  0.033  s.  What 
is  the  angular  momentum  of  the  pulsar?  Varia¬ 
tions  of  0.0003  s  are  observed  in  the  period.  If 
they  are  due  to  radial  oscillations  (“starquakes”), 
how  large  are  these  oscillations? 

Exercise  15.2  In  Dragon  \s  Egg  by  Robert  L. 
Forward  a  spaceship  orbits  a  neutron  star  at  a  dis¬ 
tance  of  406  km  from  the  centre  of  the  star.  The 
orbital  period  is  the  same  as  the  rotation  period  of 
the  star,  0.1993  s. 

(a)  Find  the  mass  of  the  star  and  the  gravitational 
acceleration  felt  by  the  spaceship. 

(b)  What  is  the  effect  of  the  gravitation  on 
a  175  cm  tall  astronaut,  if  (s)he  stands  with 
her/his  feet  pointing  towards  the  star?  And  if 
(s)he  is  lying  tangential  to  the  orbit? 

Exercise  15.3  A  photon  leaves  the  surface  of 
a  star  at  a  frequency  ve.  An  infinitely  distant  ob¬ 
server  finds  that  its  frequency  is  v.  If  the  differ¬ 
ence  is  due  to  gravitation  only,  the  change  in  the 
energy  of  the  photon,  h  Av,  equals  the  change  in 
its  potential  energy.  Find  the  relation  between  v 
and  vQ ,  assuming  the  mass  and  radius  of  the  star 
are  M  and  R.  How  much  will  the  solar  radiation 
redshift  on  its  way  to  the  Earth? 
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Although  most  of  the  mass  of  the  Milky  Way 
Galaxy  is  condensed  into  stars,  interstellar  space 
is  not  completely  empty.  It  contains  gas  and  dust 
in  the  form  both  of  individual  clouds  and  of  a  dif¬ 
fuse  medium.  Interstellar  space  typically  contains 
about  one  gas  atom  per  cubic  centimetre  and 
100  dust  particles  per  cubic  kilometre. 

Altogether,  about  10  %  of  the  mass  of  the 
Milky  Way  consists  of  interstellar  gas.  Since  the 
gas  is  strongly  concentrated  in  the  galactic  plane 
and  the  spiral  arms,  in  these  regions  there  are 
many  places  where  the  quantities  of  stars  and  in¬ 
terstellar  matter  are  about  equal.  The  dust  (a  bet¬ 
ter  name  would  be  “smoke”,  since  the  particle 
sizes  are  much  smaller  than  in  terrestrial  dust) 
constitutes  about  one  percent  of  the  gas.  High- 
energy  cosmic  ray  particles  are  mixed  with  the 
gas  and  dust.  There  is  also  a  weak,  but  still  very 
important,  galactic  magnetic  field. 

At  present  the  most  important  observations  of 
the  interstellar  medium  are  made  at  radio  and  in¬ 
frared  wavelengths,  since  the  peak  of  the  emis¬ 
sion  often  lies  at  these  wavelengths.  But  many 
forms  of  interstellar  matter  (such  as  solid  bod¬ 
ies  with  diameters  larger  than  1  mm)  would  be 
almost  impossible  to  detect  on  the  basis  of  their 
emission  or  absorption.  In  principle,  the  mass  of 
these  forms  of  matter  might  be  larger  than  the  ob¬ 
served  mass  of  all  other  forms  put  together.  How¬ 
ever,  an  upper  limit  on  the  total  mass  of  interstel¬ 
lar  matter,  regardless  of  its  form,  can  be  derived 
on  the  basis  of  its  gravitational  effects.  This  is 
the  Oort  limit.  The  galactic  gravitational  field  is 
determined  by  the  distribution  of  matter.  By  ob¬ 


serving  the  motions  of  stars  perpendicular  to  the 
galactic  plane,  the  vertical  gravitational  force  and 
hence  the  amount  of  mass  in  the  galactic  plane 
can  be  determined.  The  result  is  that  the  local 
density  within  1  kpc  of  the  Sun  is  (7.3-10.0)  x 
10-21  kgm-3.  The  density  of  known  stars  is 
(5. 9-6. 7)  x  10  21  kgm  3  and  that  of  known  in¬ 
terstellar  matter  about  1.7  x  10-21  kgm-3.  Thus 
there  is  very  little  room  for  unknown  forms  of 
mass  in  the  solar  neighbourhood.  However,  the 
limit  concerns  only  the  dark  matter  concentrated 
in  the  galactic  plane.  There  are  indications  that 
the  Milky  Way  is  surrounded  by  a  spherical  halo 
of  dark  matter  (Chap.  18). 


1 6.1  Interstellar  Dust 

The  first  clear  evidence  for  the  existence  of  in¬ 
terstellar  dust  was  obtained  around  1930.  Before 
that,  it  had  been  generally  thought  that  space  is 
completely  transparent  and  that  light  can  propa¬ 
gate  indefinitely  without  extinction. 

In  1930  Robert  Trumpler  published  his  study 
of  the  space  distribution  of  the  open  clusters. 
The  absolute  magnitudes  M  of  the  brightest  stars 
could  be  estimated  on  the  basis  of  the  spectral 
type.  Thus  the  distance  r  to  the  clusters  could 
be  calculated  from  the  observed  apparent  magni¬ 
tudes  m  of  the  bright  stars: 

m-M  =  5  lg  — ^ — .  (16.1) 

10  pc 

Trumpler  also  studied  the  diameters  of  the  clus¬ 
ters.  The  linear  diameter  D  is  obtained  from  the 
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apparent  angular  diameter  d  by  means  of  the  for¬ 
mula 


D  =  dr,  (16.2) 

where  r  is  the  distance  of  the  cluster. 

It  caught  Trumpler’ s  attention  that  the  more 
distant  clusters  appeared  to  be  systematically 
larger  than  the  nearer  ones  (Fig.  16.1).  Since  this 
could  hardly  be  true,  the  distances  of  the  more 
distant  clusters  must  have  been  overestimated. 
Trumpler  concluded  that  space  is  not  completely 
transparent,  but  that  the  light  of  a  star  is  dimmed 
by  some  intervening  material.  To  take  this  into 
account,  (16.1)  has  to  be  replaced  with  (4.17) 

m  —  M  =  5  lg  — ^ - FA,  (16.3) 

10  pc 

where  A  >  0  is  the  extinction  in  magnitudes  due 
to  the  intervening  medium.  If  the  opacity  of  the 
medium  is  assumed  to  be  the  same  at  all  distances 
and  in  all  directions,  A  can  be  written 

A  —  ar,  (16.4) 

where  a  is  a  constant.  Trumpler  obtained  for  the 
average  value  of  a  in  the  galactic  plane,  apg  = 
0.79  mag/kpc,  in  photographic  magnitudes.  At 
present,  a  value  of  2  mag/kpc  is  used  for  the  av¬ 
erage  extinction.  Thus  the  extinction  over  a  5  kpc 
path  is  already  10  magnitudes. 

Extinction  due  to  dust  varies  strongly  with 
direction.  For  example,  visible  light  from  the 


Fig.  16.1  The  diameters  of  open  star  clusters  calculated 
with  the  distance  given  by  the  formula  (15.1)  according 
to  Trumpler  (1930).  The  increase  of  the  diameter  with  dis¬ 
tance  is  not  a  real  phenomenon,  but  an  effect  of  interstellar 
extinction,  which  was  discovered  in  this  way 


galactic  centre  (distance  8-9  kpc)  is  dimmed  by 
30  magnitudes.  Therefore  the  galactic  centre  can¬ 
not  be  observed  at  optical  wavelengths. 

Extinction  is  due  to  dust  grains  that  have  diam¬ 
eters  near  the  wavelength  of  the  light.  Such  par¬ 
ticles  scatter  light  extremely  efficiently.  Gas  can 
also  cause  extinction  by  scattering,  but  its  scatter¬ 
ing  efficiency  per  unit  mass  is  much  smaller.  The 
total  amount  of  gas  allowed  by  the  Oort  limit  is  so 
small  that  scattering  by  gas  is  negligible  in  inter¬ 
stellar  space.  (This  is  in  contrast  with  the  Earth’s 
atmosphere,  where  air  molecules  make  a  signifi¬ 
cant  contribution  to  the  total  extinction). 

Interstellar  particles  can  cause  extinction  in 
two  ways: 

1.  In  absorption  the  radiant  energy  is  trans¬ 
formed  into  heat,  which  is  then  re-radiated 
at  infrared  wavelengths  corresponding  to  the 
temperature  of  the  dust  particles. 

2.  In  scattering  the  direction  of  light  propagation 
is  changed,  leading  to  a  reduced  intensity  in 
the  original  direction  of  propagation. 

An  expression  for  interstellar  extinction  will 
now  be  derived.  The  size,  index  of  refraction  and 
number  density  of  the  particles  are  assumed  to 
be  known.  For  simplicity  we  shall  assume  that  all 
particles  are  spheres  with  the  same  radius  a  and 
the  geometrical  cross  section  n a2.  The  true  ex¬ 
tinction  cross  section  of  the  particles  Cex t  will  be 

Gext  —  QzxiTca  ,  (16.5) 


where  Qqx t  is  the  extinction  efficiency  factor. 

Let  us  consider  a  volume  element  with  length 
d l  and  cross  section  dA,  normal  to  the  direction 
of  propagation  (Fig.  16.2).  It  is  assumed  that  the 
particles  inside  the  element  do  not  shadow  each 
other.  If  the  particle  density  is  n ,  there  are  n  d /  dA 
particles  in  the  volume  element  and  they  will 
cover  the  fraction  dr  of  the  area  dA,  where 


dr  = 


n  d  A  d l  Cext 
dA 


—  n  Cqx t  d  / . 


In  the  length  d /  the  intensity  is  thus  changed  by 


dl  =  —  /dr.  (16.6) 


On  the  basis  of  (16.6)  dr  can  be  identified  as  the 
optical  depth. 
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Fig.  16.2  Extinction  by  a  distribution  of  particles.  In  the 
volume  element  with  length  d /  and  cross  section  d A,  there 
are  n  d A  d /  particles,  where  n  is  the  particle  density  in  the 
medium.  If  the  extinction  cross  section  of  one  particle  is 


Cext>  the  total  area  covered  by  the  particles  is  n  d A  dl  Cex t- 
Thus  the  fractional  decrease  in  intensity  over  the  distance 
dl  is  d ///  =  —  n  d A  dl  Cext/dA  =  —  n  Cext  dl 


The  total  optical  depth  between  the  star  and 
the  Earth  is 


r  nr 

dr  =  I  Tl  Cext  d /  =  Cex \UY , 

Jo 

where  n  is  the  average  particle  density  along  the 
given  path.  According  to  (4.18)  the  extinction  in 
magnitudes  is 


A  =  (2.5  lge)r, 


and  hence 

A(r)  =  (2.5  lge)Cextnr.  (16.7) 

This  formula  can  also  be  inverted  to  calculate  n , 
if  the  other  quantities  are  known. 

The  extinction  efficiency  factor  Gext  can  be 
calculated  exactly  for  spherical  particles  with 
given  radius  a  and  refractive  index  m .  In  general, 

Gext  —  Gabs  T  Q  sea  > 

where 

Gabs  =  absorption  efficiency  factor, 

Gsca  =  scattering  efficiency  factor. 

If  we  define 

x  =  2Tta/X,  (16.8) 

where  A  is  the  wavelength  of  the  radiation,  then 

(16.9) 


Fig.  16.3  Mie  scattering:  the  extinction  efficiency  factor 
for  spherical  particles  for  the  refractive  indices  m  =  1.5 
and  m  —  1.33  (refractive  index  of  water).  The  horizon¬ 
tal  axis  is  related  to  the  size  of  the  particle  according  to 
x  =  2i ra/X,  where  a  is  the  particle  radius  and  X,  the  wave¬ 
length  of  the  radiation 

The  exact  expression  for  Gext  is  a  series  ex¬ 
pansion  in  v  that  converges  more  slowly  for 
larger  values  of  v.  When  v  1,  the  process  is 
called  Rayleigh  scattering ;  otherwise  it  is  known 
as  Mie  scattering.  Figure  16.3  shows  Gext  as 
a  function  of  v  for  m  =  1.5  and  m  =  1.33.  For 
very  large  particles,  (x  1)  Gext  =  2,  as  appears 
from  Fig.  16.3.  Purely  geometrically  one  would 
have  expected  Gext  =  1 ;  the  two  times  larger  scat¬ 
tering  efficiency  is  due  to  the  diffraction  of  light 
at  the  edges  of  the  particle. 

Other  observable  phenomena,  apart  from  ex¬ 
tinction,  are  also  caused  by  interstellar  dust.  One 
of  these  is  the  reddening  of  the  light  of  stars. 
(This  should  not  be  confused  with  the  redshift 
of  spectral  lines.)  Reddening  is  due  to  the  fact 
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Wavelength  A  [nm] 

1000  500  300  200  125  100 


Fig.  16.4  (a)  Schematic  representation  of  the  interstel¬ 
lar  extinction.  As  the  wavelength  increases,  the  extinc¬ 
tion  approaches  zero.  (Drawing  based  on  Greenberg,  J.M. 
(1968):  “Interstellar  Grains”,  in  Nebulae  and  Interstel¬ 
lar  Matter ,  ed.  by  Middlehurst,  B.M.,  Aller,  L.H.,  Stars 

that  the  amount  of  extinction  becomes  larger  for 
shorter  wavelengths.  Going  from  red  to  ultravi¬ 
olet,  the  extinction  is  roughly  inversely  propor¬ 
tional  to  wavelength.  For  this  reason  the  light  of 
distant  stars  is  redder  than  would  be  expected  on 
the  basis  of  their  spectral  class.  The  spectral  class 
is  defined  on  the  basis  of  the  relative  strengths  of 
the  spectral  lines  which  are  not  affected  by  ex¬ 
tinction. 

According  to  (4.20),  the  observed  colour  index 
B  —  V  of  a  star  is 

B  —  V  =  Mg  —  M\  +  Ag  —  Ag 

=  (B  -  V)0  +  EB-y,  (16.10) 

where  (B  —  V) o  is  the  intrinsic  colour  of  the  star 
and  £g-v  the  colour  excess.  As  noted  in  Sect.  4.5 
the  ratio  between  the  visual  extinction  Ay  and  the 
colour  excess  is  approximately  constant: 

Ay  Ay 

R  =  — —  = - - —  «  3.0.  (16.11) 

£g_v  Ag  —  Ay 

R  does  not  depend  on  the  properties  of  the  star 
or  the  amount  of  extinction.  This  is  particularly 
important  in  photometric  distance  determinations 
because  of  the  fact  that  the  colour  excess  £g_y 
can  be  directly  determined  from  the  difference 
between  the  observed  colour  index  B  —  V  and  the 
intrinsic  colour  ( B  —  V)o  known  from  the  spec¬ 


and  Stellar  Systems,  Vol.  VII  (The  University  of  Chicago 
Press,  Chicago)  p.  224).  (b)  Measured  extinction  curve, 
normalised  to  make  Fb-v  =  1.  (Hoyle,  F.,  Narlikar,  J. 
(1980):  The  Physics- Astronomy  Frontier  (W.H.  Freeman 
and  Company,  San  Francisco)  p.  156.  Used  by  permission) 

tral  class.  One  can  then  calculate  the  extinction 

Ay^3.0£g— V  (16.12) 

and  finally  the  distance.  Since  the  interstellar 
medium  is  far  from  homogeneous,  the  colour  ex¬ 
cess  method  gives  a  much  more  reliable  value 
than  using  some  average  value  for  the  extinction 
in  (4.18). 

The  wavelength  dependence  of  the  extinction, 
A  (A),  can  be  studied  by  comparing  the  magni¬ 
tudes  of  stars  of  the  same  spectral  class  in  differ¬ 
ent  colours.  These  measurements  have  shown  that 
A  (A)  approaches  zero  as  A  becomes  very  large. 
In  practice  A  (A)  can  be  measured  up  to  a  wave¬ 
length  of  about  two  micrometres.  The  extrapola¬ 
tion  to  zero  inverse  wavelength  is  then  fairly  re¬ 
liable.  Figure  16.4(a)  shows  A  (A)  as  a  function 
of  inverse  wavelength.  It  also  illustrates  how  the 
quantities  Ay  and  £g_y,  which  are  needed  in  or¬ 
der  to  calculate  the  value  of  R ,  are  obtained  from 
this  extinction  or  reddening  curve.  Figure  16.4(b) 
shows  the  observed  extinction  curve.  The  points 
in  the  ultraviolet  (A  <  0.3  m)  are  based  on  rocket 
measurements. 

It  is  clear  from  Fig.  16.4(b)  that  interstellar  ex¬ 
tinction  is  largest  at  short  wavelengths  in  the  ul¬ 
traviolet  and  decreases  for  longer  wavelengths.  In 
the  infrared  it  is  only  about  ten  percent  of  the  op¬ 
tical  extinction  and  in  the  radio  region  it  is  vanish¬ 
ingly  small.  Objects  that  are  invisible  in  the  opti- 
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cal  region  can  therefore  be  studied  at  infrared  and 
radio  wavelengths. 

Another  observed  phenomenon  caused  by  dust 
is  the  polarisation  of  the  light  of  the  stars.  Since 
spherical  particles  cannot  produce  any  polarisa¬ 
tion,  the  interstellar  dust  particles  have  to  be  non- 
spherical  in  shape.  If  the  particles  in  a  cloud  are 
aligned  by  the  interstellar  magnetic  field,  they 
will  polarise  the  radiation  passing  through  the 
cloud.  The  degree  of  polarisation  and  its  wave¬ 
length  dependence  give  information  on  the  prop¬ 
erties  of  the  dust  particles.  By  studying  the  di¬ 
rection  of  polarisation  in  various  directions,  one 
can  map  the  structure  of  the  galactic  magnetic 
field. 

In  the  Milky  Way  interstellar  dust  is  essen¬ 
tially  confined  to  a  very  thin,  about  100  pc,  layer 
in  the  galactic  plane.  The  dust  in  other  spiral 
galaxies  has  a  similar  distribution  and  is  directly 
visible  as  a  dark  band  in  the  disk  of  the  galaxy 
(Fig.  19.17  bottom).  The  Sun  is  located  near 
the  central  plane  of  the  galactic  dust  layer,  and 
thus  the  extinction  in  the  direction  of  the  galac¬ 
tic  plane  is  very  large,  whereas  the  total  extinc¬ 
tion  towards  the  galactic  poles  may  be  less  than 
0.1  magnitudes.  This  is  apparent  in  the  distribu¬ 
tion  of  galaxies  in  the  sky:  at  high  galactic  lat¬ 
itudes,  there  are  many  galaxies,  while  near  the 
galactic  plane,  there  is  a  20°  zone  where  hardly 
any  galaxies  are  seen.  This  empty  region  is  called 
the  zone  of  avoidance. 

If  a  homogeneous  dust  layer  gives  rise  to  a  to¬ 
tal  extinction  of  Am  magnitudes  in  the  vertical 
direction,  then  according  to  Fig.  16.5,  the  total 
extinction  at  galactic  latitude  b  will  be 

Am(b)  =  Am/ sin/?.  (16.13) 

If  the  galaxies  are  uniformly  distributed  in  space, 
then  in  the  absence  of  extinction,  the  number  of 
galaxies  per  square  degree  brighter  than  the  mag¬ 
nitude  m  would  be 

lg  No(m)  =  0.6m  +  C,  (16.14) 

where  C  is  a  constant  (see  Exercise  17.1).  How¬ 
ever,  due  to  extinction,  a  galaxy  that  would  oth¬ 
erwise  have  the  apparent  magnitude  mo  will  have 


Fig.  16.5  In  a  homogeneous  medium  the  extinction  in 
magnitudes  is  proportional  to  the  pathlength  traversed. 
If  the  extinction  in  the  direction  of  the  galactic  pole  is 
Am,  then  the  extinction  at  the  galactic  latitude  b  will  be 
Am/  sin/? 

the  magnitude 

m(b)  =  mo  +  Am(b)  =  mo  +  Am/  sin  b, 

(16.15) 

where  b  is  the  galactic  latitude.  Thus  the  observ¬ 
able  number  of  galaxies  at  latitude  b  will  be 

lg N(m,  b)  =  lg  No(m  —  A m(b)) 

=  0.6  (m  —  A  m(b))  +  C 
=  \gNo(m)  —  0.6A  m(b) 

or 

lg  N(m,  b)  =  C'  —  0.6—,  (16.16) 

sin  b 

where  C^lgA^m)  does  not  depend  on  the 
galactic  latitude.  By  making  galaxy  counts  at  var¬ 
ious  latitudes  b ,  the  extinction  Am  can  be  de¬ 
termined.  The  value  obtained  from  galaxy  counts 
made  at  Lick  Observatory  is  Ampg  =  0.51  mag. 

The  total  vertical  extinction  of  the  Milky  Way 
has  also  been  determined  from  the  colour  ex¬ 
cesses  of  stars.  These  investigations  have  yielded 
much  smaller  extinction  values,  about  0. 1  mag.  In 
the  direction  of  the  north  pole,  extinction  is  only 
0.03  mag.  The  disagreement  between  the  two  ex¬ 
tinction  values  is  probably  largely  due  to  the  fact 
that  the  dust  layer  is  not  really  homogeneous.  If 
the  Sun  is  located  in  a  local  region  of  low  dust 
content,  the  view  towards  the  galactic  poles  might 
be  almost  unobstructed  by  dust. 
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Fig.  16.6 


The  Coalsack  is  a  dark  nebula  next  to  the  Southern  Cross.  (Photograph  K.  Mattila,  Helsinki  University) 


Dark  Nebulae  Observations  of  other  galaxies 
show  that  the  dust  is  concentrated  in  the  spi¬ 
ral  arms,  in  particular  at  their  inner  edge.  In  ad¬ 
dition  dust  is  concentrated  in  individual  clouds, 
which  appear  as  star-poor  regions  or  dark  nebulae 
against  the  background  of  the  Milky  Way.  Exam¬ 
ples  of  dark  nebulae  are  the  Coalsack  in  the  south¬ 
ern  sky  (Fig.  16.6)  and  the  Horsehead  nebula 
in  Orion.  Sometimes  the  dark  nebulae  form  ex¬ 
tended  winding  bands,  and  sometimes  small,  al¬ 
most  spherical,  objects.  Objects  of  the  latter  type 
are  most  easy  to  see  against  a  bright  background, 
e.g.  a  gas  nebula  (see  Fig.  16.19).  These  objects 
have  been  named  globules  by  Bart  J.  Bok ,  who 


put  forward  the  hypothesis  that  they  are  clouds 
that  are  just  beginning  to  contract  into  stars. 

The  extinction  by  a  dark  nebula  can  be  illus¬ 
trated  and  studied  by  means  of  a  Wolf  diagram , 
shown  schematically  in  Fig.  16.7.  The  diagram  is 
constructed  on  the  basis  of  star  counts.  The  num¬ 
ber  of  stars  per  square  degree  in  some  magnitude 
interval  (e.g.  between  magnitudes  14  and  15)  in 
the  cloud  is  counted  and  compared  with  the  num¬ 
ber  outside  the  nebula.  In  the  comparison  area, 
the  number  of  stars  increases  monotonically  to¬ 
wards  fainter  magnitudes.  In  the  dark  nebula  the 
numbers  first  increase  in  the  same  way,  but  be¬ 
yond  some  limiting  magnitude  (10  in  the  figure) 
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Fig.  1 6.7  Wolf  diagram.  The  horizontal  coordinate  is  the 
magnitude  and  the  vertical  coordinate  is  the  number  of 
stars  per  square  degree  in  the  sky  brighter  than  that  mag¬ 
nitude.  A  dark  nebula  diminishes  the  brightness  of  stars 
lying  behind  it  by  the  amount  Am 

the  number  of  stars  falls  below  that  outside  the 
cloud.  The  reason  for  this  is  that  the  fainter  stars 
are  predominantly  behind  the  nebula,  and  their 
brightness  is  reduced  by  some  constant  amount 
Am  (2  magnitudes  in  the  figure).  The  brighter 
stars  are  mostly  in  front  of  the  nebula  and  suffer 
no  extinction. 

Reflection  Nebulae  If  a  dust  cloud  is  near 
a  bright  star,  it  will  scatter,  i.e.  reflect  the  fight 
of  the  star.  Thus  individual  dust  clouds  can  some¬ 
times  be  observed  as  bright  reflection  nebulae. 
Some  500  reflection  nebulae  are  known. 

The  regions  in  the  sky  richest  in  reflection 
nebulae  are  the  areas  around  the  Pleiades  and 
around  the  giant  star  Antares.  Antares  itself  is 
surrounded  by  a  large  red  reflection  nebula.  This 
region  is  shown  in  Fig.  16.8.  Figure  16.9  shows 
the  reflection  nebula  NGC  2068,  which  is  located 
near  a  large,  thick  dust  cloud  a  few  degrees  north¬ 
west  of  Orion’s  belt.  It  is  one  of  the  brightest 
reflection  nebulae  and  the  only  one  included  in 
the  Messier  catalogue  (M78).  In  the  middle  of 
the  nebula  there  are  two  stars  of  about  1 1  mag¬ 
nitudes.  The  northern  star  illuminates  the  nebula, 
while  the  other  one  is  probably  in  front  of  the 
nebula.  Figure  16.10  shows  the  reflection  nebula 


NGC  1435  around  Merope  in  the  Pleiades.  An¬ 
other  bright  and  much- studied  reflection  nebula  is 
NGC  7023  in  Cepheus.  It,  too,  is  connected  with 
a  dark  nebula.  The  illuminating  star  has  emission 
lines  in  its  spectrum  (spectral  type  Be).  Infrared 
stars  have  also  been  discovered  in  the  area  of  the 
nebula,  probably  a  region  of  star  formation. 

In  1922  Edwin  Hubble  published  a  fundamen¬ 
tal  investigation  of  bright  nebulae  in  the  Milky 
Way.  On  the  basis  of  extensive  photographic  and 
spectroscopic  observations,  he  was  able  to  estab¬ 
lish  two  interesting  relationships.  First  he  found 
that  emission  nebulae  only  occur  near  stars  with 
spectral  class  earlier  than  B0,  whereas  reflec¬ 
tion  nebulae  may  be  found  near  stars  of  spectral 
class  B1  and  later.  Secondly  Hubble  discovered 
a  relationship  between  the  angular  size  R  of  the 
nebula  and  the  apparent  magnitude  m  of  the  illu¬ 
minating  star: 

5  lg  R  =  —  m  +  const.  (16.17) 

Thus  the  angular  diameter  of  a  reflection  neb¬ 
ula  is  larger  for  a  brighter  illuminating  star.  Since 
the  measured  size  of  a  nebula  generally  increases 
for  longer  exposures,  i.e.  fainter  limiting  surface 
brightness,  the  value  of  R  should  be  defined  to 
correspond  to  a  fixed  limiting  surface  brightness. 
The  value  of  the  constant  in  the  Hubble  relation 
depends  on  this  limiting  surface  brightness.  The 
Hubble  relation  for  reflection  nebulae  is  shown  in 
Fig.  16.1 1,  based  on  measurements  by  Sidney  van 
den  Bergh  from  Palomar  Sky  Atlas  plates.  Each 
point  corresponds  to  a  reflection  nebula  and  the 
straight  line  represents  the  relation  (16.17),  where 
the  value  of  the  constant  is  12.0  ( R  is  given  in  arc 
minutes). 

The  Hubble  relation  can  be  derived  theo¬ 
retically,  if  it  is  assumed  that  the  illumination 
of  a  dust  cloud  is  inversely  proportional  to  the 
square  of  the  distance  to  the  illuminating  star,  and 
that  the  dust  clouds  are  uniformly  distributed  in 
space.  The  theoretical  Hubble  relation  also  gives 
an  expression  for  the  constant  on  the  right-hand 
side,  which  involves  the  albedo  and  the  phase 
function  of  the  grains. 

The  observations  of  reflection  nebulae  show 
that  the  albedo  of  interstellar  grains  must  be  quite 
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Fig.  1 6.8  Bright  and  dark  nebulae  in  Scorpius  and  Ophi- 
uchus.  Photograph  (a)  was  taken  in  the  blue  colour  re¬ 
gion,  X  =  350-500  nm,  and  (b)  in  the  red  colour  region, 
X  =  600-680  nm.  (The  sharp  rings  in  (b)  are  reflections 
of  Antares  in  the  correction  lens  of  the  Schmidt  camera.) 
The  nebulae  located  in  the  area  are  identified  in  draw¬ 
ing  (c).  B44  and  H4  are  dark  nebulae.  There  is  a  large 
reflection  nebula  around  Antares,  which  is  faintly  visible 
in  the  blue  (a),  but  bright  in  the  red  (b)  regions.  Antares 
is  very  red  (spectral  class  Ml)  and  therefore  the  reflec- 

high.  It  has  not  yet  been  possible  to  obtain  its  pre¬ 
cise  numerical  value  in  this  way,  since  the  dis¬ 
tances  between  the  nebulae  and  their  illuminating 
stars  are  not  known  well  enough. 

One  may  also  consider  the  surface  brightness 
of  dark  nebulae  that  are  not  close  enough  to  a  star 
to  be  visible  as  reflection  nebulae.  These  nebulae 
will  still  reflect  the  diffuse  galactic  light  from  all 
the  stars  in  the  Milky  Way.  Calculations  show  that 
if  the  dust  grains  have  a  large  albedo,  then  the 


tion  nebula  is  also  red.  In  contrast,  the  reflection  nebu¬ 
lae  around  the  blue  stars  p  Ophiuchi  (B2),  CD-24°  12684 
(B3),  22Scorpii  (B2)  and  er  Scorpii  (Bl)  are  blue  and  are 
visible  only  in  (a).  In  (b)  there  is  an  elongated  nebula  to 
the  right  of  er  Scorpii,  which  is  invisible  in  (a).  This  is  an 
emission  nebula,  which  is  very  bright  in  the  red  hydro¬ 
gen  Hq,  line  (656  nm).  In  this  way  reflection  and  emission 
nebulae  can  be  distinguished  by  means  of  pictures  taken  in 
different  wavelength  regions.  (Photograph  (a)  E.  Barnard, 
and  (b)  K.  Mattila) 

reflected  diffuse  light  should  be  bright  enough  to 
be  observable,  and  it  has  indeed  been  observed. 
Thus  the  dark  nebulae  are  not  totally  dark.  The 
diffuse  galactic  light  constitutes  about  20-30  % 
of  the  total  brightness  of  the  Milky  Way. 

Dust  Temperature  In  addition  to  scattering 
the  interstellar  grains  also  absorb  radiation.  The 
absorbed  energy  is  re-radiated  by  the  grains 
at  infrared  wavelengths  corresponding  to  their 


16.1  Interstellar  Dust 


335 


Fig.  16.9  The  reflection 
nebula  NGC  2068  (M78)  in 
Orion.  In  the  middle  of  the 
nebula  there  are  two  stars 
of  about  magnitude  1 1 .  The 
northern  one  (at  the  top )  is 
the  illuminating  star,  while 
the  other  one  probably  lies 
in  the  foreground. 
(Photography  Lunar  and 
Planetary  Laboratory, 
Catalina  Observatory) 


Fig.  16.10  The  reflection 
nebula  NGC  1435  around 
Merope  (23  Tau,  spectral 
class  B6)  in  the  Pleiades. 
This  figure  should  be 
compared  with  Fig.  16.1, 
where  Merope  is  visible  as 
the  lowest  of  the  bright 
stars  in  the  Pleiades. 
(National  Optical 
Astronomy  Observatories, 
Kitt  Peak  National 
Observatory) 
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Nebular  radius  R  [Arcminutes] 


Fig.  16.11  The  Hubble  relation  for  reflection  nebulae. 
The  horizontal  axis  shows  the  radius  R  of  the  nebulae  in 
arc  minutes  and  the  vertical  axis ,  the  (blue)  apparent  mag¬ 
nitude  m  of  the  central  star.  No  measurements  were  made 
in  the  hatched  region,  (van  den  Bergh,  S.  (1966):  Astron. 
J.  71,  990) 


temperatures.  The  temperature  of  dust  in  inter¬ 
stellar  space  (including  dark  nebulae)  is  about 
10-20  K.  The  corresponding  wavelength  ac¬ 
cording  to  Wien’s  displacement  law  (5.21)  is 
300-150  pm.  Near  a  hot  star  the  temperature  of 
the  dust  may  be  100-600  K  and  the  maximum 
emission  is  then  at  30-5  pm.  In  H II  regions  the 
dust  temperature  is  about  70-100  K. 

The  rapid  development  of  infrared  astronomy 
in  the  1970’s  has  brought  all  the  above-mentioned 
dust  sources  within  the  reach  of  observations 
(Fig.  16.12).  In  addition  infrared  radiation  from 
the  nuclei  of  normal  and  active  galaxies  is  largely 
thermal  radiation  from  dust.  Thermal  dust  emis¬ 
sion  is  one  of  the  most  important  sources  of  in¬ 
frared  radiation  in  astronomy. 


Fig.  16.12  Interstellar  dust  is  best  seen  in  infrared  wave¬ 
lengths.  Two  examples  of  the  images  by  the  IRAS  satel¬ 
lite.  (a)  In  a  view  towards  the  Galactic  centre,  the  dust  is 
seen  to  be  concentrated  in  a  narrow  layer  in  the  galactic 
plane.  Several  separate  clouds  are  also  seen,  (b)  Most  of 


the  constellation  Orion  is  covered  by  a  complex  area  of  in¬ 
terstellar  matter.  The  densest  concentrations  of  dust  below 
the  centre  of  the  image,  are  in  the  region  of  the  Horsehead 
nebula  and  the  Orion  nebula.  (Photos  NASA) 
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Wavelength  A 

Fig.  16.13  More  than  99  %  of  the  radiation  from  the 
^Carinae  nebula  (Fig.  13.10)  is  in  the  infrared.  The 
peak  in  the  visual  region  is  from  the  hydrogen  line 
(0.66  pm).  In  the  infrared,  the  silicate  emission  from  dust 
is  evident  at  10  pm.  (Allen,  D.A.  (1975):  Infrared,  the  New 
Astronomy  (Keith  Reid  Ltd.,  Shaldon)  p.  103) 

One  of  the  strongest  infrared  sources  in  the 
sky  is  the  nebula  around  the  star  17  Carinae.  The 
nebula  consists  of  ionised  gas,  but  infrared  radi¬ 
ation  from  dust  is  also  clearly  visible  in  its  spec¬ 
trum  (Fig.  16.13).  In  even  more  extreme  cases, 
the  central  star  may  be  completely  obscured, 
but  revealed  by  the  infrared  emission  from  hot 
dust. 

Composition  and  Origin  of  the  Dust  (Ta¬ 
bles  16.1  and  16.2).  From  the  peaks  in  the  ex¬ 
tinction  curve,  it  may  be  concluded  that  inter¬ 
stellar  dust  contains  water  ice  and  silicates,  and 
probably  graphite  as  well.  The  sizes  of  the  grains 
can  be  deduced  from  their  scattering  properties; 
usually  they  are  smaller  than  one  micrometre. 
The  strongest  scattering  is  due  to  grains  of  about 
0.3  pm  but  smaller  particles  must  also  be  present. 

Dust  grains  are  formed  in  the  atmospheres  of 
stars  of  late  spectral  types  (K,  M).  Gas  condenses 


into  grains  just  as  water  in  the  Earth’s  atmosphere 
may  condense  into  snow  and  ice.  The  grains  are 
then  expelled  into  interstellar  space  by  the  ra¬ 
diation  pressure.  Grains  may  also  form  in  con¬ 
nection  with  star  formation  and  possibly  directly 
from  atoms  and  molecules  in  interstellar  clouds 
as  well. 


1 6.2  Interstellar  Gas 

The  mass  of  gas  in  interstellar  space  is  a  hundred 
times  larger  than  that  of  dust.  Although  there  is 
more  gas,  it  is  less  easily  observed,  since  the  gas 
does  not  cause  a  general  extinction  of  light.  In  the 
optical  region  it  can  only  be  observed  on  the  basis 
of  a  small  number  of  spectral  lines. 

The  existence  of  interstellar  gas  began  to  be 
suspected  in  the  first  decade  of  the  20th  cen¬ 
tury,  when  in  1904  Johannes  Hartmann  observed 
that  some  absorption  lines  in  the  spectra  of  cer¬ 
tain  binary  stars  were  not  Doppler  shifted  by 
the  motions  of  the  stars  like  the  other  lines.  It 
was  concluded  that  these  absorption  lines  were 
formed  in  gas  clouds  in  the  space  between  the 
Earth  and  the  stars.  In  some  stars  there  were  sev¬ 
eral  lines,  apparently  formed  in  clouds  moving 
with  different  velocities.  The  strongest  lines  in 
the  visible  region  are  those  of  neutral  sodium 
and  singly  ionised  calcium  (Fig.  16.14).  In  the 
ultraviolet  region,  the  lines  are  more  numerous. 
The  strongest  one  is  the  hydrogen  Lyman  a  line 
(121.6  nm). 

On  the  basis  of  the  optical  and  ultraviolet  lines, 
it  has  been  found  that  many  atoms  are  ionised  in 
interstellar  space.  This  ionisation  is  mainly  due 
to  ultraviolet  radiation  from  stars  and,  to  some 
extent,  to  ionisation  by  cosmic  rays.  Since  the 
density  of  interstellar  matter  is  very  low,  the  free 
electrons  only  rarely  encounter  ions,  and  the  gas 
remains  ionised. 

About  thirty  elements  have  been  discovered  by 
absorption  line  observations  in  the  visible  and  ul¬ 
traviolet  region.  With  a  few  exceptions,  all  ele¬ 
ments  from  hydrogen  to  zinc  (atomic  number  30) 
and  a  few  additional  heavier  elements  have  been 
detected  (Table  16.3).  Like  in  the  stars,  most  of 
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Table  16.1  Main 
properties  of  interstellar 
gas  and  dust 


Property 

Gas 

Dust 

Mass  fraction 

10% 

0.1  % 

Composition 

H I,  H II,  H2  (70  %) 

He  (28  %) 

C,  N,  O,  Ne,  Na, 

Mg,  Al,  Si,  S,  . . .  (2  %) 

Solid  particles 
d  ~  0.1-1  pm 

H20  (ice),  silicates, 
graphite  +  impurities 

Particle  density 

1  /cm3 

10-13  /cm3  =  100  /km3 

Mass  density 

10-21  kg/m3 

10-23  kg/m3 

Temperature 

100  K  (HI),  104  K(HII) 

50  K  (H2) 

10-20  K 

Method  of  study 

Absorption  lines  in  stellar  spectra. 
Optical: 

Cal,  Call,  Nal, 

K I,  Till,  Fel, 

CN,  CH,  CH+ 

Ultraviolet: 

H2,  CO,  HD 

Radio  lines: 

hydrogen  21  cm 
emission  and  absorption; 

H II,  He  II,  C II 
recombination  lines; 
molecular  emission  and 
absorption  lines 

OH,  H2CO,  NH3,H20, 

CO,  h2c2hcn,  c2h5oh 

Absorption  and  scattering  of 
starlight. 

Interstellar  reddening 
Interstellar  polarisation 
Thermal  infrared  emission 

the  mass  is  hydrogen  (about  70  %)  and  helium 
(almost  30  %).  On  the  other  hand,  heavy  elements 
are  significantly  less  abundant  than  in  the  Sun  and 
other  population  I  stars.  It  is  thought  that  they 
have  been  incorporated  into  dust  grains,  where 
they  do  not  produce  any  absorption  lines.  The 
element  abundances  in  the  interstellar  medium 
(gas  +  dust)  would  then  be  normal,  although  the 
interstellar  gas  is  depleted  in  heavy  elements. 
This  interpretation  is  supported  by  the  observa¬ 
tion  that  in  regions  where  the  amount  of  dust  is 
smaller  than  usual,  the  element  abundances  in  the 
gas  are  closer  to  normal. 

Atomic  Hydrogen  Ultraviolet  observations 
have  provided  an  excellent  way  of  studying  in¬ 
terstellar  neutral  hydrogen.  The  strongest  inter¬ 
stellar  absorption  line,  as  has  already  been  men¬ 
tioned,  is  the  hydrogen  Lyman  a  line  (Fig.  16.15). 
This  line  corresponds  to  the  transition  of  the  elec¬ 
tron  in  the  hydrogen  atom  from  a  state  with  prin¬ 
cipal  quantum  number  n  =  1  to  one  with  n  =  2. 


The  conditions  in  interstellar  space  are  such  that 
almost  all  hydrogen  atoms  are  in  the  ground 
state  with  n  =  1.  Therefore  the  Lyman  a  line  is 
a  strong  absorption  line,  whereas  the  B aimer  ab¬ 
sorption  lines,  which  arise  from  the  excited  initial 
state  n  =  2,  are  unobservable.  (The  Balmer  lines 
are  strong  in  stellar  atmospheres  with  tempera¬ 
tures  of  about  10,000  K,  where  a  large  number  of 
atoms  are  in  the  first  excited  state.) 

The  first  observations  of  the  interstellar  Lyman 
a  line  were  made  from  a  rocket  already  in  1967. 
More  extensive  observations  comprising  95  stars 
were  obtained  by  the  OAO  2  satellite.  The  dis¬ 
tances  of  the  observed  stars  are  between  100  and 
1000  parsecs. 

Comparison  of  the  Lyman  a  observations  with 
observations  of  the  21  cm  neutral  hydrogen  line 
have  been  especially  useful.  The  distribution  of 
neutral  hydrogen  over  the  whole  sky  has  been 
mapped  by  means  of  the  21  cm  line.  However, 
the  distances  to  nearby  hydrogen  clouds  are  diffi¬ 
cult  to  determine  from  these  observations.  In  the 
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Table  1 6.2  Phenomena  caused  by  the  interstellar  medium 


Observable  phenomenon 

Cause 

Interstellar  extinction  and  polarisation 

Non-spherical  dust  grains  aligned  by  magnetic  field 

Dark  nebulae,  uneven  distribution  of  stars  and  galaxies 

Dust  clouds 

Interstellar  absorption  lines  in  stellar  spectra 

Atoms  and  molecules  in  the  interstellar  gas 

Reflection  nebulae 

Interstellar  dust  clouds  illuminated  by  nearby  stars 

Emission  nebulae  or  H II  regions  (optical,  infrared  and 
radio  emission) 

Interstellar  gas  and  dust  cloud,  where  a  nearby  hot  star 
ionises  the  gas  and  heats  the  dust  to  50-100  K 

Optical  galactic  background  (diffuse  galactic  light) 

Interstellar  dust  illuminated  by  the  integrated  light  of  all 
stars 

Galactic  background  radiation: 


(a)  short  wavelength  (<  1  m) 

Free-free  emission  from  hot  interstellar  gas 

(b)  long  wavelength  (>  1  m) 

Synchrotron  radiation  from  cosmic  ray  electrons  in  the 
magnetic  field 

Galactic  21  cm  emission 

Cold  (100  K)  interstellar  neutral  hydrogen  clouds  (H I 
regions) 

Molecular  line  emission  (extended) 

Giant  molecular  clouds  (masses  even  105— 106  M©), 
dark  nebulae 

Point- like  OH,  H2O  and  SiO  sources 

Maser  sources  near  protostars  and  long-period  variables 

Fig.  16.14  (a)  The  D 

lines  D\  and  D2  of 
interstellar  sodium  (rest 
wavelengths  589.89  and 
589.00  nm)  in  the  spectrum 
of  the  star  HD  14134.  Both 
lines  consist  of  two 
components  formed  in  the 
gas  clouds  of  two  spiral 
arms.  The  radial  velocity 
difference  of  the  arms  is 
about  30  km/s.  (Mt.  Wilson 
Observatory),  (b)  The 
interstellar  absorption  lines 
of  ionised  calcium  Ca  II 
and  ionised  methylidyne 
CH+  in  the  spectra  of 
several  stars.  The  emission 
spectrum  of  iron  is  shown 
for  comparison  in  (a)  and 
(b).  (Lick  Observatory) 


C  lJcr 

£  Per 

k  Qri 
01  Or]  C 
HD  190603 

p  Cvs 

Jr  C 

Call  K  C\l+  Call  II 


Lyman  a  observations  one  usually  knows  the  dis¬ 
tance  to  the  star  in  front  of  which  the  absorbing 
clouds  must  lie. 

The  average  gas  density  within  about  1  kpc 
of  the  Sun  derived  from  the  Lyman  a  observa¬ 
tions  is  0.7  atoms/cm3.  Because  the  interstellar 
Lyman  a  line  is  so  strong,  it  can  be  observed  even 


in  the  spectra  of  very  nearby  stars.  For  exam¬ 
ple,  it  has  been  detected  by  the  Copernicus  satel¬ 
lite  in  the  spectrum  of  Arcturus,  whose  distance 
is  only  1 1  parsecs.  The  deduced  density  of  neu¬ 
tral  hydrogen  between  the  Sun  and  Arcturus  is 
0.02-0.1  atoms/cm3.  Thus  the  Sun  is  situated  in 
a  clearing  in  the  interstellar  medium,  where  the 
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Table  16.3  Element  abundances  in  the  interstellar 
medium  towards  £  Ophiuchi  and  in  the  Sun.  The  abun¬ 
dances  are  given  relative  to  that  of  hydrogen,  which  has 
been  defined  to  be  1,000,000.  An  asterisk  (*)  means  that 


the  abundance  has  been  determined  from  meteorites.  The 
last  column  gives  the  ratio  of  the  abundances  in  the  inter¬ 
stellar  medium  and  in  the  Sun 


Atomic  number 

Name 

Chemical  symbol 

Interstellar  abundance 

Solar  abundance 

Abundance  ratio 

1 

Hydrogen 

H 

1,000,000 

1,000,000 

1.00 

2 

Helium 

He 

85,000 

85,000 

<1 

3 

Lithium 

Li 

0.000051 

0.00158* 

0.034 

4 

Beryllium 

Be 

<0.000070 

0.000012 

<5.8 

5 

Boron 

B 

0.000074 

0.0046* 

0.016 

6 

Carbon 

C 

74 

370 

0.20 

7 

Nitrogen 

N 

21 

110 

0.19 

8 

Oxygen 

O 

172 

660 

0.26 

9 

Fluorine 

F 

— 

0.040 

— 

10 

Neon 

Ne 

— 

83 

— 

11 

Sodium 

Na 

0.22 

1.7 

0.13 

12 

Magnesium 

Mg 

1.05 

35 

0.030 

13 

Aluminium 

A1 

0.0013 

2.5 

0.00052 

14 

Silicon 

Si 

0.81 

35 

0.023 

15 

Phosphorus 

P 

0.021 

0.27 

0.079 

16 

Sulfur 

S 

8.2 

16 

0.51 

17 

Chlorine 

Cl 

0.099 

0.45 

0.22 

18 

Argon 

Ar 

0.86 

4.5 

0.19 

19 

Potassium 

K 

0.010 

0.11 

0.094 

20 

Calcium 

Ca 

0.00046 

2.1 

0.00022 

21 

Scandium 

Sc 

— 

0.0017 

— 

22 

Titanium 

Ti 

0.00018 

0.055 

0.0032 

23 

Vanadium 

V 

<0.0032 

0.013 

<0.25 

24 

Chromium 

Cr 

<0.002 

0.50 

<0.004 

25 

Manganese 

Mn 

0.014 

0.26 

0.055 

26 

Iron 

Fe 

0.28 

25 

0.011 

27 

Cobalt 

Co 

<0.19 

0.032 

<5.8 

28 

Nickel 

Ni 

0.0065 

1.3 

0.0050 

29 

Copper 

Cu 

0.00064 

0.028 

0.023 

30 

Zinc 

Zn 

0.014 

0.026 

0.53 

density  is  less  than  one  tenth  of  the  average  den¬ 
sity. 

If  a  hydrogen  atom  in  its  ground  state  absorbs 
radiation  with  a  wavelength  smaller  than  91.2  nm, 
it  will  be  ionised.  Knowing  the  density  of  neutral 
hydrogen,  one  can  calculate  the  expected  distance 
a  91.2  nm  photon  can  propagate  before  being  ab¬ 
sorbed  in  the  ionisation  of  a  hydrogen  atom.  Even 
in  the  close  neighbourhood  of  the  Sun,  where  the 


density  is  exceptionally  low,  the  mean  free  path 
of  a  91.2  nm  photon  is  only  about  a  parsec  and 
that  of  a  10  nm  photon  a  few  hundred  parsecs. 
Thus  only  the  closest  neighbourhood  of  the  Sun 
can  be  studied  in  the  extreme  ultraviolet  (XUV) 
spectral  region. 

The  Hydrogen  21  cm  Line  The  spins  of  the 
electron  and  proton  in  the  neutral  hydrogen 
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Fig.  16.15  Interstellar  absorption  lines  in  the  ultraviolet 
spectrum  of  £  Ophiuchi.  The  strongest  line  is  the  hydro¬ 
gen  Lyman  a  line  (equivalent  width,  more  than  1  nm). 


The  observations  were  made  with  the  Copernicus  satellite. 
(Morton,  D.C.  (1975):  Astrophys.  J.  197,  85) 


atom  in  the  ground  state  may  be  either  paral¬ 
lel  or  opposite.  The  energy  difference  between 
these  two  states  corresponds  to  the  frequency  of 
1420.4  MHz.  Thus  transitions  between  these  two 
hyperfine  structure  energy  levels  will  give  rise  to 
a  spectral  line  at  the  wavelength  of  21.049  cm 
(Fig.  5.8).  The  existence  of  the  line  was  theo¬ 
retically  predicted  by  Hendrick  van  de  Hulst  in 
1944,  and  was  first  observed  by  Harold  Ewen 
and  Edward  Purcell  in  1951.  Studies  of  this  line 
have  revealed  more  about  the  properties  of  the 
interstellar  medium  than  any  other  method — one 
might  even  speak  of  a  special  branch  of  21  cm  as¬ 
tronomy.  The  spiral  structure  and  rotation  of  the 
Milky  Way  and  other  galaxies  can  also  be  studied 
by  means  of  the  21  cm  line. 

Usually  the  hydrogen  21  cm  line  occurs  in 
emission.  Because  of  the  large  abundance  of  hy¬ 
drogen,  it  can  be  observed  in  all  directions  in 
the  sky.  Some  observed  21  cm  line  profiles  are 
shown  in  Fig.  16.16.  Rather  than  frequency  or 
wavelength,  the  radial  velocity  calculated  from 
the  Doppler  formula  is  plotted  on  the  horizon¬ 
tal  axis.  This  is  because  the  broadening  of  the 
21  cm  spectral  line  is  always  due  to  gas  mo¬ 
tions  either  within  the  cloud  (turbulence)  or  of  the 
cloud  as  a  whole.  The  vertical  axis  is  mostly  plot¬ 
ted  in  terms  of  the  antenna  temperature  7a  (see 
Chap.  5),  the  usual  radio  astronomical  measure 
of  intensity.  The  brightness  temperature  of  an  ex¬ 
tended  source  is  then  T b  =  7a/??b>  where  r ]b  is 
the  beam  efficiency  of  the  antenna. 

For  the  21  cm  line  hv/k  =  0.07  K,  and  thus 
hv/kT  <  1  for  all  relevant  temperatures.  One 


Fig.  16.16  Hydrogen  21  cm  emission  line  profiles  in  the 
galactic  plane  at  longitude  180°,  90°  and  1°  (in  the  direc¬ 
tion  /  =  0°  there  is  strong  absorption).  The  horizontal  axis 
gives  the  radial  velocity  according  to  the  Doppler  formula, 
the  vertical  axis  gives  the  brightness  temperature.  (Burton, 
W.B.  (1974):  “The  Large  Scale  Distribution  of  Neutral 
Hydrogen  in  the  Galaxy”,  in  Galactic  and  Extra-Galac¬ 
tic  Radio  Astronomy,  ed.  by  Verschuur,  G.L.,  Kellermann, 
K.I.  (Springer,  Berlin,  Heidelberg,  New  York)  p.  91) 


may  therefore  use  the  Rayleigh-Jeans  approxi¬ 
mation  (5.24) 

2  v2kT 

/„  =  — (16.18) 
cz 

In  the  solution  of  the  equation  of  radiative  trans¬ 
fer  (5.42)  the  intensity  can  thus  be  directly  related 
to  a  corresponding  temperature.  By  definition, 
7V  is  related  to  the  brightness  temperature  7b,  and 
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the  source  function  Sv  is  related  to  the  excitation 
temperature  Texc,  i.e. 

Tb  =  7exc(l  —  e_rv).  (16.19) 

In  certain  directions  in  the  Milky  Way  there  is 
so  much  hydrogen  along  the  line  of  sight  that  the 
21  cm  line  is  optically  thick,  rv  1.  In  that  case 

Th  =  Texc,  (16.20) 

i.e.  the  brightness  temperature  immediately  yields 
the  excitation  temperature  of  the  cloud.  This  is 
often  referred  to  as  the  spin  temperature  7s . 

The  excitation  temperature  need  not  always 
agree  with  the  kinetic  temperature  of  the  gas. 
However,  in  the  present  case  the  population  num¬ 
bers  of  the  hyperfine  levels  are  determined  by 
mutual  collisions  of  hydrogen  atoms:  the  time 
between  collisions  is  400  years  on  the  average, 
whereas  the  time  for  spontaneous  radiative  transi¬ 
tions  is  1 1  million  years;  thus  the  excitation  tem¬ 
perature  will  be  the  same  as  the  kinetic  tempera¬ 
ture.  The  observed  temperature  is  T  &  125  K. 

The  distance  to  a  source  cannot  be  obtained 
directly  from  the  observed  emission.  Thus  one 
can  only  study  the  number  of  hydrogen  atoms  in 
a  cylinder  with  a  1  cm2  base  area  extending  from 
the  observer  to  outside  the  Milky  Way  along  the 
line  of  sight.  This  is  called  the  projected  or  col¬ 
umn  density  and  is  denoted  by  N.  One  may  also 
consider  the  column  density  N(v)dv  of  atoms 
with  velocities  in  the  interval  [v,v  +  dv]. 

It  can  be  shown  that  if  the  gas  is  optically 
thin,  the  brightness  temperature  in  a  spectral  line 
is  directly  proportional  to  the  column  density  N 
of  atoms  with  the  corresponding  radial  velocity. 
Hence,  if  the  diameter  L  of  a  cloud  along  the  line 
of  sight  is  known,  the  gas  density  can  be  deter¬ 
mined  from  the  observed  line  profile: 

n  =  N/L. 

The  diameter  L  can  be  obtained  from  the  ap¬ 
parent  diameter,  if  the  distance  and  shape  of  the 
cloud  are  assumed  known. 

The  distances  of  clouds  can  be  determined 
from  their  radial  velocities  by  making  use  of  the 
rotation  of  the  Milky  Way  (Sect.  18.3).  Thus  if 


the  observed  peaks  in  the  21  cm  line  profiles 
(Fig.  16.16)  are  due  to  individual  clouds,  their 
distances  and  densities  can  be  obtained.  Since 
radio  observations  are  not  affected  by  extinc¬ 
tion,  it  has  been  possible  in  this  way  to  map  the 
density  distribution  of  neutral  hydrogen  in  the 
whole  galactic  plane.  The  resulting  distribution, 
based  on  observations  at  Leiden  and  Parkes,  is 
shown  in  Fig.  16.17.  It  appears  that  the  Milky 
Way  is  a  spiral  galaxy  and  that  the  interstel¬ 
lar  hydrogen  is  concentrated  in  the  spiral  arms. 
The  average  density  of  interstellar  hydrogen  is 
1  atom/cm3,  but  the  distribution  is  very  inhomo¬ 
geneous.  Typically  the  hydrogen  forms  denser  re¬ 
gions,  a  few  parsecs  in  size,  where  the  densities 
may  be  10-100  atoms/cm3.  Regions  where  the 
hydrogen  is  predominantly  neutral  are  known  as 
HI  regions  (in  contrast  to  HII  regions  of  ionised 
hydrogen). 

The  hydrogen  21  cm  line  may  also  occur  in 
absorption,  when  the  light  from  a  bright  radio 
source,  e.g.  a  quasar,  passes  through  an  interven¬ 
ing  cloud.  The  same  cloud  may  give  rise  to  both 
an  absorption  and  an  emission  spectrum.  In  that 
case  the  temperature,  optical  thickness  and  hy¬ 
drogen  content  of  the  cloud  can  all  be  derived. 

Like  interstellar  dust  hydrogen  is  concentrated 
in  a  thin  disk  in  the  galactic  plane.  The  thickness 
of  the  hydrogen  layer  is  about  twice  that  of  the 
dust  or  about  200  pc. 

H II  Regions  In  many  parts  of  space  hydrogen 
does  not  occur  as  neutral  atoms,  but  is  ionised. 
This  is  true  in  particular  around  hot  O  stars, 
which  radiate  strongly  in  the  ultraviolet.  If  there 
is  enough  hydrogen  around  such  a  star,  it  will 
be  visible  as  an  emission  nebula  of  ionised  hy¬ 
drogen.  Such  nebulae  are  known  as  HII  region 
(Figs.  16.18  and  16.19). 

A  typical  emission  nebula  is  the  great  nebula 
in  Orion,  M42.  It  is  visible  even  to  the  unaided 
eye,  and  is  a  beautiful  sight  when  seen  through 
a  telescope.  In  the  middle  of  the  nebula  there  is 
a  group  of  four  hot  stars  known  as  the  Trapez¬ 
ium,  which  can  be  distinguished  inside  the  bright 
nebula,  even  with  a  small  telescope.  The  Trapez¬ 
ium  stars  emit  strong  ultraviolet  radiation,  which 
keeps  the  gas  nebula  ionised. 
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Fig.  16.17  The  distribution  of  neutral  hydrogen  in  the  galaxy  from  the  Leiden  and  Parkes  surveys.  The  density  is  given 
in  atoms/cm3.  (Oort,  J.H.,  Kerr,  P.T.,  Westerhout,  G.L.  (1958):  Mon.  Not.  R.  Astron.  Soc.  118,  379) 


Unlike  a  star  a  cloud  of  ionised  gas  has 
a  spectrum  dominated  by  a  few  narrow  emission 
lines.  The  continuous  spectrum  of  HII  regions  is 
weak.  In  the  visible  region  the  hydrogen  B  aimer 
emission  lines  are  particularly  strong.  These  are 
formed  when  a  hydrogen  atom  recombines  into 
an  excited  state  and  subsequently  returns  to  the 
ground  state  via  a  sequence  of  radiative  transi¬ 
tions.  Typically  a  hydrogen  atom  in  a  HII  re¬ 
gion  remains  ionised  for  several  hundred  years. 
Upon  recombination  it  stays  neutral  for  some 
months,  before  being  ionised  again  by  a  photon 
from  a  nearby  star. 

The  number  of  recombinations  per  unit  time 
and  volume  is  proportional  to  the  product  of  the 
densities  of  electrons  and  ions, 

(16.21) 


In  completely  ionised  hydrogen,  ne  =  nu  and 
hence 

nVQC(xn g.  (16.22) 

Most  recombinations  will  include  the  transition 
n  =  3  — >  2,  i.e.  will  lead  to  the  emission  of  a  Ha 
photon.  Thus  the  surface  brightness  of  a  nebula  in 
the  Ha  line  will  be  proportional  to  the  emission 
measure , 


where  the  integral  is  along  the  line  of  sight 
through  the  nebula. 

The  ionisation  of  a  helium  atom  requires  more 
energy  than  that  of  a  hydrogen  atom,  and  thus  re¬ 
gions  of  ionised  helium  are  formed  only  around 
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Fig.  1 6.1 8  The  great 
nebula  in  Orion  (M42, 
NGC  1976).  The  nebula 
gets  its  energy  from  newly 
formed  hot  stars.  The  dark 
regions  are  opaque  dust 
clouds  in  front  of  the 
nebula.  Radio  and  infrared 
observations  have  revealed 
a  rich  molecular  cloud 
behind  the  nebula 
(Fig.  16.20).  In  the  upper 
part  of  this  picture  is  the 
gas  nebula  NGC  1977,  in 
the  lower  part  the  bright 
star  l  Orionis.  (Lick 
Observatory) 


the  hottest  stars.  In  these  cases,  a  large  H II  region 
will  surround  a  smaller  central  He+  or  He++  re¬ 
gion.  The  helium  lines  will  then  be  strong  in  the 
spectrum  of  the  nebula. 

Although  hydrogen  and  helium  are  the  main 
constituents  of  clouds,  their  emission  lines  are 
not  always  strongest  in  the  spectrum.  At  the  be¬ 
ginning  of  this  century  it  was  even  suggested 
that  some  strong  unidentified  lines  in  the  spec¬ 
tra  of  nebulae  were  due  to  the  new  element  neb- 
ulium.  However,  in  1927  Ira  S.  Bowen  showed 
that  they  were  forbidden  lines  of  ionised  oxygen 
and  nitrogen,  0+,  0++  and  N+.  Forbidden  lines 
are  extremely  difficult  to  observe  in  the  labora¬ 
tory,  because  their  transition  probabilities  are  so 
small  that  at  laboratory  densities  the  ions  are  de- 


excited  by  collisions  before  they  have  had  time 
to  radiate.  In  the  extremely  diffuse  interstellar 
gas,  collisions  are  much  less  frequent,  and  thus 
there  is  a  chance  that  an  excited  ion  will  make 
the  transition  to  a  lower  state  by  emitting  a  pho¬ 
ton. 

Because  of  interstellar  extinction,  only  the 
nearest  H II  regions  can  be  studied  in  visible  light. 
At  infrared  and  radio  wavelengths  much  more 
distant  regions  can  be  studied.  The  most  impor¬ 
tant  lines  at  radio  wavelengths  are  recombination 
lines  of  hydrogen  and  helium;  thus  the  hydro¬ 
gen  transition  between  energy  levels  110  and  109 
at  5.01  GHz  has  been  much  studied.  These  lines 
are  also  important  because  with  their  help  radial 
velocities,  and  hence  (using  the  galactic  rotation 
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Fig.  1 6.1 9  The  Lagoon 
nebula  (M8,  NGC  6523)  in 
Sagittarius.  This  H II 
region  contains  many  stars 
of  early  spectral  types  and 
stars  that  are  still 
contracting  towards  the 
main  sequence.  Small, 
round  dark  nebulae, 
globules,  are  also  visible 
against  the  bright 
background.  These  are 
presumably  gas  clouds  in 
the  process  of  condensation 
into  stars.  (National 
Optical  Astronomy 
Observatories,  Kitt  Peak 
National  Observatory) 


law),  distances  of  H II  regions  can  be  determined, 
just  as  for  neutral  hydrogen. 

The  physical  properties  of  HII  regions  can 
also  be  studied  by  means  of  their  continuum  radio 
emission.  The  radiation  is  due  to  bremsstrahlung 
or  free-free  emission  from  the  electrons.  The  in¬ 
tensity  of  the  radiation  is  proportional  to  the  emis¬ 
sion  measure  EM  defined  in  (16.23).  H II  regions 
also  have  a  strong  infrared  continuum  emission. 
This  is  thermal  radiation  from  dust  inside  the  neb¬ 
ula. 

H II  regions  are  formed  when  a  hot  O  or  B  star 
begins  to  ionise  its  surrounding  gas.  The  ionisa¬ 
tion  steadily  propagates  away  from  the  star.  Be¬ 
cause  neutral  hydrogen  absorbs  ultraviolet  radia¬ 
tion  so  efficiently,  the  boundary  between  the  H II 
region  and  the  neutral  gas  is  very  sharp.  In  a  ho¬ 
mogeneous  medium  the  H II  region  around  a  sin¬ 
gle  star  will  be  spherical,  forming  a  Stromgren 
sphere.  For  a  BO  V  star  the  radius  of  the  Strom¬ 
gren  sphere  is  50  pc  and  for  an  AO  V  star  only 
1  pc. 

The  temperature  of  a  HII  region  is  higher 
than  that  of  the  surrounding  gas,  and  it  therefore 
tends  to  expand.  After  millions  of  years,  it  will 
have  become  extremely  diffuse  and  will  eventu¬ 
ally  merge  with  the  general  interstellar  medium. 


1 6.3  Interstellar  Molecules 

The  first  interstellar  molecules  were  discov¬ 
ered  in  1937-1938,  when  molecular  absorp¬ 
tion  lines  were  found  in  the  spectra  of  some 
stars.  Three  simple  diatomic  molecules  were  de¬ 
tected:  methylidyne  CH,  its  positive  ion  CH+ 
and  cyanogen  CN.  A  few  other  molecules  were 
later  discovered  by  the  same  method  in  the  ul¬ 
traviolet.  Thus  molecular  hydrogen  H2  was  dis¬ 
covered  in  the  early  1970’s,  and  carbon  monox¬ 
ide ,  which  had  been  discovered  by  radio  ob¬ 
servations,  was  also  detected  in  the  ultraviolet. 
Molecular  hydrogen  is  the  most  abundant  in¬ 
terstellar  molecule,  followed  by  carbon  monox¬ 
ide. 

Molecular  Hydrogen  The  detection  and  study 
of  molecular  hydrogen  has  been  one  of  the 
most  important  achievements  of  UV  astronomy. 
Molecular  hydrogen  has  a  strong  absorption  band 
at  105  nm,  which  was  first  observed  in  a  rocket 
experiment  in  1970  by  George  R.  Carruthers ,  but 
more  extensive  observations  could  only  be  made 
with  the  Copernicus  satellite.  The  observations 
showed  that  a  significant  fraction  of  interstellar 
hydrogen  is  molecular,  and  that  this  fraction  in¬ 
creases  strongly  for  denser  clouds  with  higher  ex- 
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Fig.  16.20  Radio  map  of 
the  distribution  of  carbon 
monoxide  13  C 16  O  in  the 
molecular  cloud  near  the 
Orion  nebula.  The  curves 
are  lines  of  constant 
intensity.  (Kutner,  M.L., 
Evans  II,  N.J.,  Tucker, 
K.D.  (1976):  Astrophys.  J. 
209, 452) 


tinction.  In  clouds  with  visual  extinction  larger 
than  one  magnitude  essentially  all  the  hydrogen 
is  molecular. 

Hydrogen  molecules  are  formed  on  the  surface 
of  interstellar  grains,  which  thus  act  as  a  chem¬ 
ical  catalyst.  Dust  is  also  needed  to  shield  the 
molecules  from  the  stellar  UV  radiation,  which 
would  otherwise  destroy  them.  Molecular  hydro¬ 
gen  is  thus  found  where  dust  is  abundant.  It  is  of 
interest  to  know  whether  gas  and  dust  are  well 
mixed  or  whether  they  form  separate  clouds  and 
condensations. 

UV  observations  have  provided  a  reliable  way 
of  comparing  the  distribution  of  interstellar  gas 
and  dust.  The  amount  of  dust  between  the  ob¬ 
server  and  a  star  is  obtained  from  the  extinction  of 
the  stellar  light.  Furthermore,  the  absorption  lines 
of  atomic  and  molecular  hydrogen  in  the  ultravi¬ 
olet  spectrum  of  the  same  star  can  be  observed. 
Thus  the  total  amount  of  hydrogen  (atomic  + 
molecular)  between  the  observer  and  the  star  can 
also  be  determined. 

Observations  indicate  that  the  gas  and  dust 
are  well  mixed.  The  amount  of  dust  giving  rise 
to  one  magnitude  visual  extinction  corresponds 
to  1.9  x  1021  hydrogen  atoms  (one  molecule  is 
counted  as  two  atoms).  The  mass  ratio  of  gas  and 
dust  obtained  in  this  way  is  100. 


Radio  Spectroscopy  Absorption  lines  can  only 
be  observed  if  there  is  a  bright  star  behind  the 
molecular  cloud.  Because  of  the  large  dust  extinc¬ 
tion,  no  observations  of  molecules  in  the  densest 
clouds  can  be  made  in  the  optical  and  ultravio¬ 
let  spectral  regions.  Thus  only  radio  observations 
are  possible  for  these  objects,  where  molecules 
are  especially  abundant. 

Radio  spectroscopy  signifies  an  immense  step 
forward  in  the  study  of  interstellar  molecules.  In 
the  early  1960’s,  it  was  still  not  believed  that  there 
might  be  more  complicated  molecules  than  di¬ 
atomic  ones  in  interstellar  space.  It  was  thought 
that  the  gas  was  too  diffuse  for  molecules  to 
form  and  that  any  that  formed  would  be  destroyed 
by  ultraviolet  radiation.  The  first  molecular  radio 
line,  the  hydroxyl  radical  OH,  was  discovered  in 
1963.  Many  other  molecules  have  been  discov¬ 
ered  since  then.  By  2002,  about  130  molecules 
had  been  detected,  the  heaviest  one  being  the 
13-atom  molecule  HCnN. 

Molecular  lines  in  the  radio  region  may  be  ob¬ 
served  either  in  absorption  or  in  emission.  Ra¬ 
diation  from  diatomic  molecules  like  CO  (see 
Fig.  16.20)  may  correspond  to  three  kinds  of 
transitions.  (1)  Electron  transitions  correspond  to 
changes  in  the  electron  cloud  of  the  molecule. 
These  are  like  the  transitions  in  single  atoms,  and 
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Table  16.4  Some  molecules  observed  in  the  interstellar 
medium 


Molecule 

Name 

Year  of  discovery 

Discovered  in  the  optical  and  ultraviolet  region: 

CH 

methylidyne 

1937 

CH+ 

methylidyne  ion 

1937 

CN 

cyanogen 

1938 

h2 

hydrogen  molecule 

1970 

CO 

carbon  monoxide 

1971 

Discovered  in  the  radio  region: 


OH 

hydroxyl  radical 

1963 

CO 

carbon  monoxide 

1970 

CS 

carbon  monosulfide 

1971 

SiO 

silicon  monoxide 

1971 

SO 

sulfur  monoxide 

1973 

H20 

water 

1969 

HCN 

hydrogen  cyanide 

1970 

nh3 

ammonia 

1968 

h2co 

formaldehyde 

1969 

HCOOH 

formic  acid 

1975 

HCCNC 

isocyanoacetylene 

1991 

c2h4o 

vinyl  alcohol 

2001 

h2cccc 

cumulene  carbene 

1991 

(CH3)20 

dimethyl  ether 

1974 

c2h5oh 

ethanol 

1975 

HCnN 

cyanopentacetylene 

1981 

their  wavelengths  lie  in  the  optical  or  ultraviolet 
region.  (2)  Vibrational  transitions  correspond  to 
changes  in  the  vibrational  energy  of  the  molecule. 
Their  energies  are  usually  in  the  infrared  region. 
(3)  Most  important  for  radio  spectroscopy  are  the 
rotational  transitions ,  which  are  changes  in  the 
rotational  energy  of  the  molecule.  Molecules  in 
their  ground  state  do  not  rotate,  i.e.  their  angular 
momentum  is  zero,  but  they  may  be  excited  and 
start  rotating  in  collisions  with  other  molecules. 
For  example,  carbon  sulfide  CS  returns  to  its 
ground  state  in  a  few  hours  by  emitting  a  millime¬ 
tre  region  photon. 

A  number  of  interstellar  molecules  are  listed 
in  Table  16.4.  Many  of  them  have  only  been  de¬ 
tected  in  the  densest  clouds  (mainly  the  Sagittar¬ 
ius  B2  cloud  at  the  galactic  centre),  but  others  are 
very  common.  The  most  abundant  molecule  H2 
cannot  be  observed  at  radio  wavelengths,  because 


it  has  no  suitable  spectral  lines.  The  next  most 
abundant  molecules  are  carbon  monoxide  CO, 
the  hydroxyl  radical  OH  and  ammonia  NH3 ,  al¬ 
though  their  abundance  is  only  a  small  fraction 
of  that  of  hydrogen.  However,  the  masses  of  in¬ 
terstellar  clouds  are  so  large  that  the  number 
of  molecules  is  still  considerable.  (The  Sagittar¬ 
ius  B2  cloud  contains  enough  ethanol,  C2H5OH, 
for  1028  bottles  of  vodka.) 

Both  the  formation  and  survival  of  interstel¬ 
lar  molecules  requires  a  higher  density  than  is 
common  in  interstellar  clouds;  thus  they  are  most 
common  in  dense  clouds.  Molecules  are  formed 
in  collisions  of  atoms  or  simpler  molecules  or 
catalysed  on  dust  grains.  Molecular  clouds  must 
also  contain  a  lot  of  dust  to  absorb  the  ultravio¬ 
let  radiation  entering  from  outside  that  otherwise 
would  disrupt  the  molecules.  The  most  suitable 
conditions  are  thus  found  inside  dust  and  molec¬ 
ular  clouds  near  dense  dark  nebulae  and  H II  re¬ 
gions. 

Most  of  the  molecules  in  Table  16.4  have  only 
been  detected  in  dense  molecular  clouds  occur¬ 
ring  in  connection  with  H II  regions.  Almost  ev¬ 
ery  molecule  yet  discovered  has  been  detected  in 
Sagittarius  B2  near  the  galactic  centre.  Another 
very  rich  molecular  cloud  has  been  observed  near 
the  HII  region  Orion  A.  In  visible  light  this  re¬ 
gion  has  long  been  known  as  the  Orion  neb¬ 
ula  M42  (Fig.  16.18).  Inside  the  actual  HII  re¬ 
gions  there  are  no  molecules,  since  they  would  be 
rapidly  dissociated  by  the  high  temperature  and 
strong  ultraviolet  radiation.  Three  types  of  molec¬ 
ular  sources  have  been  found  near  HII  regions 
(Fig.  16.21): 

1 .  Large  gas  and  dust  envelopes  around  the  H II 

region. 

2.  Small  dense  clouds  inside  these  envelopes. 

3.  Very  compact  OH  and  H2O  maser  sources. 

The  large  envelopes  have  been  discovered  pri¬ 
marily  by  CO  observations.  OH  and  H2CO  have 
also  been  detected.  Like  in  the  dark  nebulae  the 
gas  in  these  clouds  is  probably  mainly  molecular 
hydrogen.  Because  of  the  large  size  and  density 
(n  ~  103-104  molecules/cm3)  of  these  clouds, 
their  masses  are  very  large,  105  or  even  106  solar 
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Table  16.5  The  five 
phases  of  interstellar  gas 


T 

[K] 

n 

[cm-3] 

1. 

Very  cold  molecular  gas  clouds  (mostly  hydrogen  H2) 

20 

>103 

2. 

Cold  gas  clouds  (mostly  atomic  neutral  hydrogen) 

100 

20 

3. 

Warm  neutral  gas  enveloping  the  cooler  clouds 

6000 

0.05-0.3 

4. 

Hot  ionised  gas  (mainly  H II  regions  around  hot  stars) 

8000 

>0.5 

5. 

Very  hot  and  diffuse  ionised  coronal  gas,  ionised  and 
heated  by  supernova  explosions 

106 

10-3 

Fig.  16.21  Infrared  map  of  the  central  part  of  the  Orion 
nebula.  In  the  lower  part  are  the  four  Trapezium  stars. 
Above  is  an  infrared  source  of  about  0.5"  diameter,  the 
Kleinmann-Low  nebula  (KL).  BN  is  an  infrared  point 
source,  the  Becklin-Neugebauer  object.  Other  infrared 
sources  are  denoted  IRS.  The  large  crosses  indicate  OH 
masers  and  the  small  crosses  H2O  masers.  On  the  scale 
of  Fig.  16.18  this  region  would  only  be  a  few  millimetres 
in  size.  (Goudis,  C.  (1982):  The  Orion  Complex:  A  Case 
Study  of  Interstellar  Matter  (Reidel,  Dordrecht)  p.  176) 

masses  (SgrB2).  They  are  among  the  most  mas¬ 
sive  objects  in  the  Milky  Way.  The  dust  in  molec¬ 
ular  clouds  can  be  observed  on  the  basis  of  its 
thermal  radiation.  Its  peak  falls  at  wavelengths  of 
10-100  pm,  corresponding  to  a  dust  temperature 
of  30-300  K. 

Some  interstellar  clouds  contain  very  small 
maser  sources.  In  these  the  emission  lines  of  OH, 


H2O  and  SiO  may  be  many  million  times  stronger 
than  elsewhere.  The  diameter  of  the  radiating  re¬ 
gions  is  only  about  5-10  au.  The  conditions  in 
these  clouds  are  such  that  radiation  in  some  spec¬ 
tral  lines  is  amplified  by  stimulated  emission  as  it 
propagates  through  the  cloud.  Hydroxyl  and  wa¬ 
ter  masers  occur  in  connection  with  dense  HII 
regions  and  infrared  sources,  and  appear  to  be  re¬ 
lated  to  the  formation  of  protostars.  In  addition 
maser  emission  (OH,  H2O  and  SiO)  occurs  in 
connection  with  Mira  variables  and  some  red  su¬ 
pergiant  stars.  This  maser  emission  comes  from 
a  molecule  and  dust  envelope  around  the  star, 
which  also  gives  rise  to  an  observable  infrared  ex¬ 
cess. 


1 6.4  The  Formation  of  Protostars 

The  mass  of  the  Milky  Way  is  about  1011  solar 
masses.  Since  its  age  is  about  1010  years,  stars 
have  been  forming  at  the  average  rate  of  10  M© 
per  year.  This  estimate  is  only  an  upper  limit  for 
the  present  rate,  because  earlier  the  rate  of  star 
formation  must  have  been  much  higher.  Since 
the  lifetime  of  O  stars  is  only  about  a  million 
years,  a  better  estimate  of  the  star  formation  rate 
can  be  made,  based  on  the  observed  number  of 
O  stars.  Accordingly,  it  has  been  concluded  that  at 
present,  new  stars  are  forming  in  the  Milky  Way 
at  a  rate  of  about  three  solar  masses  per  year. 

Stars  are  now  believed  to  form  inside  large 
dense  interstellar  clouds  mostly  located  in  the 
spiral  arms  of  the  Galaxy.  Under  its  own  grav¬ 
ity,  a  cloud  begins  to  contract  and  fragment  into 
parts  that  will  become  protostars.  The  observa¬ 
tions  seem  to  indicate  that  stars  are  not  formed 
individually,  but  in  larger  groups.  Young  stars  are 
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found  in  open  clusters  and  in  loose  associations, 
typically  containing  a  few  hundred  stars  which 
must  have  formed  simultaneously. 

Theoretical  calculations  confirm  that  the  for¬ 
mation  of  single  stars  is  almost  impossible.  An 
interstellar  cloud  can  contract  only  if  its  mass  is 
large  enough  for  gravity  to  overwhelm  the  pres¬ 
sure.  As  early  as  in  the  1920’s,  James  Jeans  cal¬ 
culated  that  a  cloud  with  a  certain  temperature 
and  density  can  condense  only  if  its  mass  is  high 
enough.  If  the  mass  is  too  small  the  pressure  of 
the  gas  is  sufficient  to  prevent  the  gravitational 
contraction.  The  limiting  mass  is  the  Jeans  mass 
(Sect.  6.11),  given  by 


/  T3 

Mj^3  x  10 4,/ —  M©, 

V  n 

where  n  is  the  density  in  atoms/m3  and  T  the  tem¬ 
perature. 

In  a  typical  interstellar  neutral  hydrogen  cloud 
n  =  106  and  T  =  100  K,  giving  the  Jeans  mass, 
30,000 Mq.  In  the  densest  dark  clouds  n  =  1012 
and  T  =  10  K  and  hence,  Mj  =  1  Mq. 

It  is  thought  that  star  formation  begins  in 
clouds  of  a  few  thousand  solar  masses  and  diam¬ 
eters  of  about  10  pc.  The  cloud  begins  to  contract, 
but  does  not  heat  up  because  the  liberated  energy 
is  carried  away  by  radiation.  As  the  density  in¬ 
creases,  the  Jeans  mass  thus  decreases.  Because 
of  this,  separate  condensation  nuclei  are  formed 
in  the  cloud,  which  go  on  contracting  indepen¬ 
dently:  the  cloud  fragments.  Fragmentation  is  fur¬ 
ther  advanced  by  the  increasing  rotation  velocity. 
The  original  cloud  has  a  certain  angular  momen¬ 
tum  which  is  conserved  during  the  contraction; 
thus  the  angular  velocity  must  increase. 

This  contraction  and  fragmentation  continues 
until  the  density  becomes  so  high  that  the  individ¬ 
ual  fragments  become  optically  thick.  The  energy 
liberated  by  the  contraction  can  then  no  longer 
escape,  and  the  temperature  will  begin  to  rise.  In 
consequence  the  Jeans  mass  begins  to  increase, 
further  fragmentation  ceases  and  the  rising  pres¬ 
sure  in  existing  fragments  stops  their  contrac¬ 
tion.  Some  of  the  protostars  formed  in  this  way 
may  still  be  rotating  too  rapidly.  These  may  split 


into  two,  thus  forming  double  systems.  The  fur¬ 
ther  evolution  of  protostars  has  been  described  in 
Sect.  11.2. 

Although  the  view  that  stars  are  formed  by 
the  collapse  of  interstellar  clouds  is  generally  ac¬ 
cepted,  many  details  of  the  fragmentation  process 
are  still  highly  conjectural.  Thus  the  effects  of  ro¬ 
tation,  magnetic  fields  and  energy  input  are  very 
imperfectly  known.  Why  a  cloud  begins  to  con¬ 
tract  is  also  not  certain;  one  theory  is  that  passage 
through  a  spiral  arm  compresses  clouds  and  trig¬ 
gers  contraction  (see  Sect.  17.4).  This  would  ex¬ 
plain  why  young  stars  are  predominantly  found  in 
the  spiral  arms  of  the  Milky  Way  and  other  galax¬ 
ies.  The  contraction  of  an  interstellar  cloud  might 
also  be  initiated  by  a  nearby  expanding  HII  re¬ 
gion  or  supernova  explosion. 

Star  formation  can  be  observed  particularly 
well  in  the  infrared,  since  the  temperatures  of  the 
condensing  clouds  and  protostars  are  of  the  or¬ 
der  100-1000  K  and  the  infrared  radiation  can 
escape  even  the  densest  dust  clouds.  For  exam¬ 
ple,  in  connection  with  the  Orion  nebula  there  is 
a  large  cloud  of  hydrogen,  found  in  radio  obser¬ 
vations,  containing  small  infrared  sources.  E.g. 
the  Becklin-Neugebauer  object  has  a  temperature 
of  a  couple  of  hundred  kelvins  but  a  luminosity 
that  is  thousandfold  compared  with  the  Sun.  It  is 
a  strong  H2O  maser  source,  located  next  to  a  large 
H II  region. 


1 6.5  Planetary  Nebulae 

Bright  regions  of  ionised  gas  do  not  occur  only 
in  connection  with  newly  formed  stars,  but  also 
around  stars  in  late  stages  of  their  evolution. 
The  planetary  nebulae  are  gas  shells  around 
small  hot  blue  stars.  As  we  have  seen  in  con¬ 
nection  with  stellar  evolution,  instabilities  may 
develop  at  the  stage  of  helium  burning.  Some 
stars  begin  to  pulsate,  while  in  others  the  whole 
outer  atmosphere  may  be  violently  ejected  into 
space.  In  the  latter  case,  a  gas  shell  expanding  at 
20-30  km/s  will  be  formed  around  a  small  and 
hot  (50,000-100,000  K)  star,  the  core  of  the  orig¬ 
inal  star. 

The  expanding  gas  in  a  planetary  nebula  is 
ionised  by  ultraviolet  radiation  from  the  central 
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Fig.  16.22  The  Helix 
nebula  (NGC  7293).  The 
planetary  nebulae  are 
formed  during  the  final 
stages  of  evolution  of 
solar-type  stars.  The 
centrally  visible  star  has 
ejected  its  outer  layers  into 
space.  (National  Optical 
Astronomy  Observatories, 
Kitt  Peak  National 
Observatory) 


star,  and  its  spectrum  contains  many  of  the  same 
bright  emission  lines  as  that  of  an  HII  region. 
Planetary  nebulae  are,  however,  generally  much 
more  symmetrical  in  shape  than  most  HII  re¬ 
gions,  and  they  expand  more  rapidly.  For  exam¬ 
ple,  the  well-known  Ring  nebula  in  Lyra  (M57) 
has  expanded  visibly  in  photographs  taken  at 
50-year  intervals.  In  a  few  ten  thousand  years,  the 
planetary  nebulae  disappear  in  the  general  inter¬ 
stellar  medium  and  their  central  stars  cool  to  be¬ 
come  white  dwarfs. 

The  planetary  nebulae  were  given  their  name 
in  the  19th  century,  because  certain  small  nebulae 
visually  look  quite  like  planets  such  as  Uranus. 
The  apparent  diameter  of  the  smallest  known 
planetary  nebulae  is  only  a  few  arc  seconds, 
whereas  the  largest  ones  (like  the  Helix  nebula) 
may  be  one  degree  in  diameter  (Fig.  16.22). 

The  brightest  emission  lines  are  often  due  to 
forbidden  transitions,  like  in  HII  regions.  For  ex¬ 
ample,  the  green  colour  of  the  central  parts  of 
the  Ring  nebula  in  Lyra  is  due  to  the  forbid¬ 
den  lines  of  doubly  ionised  oxygen  at  495.9  and 
500.7  nm.  The  red  colour  of  the  outer  parts  is  due 
to  the  hydrogen  B aimer  a  line  (656.3  nm)  and 
the  forbidden  lines  of  ionised  nitrogen  (654.8  nm, 
658.3  nm). 


The  total  number  of  planetary  nebulae  in  the 
Milky  Way  has  been  estimated  to  be  50,000. 
About  2000  have  actually  been  observed. 


1 6.6  Supernova  Remnants 

In  Chap.  1 1  we  have  seen  that  massive  stars  end 
their  evolution  in  a  supernova  explosion.  The  col¬ 
lapse  of  the  stellar  core  leads  to  the  violent  ejec¬ 
tion  of  the  outer  layers,  which  then  remain  as  an 
expanding  gas  cloud. 

About  120  supernova  remnants  (SNR’s)  have 
been  discovered  in  the  Milky  Way.  Some  of  them 
are  optically  visible  as  a  ring  or  an  irregular  neb¬ 
ula  (e.g.  the  Crab  nebula;  see  Fig.  16.23),  but 
most  are  detectable  only  in  the  radio  region  (be¬ 
cause  radio  emission  suffers  no  extinction). 

In  the  radio  region  the  SNR’s  are  extended 
sources  similar  to  H II  regions.  However,  unlike 
HII  regions  the  radiation  from  SNR’s  is  often 
polarised.  Another  characteristic  difference  be¬ 
tween  these  two  kinds  of  sources  is  that  whereas 
the  radio  brightness  of  H II  regions  grows  or  re¬ 
mains  constant  as  the  frequency  increases,  that  of 
SNR’s  falls  off  almost  linearly  (in  a  log  Iv  —  log  v 
diagram)  with  increasing  frequency  (Fig.  16.24). 

These  differences  are  due  to  the  different 
emission  processes  in  HII  regions  and  in  SNR’s. 


16.6  Supernova  Remnants 


351 


Fig.  16.23  The  Crab 
nebula  (Ml,  NGC  1952)  in 
Taurus  is  the  remnant  of 
a  supernova  explosion 
observed  in  1054.  The 
photograph  was  taken  at 
red  wavelengths.  The 
nebula  is  also  a  strong 
radio  source.  Its  energy 
source  is  the  central  rapidly 
rotating  neutron  star, 
pulsar,  which  is  the 
collapsed  core  of  the 
original  star.  (Palomar 
Observatory) 


Fig.  16.24  The  radio  spectra  of  typical  H II  regions  and 
supernova  remnants.  The  radiation  of  HII  regions  is  ther¬ 
mal  and  obeys  the  Rayleigh-Jeans  law,  I  oc  v2,  at  wave¬ 
lengths  larger  than  1  m.  In  supernova  remnants  the  in¬ 
tensity  decreases  with  increasing  frequency.  (After  Schef- 
fler,  H.,  Elsasser,  H.  (1987):  Physics  of  the  Galaxy  and 
the  Interstellar  Matter  (Springer,  Berlin,  Heidelberg,  New 
York)) 

In  an  H II  region,  the  radio  emission  is  free-free 
radiation  from  the  hot  plasma.  In  a  SNR  it  is 
synchrotron  radiation  from  relativistic  electrons 
moving  in  spiral  orbits  around  the  magnetic  field 


lines.  The  synchrotron  process  gives  rise  to  a  con¬ 
tinuous  spectrum  extending  over  all  wavelength 
regions.  For  example,  the  Crab  nebula  looks  blue 
or  green  in  colour  photographs  because  of  optical 
synchrotron  radiation. 

In  the  Crab  nebula  red  filaments  are  also  visi¬ 
ble  against  the  bright  background.  Their  emission 
is  principally  in  the  hydrogen  Ha  line.  The  hy¬ 
drogen  in  a  SNR  is  not  ionised  by  a  central  star 
as  in  the  H II  regions,  but  by  the  ultraviolet  syn¬ 
chrotron  radiation. 

The  supernova  remnants  in  the  Milky  Way  fall 
into  two  classes.  One  type  has  a  clearly  ring-like 
structure  (e.g.  Cassiopeia  A  or  the  Veil  nebula  in 
Cygnus;  see  Fig.  16.25);  another  is  irregular  and 
bright  at  the  middle  (like  the  Crab  nebula).  In  the 
remnants  of  the  Crab  nebula  type  there  is  always 
a  rapidly  rotating  pulsar  at  the  centre.  This  pulsar 
provides  most  of  the  energy  of  the  remnant  by 
continuously  injecting  relativistic  electrons  into 
the  cloud.  The  evolution  of  this  type  of  SNR  re¬ 
flects  that  of  the  pulsar  and  for  this  reason  has 
a  time  scale  of  a  few  ten  thousand  years. 

Ring-like  SNR’s  do  not  contain  an  energetic 
pulsar;  their  energy  comes  from  the  actual  super¬ 
nova  explosion.  After  the  explosion,  the  cloud  ex¬ 
pands  at  a  speed  of  10,000-20,000  km/s.  About 
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Fig.  16.25  The  Veil 
nebula  (NGC  6960  at  the 
right,  NGC  6992  at  the  left ) 
in  Cygnus  is  the  remnant  of 
a  supernova  explosion 
which  occurred  several  ten 
thousand  years  ago.  (Mt. 
Wilson  Observatory) 


50-100  years  after  the  explosion  the  remnant  be¬ 
gins  to  form  a  spherical  shell  as  the  ejected  gas 
starts  to  sweep  up  interstellar  gas  and  to  slow 
down  in  its  outer  parts.  The  swept-up  shell  ex¬ 
pands  with  a  decreasing  velocity  and  cools  until, 
after  about  100,000  years,  it  merges  into  the  inter¬ 
stellar  medium.  The  two  types  of  supernova  rem¬ 
nants  may  be  related  to  the  two  types  (I  and  II)  of 
supernovae. 


Box  16.1  (Synchrotron  Radiation)  Synchro¬ 
tron  radiation  was  first  observed  in  1948  by 
Frank  Elder ,  Robert  Langmuir  and  Herbert 
Pollack ,  who  were  experimenting  with  an  elec¬ 
tron  synchrotron,  in  which  electrons  were  ac¬ 
celerated  to  relativistic  energies  in  a  magnetic 


field.  It  was  observed  that  the  electrons  radi¬ 
ated  visible  light  in  a  narrow  cone  along  their 
momentary  direction  of  motion.  In  astrophysics 
synchrotron  radiation  was  first  invoked  as  an 
explanation  of  the  radio  emission  of  the  Milky 
Way,  discovered  by  Karl  Jansky  in  1931.  This 
radiation  had  a  spectrum  and  a  large  metre- 
wave  brightness  temperature  (more  than  105  K) 
which  were  inconsistent  with  ordinary  ther¬ 
mal  free-free  emission  from  ionised  gas.  In 
1950  Hannes  Alfven  and  Nicolai  Herlofson  as 
well  as  Karl- Otto  Kiepenheuer  proposed  that 
the  galactic  radio  background  was  due  to  syn¬ 
chrotron  radiation.  According  to  Kiepenheuer 
the  high-energy  cosmic  ray  electrons  would 
emit  radio  radiation  in  the  weak  galactic  mag- 
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netic  field.  This  explanation  has  turned  out  to 
be  correct.  Synchrotron  radiation  is  also  an 
important  emission  process  in  supernova  rem¬ 
nants,  radio  galaxies  and  quasars.  It  is  a  non- 
thermal  radiation  process,  i.e.  the  energy  of  the 
radiating  electrons  is  not  due  to  thermal  mo¬ 
tions. 


The  emission  of  synchrotron  radiation.  A  charged  par¬ 
ticle  (electron)  propagating  in  a  magnetic  field  moves 
in  a  spiral.  Because  of  the  centripetal  acceleration,  the 
particle  emits  electromagnetic  radiation 

The  origin  of  synchrotron  radiation  is 
schematically  shown  in  the  figure.  The  mag¬ 
netic  field  forces  the  electron  to  move  in  a  spi¬ 
ral  orbit.  The  electron  is  thus  constantly  accel¬ 
erated  and  will  emit  electromagnetic  radiation. 
According  to  the  special  theory  of  relativity, 
the  emission  from  a  relativistic  electron  will  be 
concentrated  in  a  narrow  cone.  Like  the  beam 
from  a  lighthouse,  this  cone  sweeps  across  the 
observer’s  field  of  vision  once  for  each  revo¬ 
lution.  Thus  the  observer  sees  a  sequence  of 
radiation  flashes  of  very  short  duration  com¬ 
pared  with  their  interval.  (In  the  total  emis¬ 
sion  of  a  large  number  of  electrons,  separate 
flashes  cannot  be  distinguished.)  When  this  se¬ 
ries  of  pulses  is  represented  as  a  sum  of  differ¬ 
ent  frequency  components  (Fourier  transform), 
a  broad  spectrum  is  obtained  with  a  maximum 
at 

Lnax  =  ClB±E  , 

where  B±  is  the  magnetic  field  component  per¬ 
pendicular  to  the  velocity  of  the  electron,  and  E 
its  energy,  a  is  a  constant  of  proportionality. 


The  table  gives  the  frequency  and  wave¬ 
length  of  the  maximum  as  functions  of  the 
electron  energy  for  the  typical  galactic  field 
strength  0.5  nT: 


^-max 

Wnax  [Hz] 

E  [eV] 

300  nm 

1015 

6.6  x  1012 

30  gm 

1013 

6.6  x  1011 

3  mm 

1011 

6.6  x  1010 

30  cm 

109 

6.6  x  109 

30  m 

107 

6.6  x  108 

To  produce  even  radio  synchrotron  radia¬ 
tion,  very  energetic  electrons  are  required,  but 
these  are  known  to  be  present  in  the  cosmic  ra¬ 
diation.  In  the  optical  galactic  background  radi¬ 
ation,  the  contribution  from  synchrotron  radia¬ 
tion  is  negligible,  but,  for  example,  in  the  Crab 
nebula,  a  significant  part  of  the  optical  emission 
is  due  to  this  mechanism. 


1 6.7  The  Hot  Corona  of  the  Milky 
Way 

As  early  as  1956  Lyman  Spitzer  showed  that  the 
Milky  Way  has  to  be  surrounded  by  a  large  en¬ 
velope  of  very  hot  gas  (Fig.  16.26).  Almost  two 
decades  later  the  Copernicus  satellite,  whose  sci¬ 
entific  program  was  directed  by  Spitzer,  found 
evidence  for  this  kind  of  gas,  which  began  to 
be  called  galactic  coronal  gas ,  in  analogy  with 
the  solar  corona.  The  satellite  observed  emis¬ 
sion  lines  of  e.g.  five  times  ionised  oxygen 
(OVI),  four  times  ionised  nitrogen  (NV)  and 
triply  ionised  carbon  (CIV).  The  formation  of 
these  lines  requires  a  high  temperature  (100,000- 
1,000,000  K),  and  a  high  temperature  is  also  in¬ 
dicated  by  the  broadening  of  the  lines. 

Galactic  coronal  gas  is  distributed  through 
the  whole  Milky  Way  and  extends  several  thou¬ 
sand  parsecs  from  the  galactic  plane.  Its  den¬ 
sity  is  only  of  the  order  of  10-3  atoms/cm3  (re¬ 
call  that  the  mean  density  in  the  galactic  plane 
is  1  atom/cm3).  Thus  coronal  gas  forms  a  kind 
of  background  sea,  from  which  the  denser  and 
cooler  forms  of  interstellar  matter,  such  as  neu¬ 
tral  hydrogen  and  molecular  clouds,  rise  as  is¬ 
lands.  In  the  early  1980’s  the  IUE  satellite  also 
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Fig.  16.26  Galaxy’s  hot  corona.  NGC  2403  is  a  spi¬ 
ral  galaxy,  similar  to  our  Milky  Way.  On  the  right,  it 
is  photographed  in  visual  light.  On  the  left,  in  a  VLA 
radio  image,  on  the  same  scale  with  the  optical  image, 

detected  similar  coronae  in  the  Large  Magellanic 
Cloud  and  in  the  spiral  galaxy  Ml 00.  Coronal  gas 
is  probably  quite  a  common  and  important  form 
of  matter  in  galaxies. 

Supernova  explosions  are  probably  the  source 
of  both  coronal  gas  and  its  energy.  When  a  super¬ 
nova  explodes,  it  forms  a  hot  bubble  in  the  sur¬ 
rounding  medium.  The  bubbles  from  neighbour¬ 
ing  supernovae  will  expand  and  merge,  forming 
a  foamlike  structure.  In  addition  to  supemovae, 
stellar  winds  from  hot  stars  may  provide  some  of 
the  energy  of  the  coronal  gas. 


1 6.8  Cosmic  Rays  and  the 

Interstellar  Magnetic  Field 

Cosmic  Rays  Elementary  particles  and  atomic 
nuclei  reaching  the  Earth  from  space  are  called 
cosmic  rays.  They  occur  throughout  interstellar 
space  with  an  energy  density  of  the  same  order 
of  magnitude  as  that  of  the  radiation  from  stars. 
Cosmic  rays  are  therefore  important  for  the  ioni¬ 
sation  and  heating  of  interstellar  gas. 

Since  cosmic  rays  are  charged,  their  direction 
of  propagation  in  space  is  constantly  changed 
by  the  magnetic  field.  Their  direction  of  arrival 


a  large  hydrogen  corona  is  seen  around  the  galaxy.  Large 
holes  created  by  supernova  explosions  are  seen  in  the  gas 
corona.  (Image  NRAO/AUI  and  Tom  Oosterloo,  Astron, 
The  Netherlands) 

therefore  gives  no  information  about  their  place 
of  origin.  The  most  important  properties  of  cos¬ 
mic  rays  that  can  be  observed  from  the  Earth 
are  their  particle  composition  and  energy  distri¬ 
bution.  As  noted  in  Sect.  3.6,  these  observations 
have  to  be  made  in  the  upper  atmosphere  or  from 
satellites,  since  cosmic  ray  particles  are  destroyed 
in  the  atmosphere. 

The  main  constituent  of  the  cosmic  rays  (about 
90  %)  is  hydrogen  nuclei  or  protons.  The  second 
most  important  constituent  (about  9  %)  is  helium 
nuclei  or  a  particles.  The  rest  of  the  particles  are 
electrons  and  nuclei  more  massive  than  helium. 

Most  cosmic  rays  have  an  energy  smaller  than 
109  eV.  The  number  of  more  energetic  particles 
drops  rapidly  with  increasing  energy.  The  most 
energetic  protons  have  an  energy  of  102°  eV,  but 
such  particles  are  very  rare — the  energy  of  one 
such  proton  could  lift  this  book  about  one  cen¬ 
timetre.  (The  largest  particle  accelerators  reach 
“only”  energies  of  10 12  eV.) 

The  distribution  of  low-energy  (less  than 
108  eV)  cosmic  rays  cannot  be  reliably  deter¬ 
mined  from  the  Earth,  since  solar  “cosmic  rays”, 
high-energy  protons  and  electrons  formed  in  so¬ 
lar  flares  fill  the  solar  system  and  strongly  affect 
the  motion  of  low-energy  cosmic  rays. 


1 6.8  Cosmic  Rays  and  the  Interstellar  Magnetic  Field 
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Fig.  16.27  The  polarisation  of  starlight.  The  dashes  give 
the  direction  and  degree  of  the  polarisation.  The  thinner 
dashes  correspond  to  stars  with  polarisation  smaller  than 
0.6  %;  the  thicker  dashes  to  stars  with  larger  polarisation. 

The  distribution  of  cosmic  rays  in  the  Milky 
Way  can  be  directly  inferred  from  gamma-ray  and 
radio  observations.  The  collisions  of  cosmic  ray 
protons  with  interstellar  hydrogen  atoms  gives 
rise  to  pions  which  then  decay  to  form  a  gamma- 
ray  background.  The  radio  background  is  formed 
by  cosmic  ray  electrons  which  emit  synchrotron 
radiation  in  the  interstellar  magnetic  field. 

Both  radio  and  gamma-ray  emission  are  stron¬ 
gly  concentrated  in  the  galactic  plane.  From  this 
it  has  been  concluded  that  the  sources  of  cosmic 
rays  must  also  be  located  in  the  galactic  plane.  In 
addition  there  are  individual  peaks  in  the  back¬ 
grounds  around  known  supernova  remnants.  In 
the  gamma-ray  region  such  peaks  are  observed  at 
e.g.  the  Crab  nebula  and  the  Vela  pulsar;  in  the  ra¬ 
dio  region  the  North  Polar  Spur  is  a  large,  nearby 
ring-like  region  of  enhanced  emission. 

Apparently  a  large  fraction  of  cosmic  rays 
have  their  origin  in  supernovae.  An  actual  super¬ 
nova  explosion  will  give  rise  to  energetic  parti¬ 
cles.  If  a  pulsar  is  formed,  observations  show  that 
it  will  accelerate  particles  in  its  surroundings.  Fi¬ 
nally  the  shock  waves  formed  in  the  expanding 
supernova  remnant  will  also  give  rise  to  relativis¬ 
tic  particles. 

On  the  basis  of  the  relative  abundances  of  var¬ 
ious  cosmic  ray  nuclei,  it  can  be  calculated  how 
far  they  have  travelled  before  reaching  the  Earth. 
It  has  been  found  that  typical  cosmic  ray  protons 


The  scale  is  shown  in  the  upper  left-hand  corner.  Stars 
with  polarisation  smaller  than  0.08  %  are  indicated  by 
a  small  circle.  (Mathewson,  D.S.,  Ford,  V.L.  (1970):  Mem. 
R.A.S.  74,  139) 

have  travelled  for  a  period  of  a  few  million  years 
(and  hence  also  a  distance  of  a  few  million  light- 
years)  from  their  point  of  origin.  Since  the  diame¬ 
ter  of  the  Milky  Way  is  about  100,000  light-years, 
the  protons  have  crossed  the  Milky  Way  tens  of 
times  in  the  galactic  field. 

The  Interstellar  Magnetic  Field  The  strength 
and  direction  of  the  interstellar  magnetic  field  are 
difficult  to  determine  reliably.  Direct  measure¬ 
ments  are  impossible,  since  the  magnetic  fields 
of  the  Earth  and  the  Sun  are  much  stronger.  How¬ 
ever,  using  various  sources  it  has  been  possible  to 
deduce  the  existence  and  strength  of  the  field. 

We  have  already  seen  that  interstellar  grains 
give  rise  to  interstellar  polarisation.  In  order  to 
polarise  light,  the  dust  grains  have  to  be  similarly 
oriented;  this  can  only  be  achieved  by  a  general 
magnetic  field.  Figure  16.27  shows  the  distribu¬ 
tion  of  interstellar  polarisation  over  the  sky.  Stars 
near  each  other  generally  have  the  same  polarisa¬ 
tion.  At  low  galactic  latitudes  the  polarisation  is 
almost  parallel  to  the  galactic  plane,  except  where 
one  is  looking  along  a  spiral  arm. 

More  precise  estimates  of  the  strength  of  the 
magnetic  field  can  be  obtained  from  the  rotation 
of  the  plane  of  polarisation  of  the  radio  radia¬ 
tion  from  distant  sources.  This  Faraday  rotation  is 
proportional  to  the  strength  of  the  magnetic  field 
and  to  the  electron  density.  Another  method  is  to 
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measure  the  Zeeman  splitting  of  the  21  cm  radio 
line.  These  measurements  have  fairly  consistently 
given  a  value  of  10-1°-10-9  T  for  the  strength  of 
the  interstellar  magnetic  field.  This  is  about  one 
millionth  of  the  interplanetary  field  in  the  solar 
system. 


16.9  Examples 

Example  16.1  Estimate  the  dust  grain  size  and 
number  density  in  the  galactic  plane. 

Let  us  compare  the  interstellar  extinction 
curve  in  Fig.  16.4(b)  with  the  Mie  scattering 
curves  in  Fig.  16.3.  We  see  that  the  leftmost 
parts  of  the  curves  may  correspond  to  each 
other:  the  interval  0  <  v  <  5  with  m  =  1.5  in 
Fig.  16.3  matches  the  interval  0  <  1/A  <  5  pm-1 
in  Fig.  16.4(b).  Remembering  that  v  =  27r a/X, 
this  suggests  a  constant  grain  radius  a ,  given  by 
2i ra  ~  1  pm,  or  0.16  pm. 

In  the  blue  wavelength  region  (A  =  0.44  pm), 
x  =  2.3  and,  according  to  the  upper  Fig.  16.3, 
£2 ex t  ~  2.  Using  A  =  2  mag  for  the  interstellar  ex¬ 
tinction  at  r  =  1  kpc,  we  get,  substituting  (16.5) 
into  (16.7),  n  ~  4  x  10-7  m-3.  This  should  give 
the  order  of  magnitude  of  the  interstellar  dust 
density. 

As  a  summary  we  could  say  that  a  consider¬ 
able  fraction  of  the  interstellar  extinction  might 
be  due  to  grains  of  diameter  0.3  pm  and  particle 
density  of  the  order  of  10-7  m-3  =  100  km-3. 

Example  16.2  Estimate  the  time  interval  be¬ 
tween  successive  collisions  of  a  hydrogen  atom 
in  interstellar  gas. 

Two  atoms  will  collide  if  the  separation  be¬ 
tween  their  centres  becomes  less  than  2 r,  where 
r  is  the  radius  of  the  atom.  Thus,  the  microscopic 
cross  section  for  the  collision  is  a  =  7r(2r)2  = 
4jtr2.  The  macroscopic  cross  section,  or  the  num¬ 
ber  of  collisions  of  an  H  atom  per  unit  length,  is 
then  U  =  n  cr,  where  n  is  the  number  density  of 
the  H  atoms.  The  mean  free  path  l  of  an  atom 
is  the  inverse  of  the  macroscopic  cross  section, 
l  =  1/27,  and  the  time  between  two  collisions  is 
t  =  1/  v,  where  v  is  the  velocity  of  the  atom. 

Considering  the  numerical  values,  the  Bohr  ra¬ 
dius  of  an  H  atom  is  r  =  5.3  x  10-11  m.  Taking 


n  =  1  cm-3  we  get  /  =  2.8  x  1013  m  ~  0.0009  pc. 
The  average  velocity  is  not  far  from  the  root  mean 
square  velocity  at  T  =  125  K,  given  by  (5.33): 

v  = ./—  =  1760  ms-1. 

V  m 

These  values  of  l  and  v  give  t  =  l/v  =  510  years 
for  the  collision  interval.  Taking  into  account  the 
velocity  distribution  in  the  gas,  the  mean  free  path 
appears  to  be  shorter  by  a  factor  of  1  /V2,  which 
reduces  the  time  to  about  400  years. 

Example  16.3  Consider  the  lowest  rotational 
transition  of  the  CO  molecule.  For  12CO  the  fre¬ 
quency  of  this  line  is  v(12CO)  =  115.27  GHz, 
and  for  13CO,  v(13CO)  =  110.20  GHz.  Estimate 
the  optical  thickness  of  each  line  in  a  molec¬ 
ular  cloud,  where  the  observed  brightness  tem¬ 
peratures  of  the  lines  are  7b(12CO)  =  40  K  and 
rb(13CO)  =  9K. 

For  the  12CO  line,  hv/k  =  5.5  K.  Thus,  the 
Rayleigh-Jeans  approximation  is  valid  if  the  tem¬ 
perature  is  considerably  higher  than  5  K.  This  is 
not  always  the  case,  but  the  measured  value  of 
7b  ( 12  CO)  suggests  that  the  approximation  can  be 
used. 

Ignoring  the  background,  (16.19)  gives 
7b  —  Texc  ( 1  —  e  v ) . 

The  optical  thickness  xv  is  proportional  to  the 
opacity  or  the  absorption  coefficient  av  [see 
(4.16)],  and  av  is  evidently  proportional  to  the 
number  of  CO  molecules  present.  Other  differ¬ 
ences  between  the  lines  are  small,  so  we  can  write 

ry(12CO)  «(12CO) 

_  ^  _ 

rv(13CO)  ~  tz(13CO)‘ 

Adopting  the  terrestrial  value  n(l2CO)/n(l3CO) 
=  89,  we  set 

rv(12CO)  =  89tv(13CO). 

Assuming  the  excitation  temperatures  equal  and 
denoting  rv(12CO)  by  r,  we  get 

7"exc(l  —  e-T)  =  40, 

7exc(l  —  e-r/89)  =  9. 
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The  solution  of  this  pair  of  equations  is 

tv(12CO)  =  23,  rv(13CO)  =  0.25, 
rexc  =  40  K. 

Thus,  the  12CO  line  seems  to  be  optically  thick, 
and  Tsxc  =  7b(12CO).  If  also  the  13CO  line 
were  optically  thick,  the  brightness  temperatures 
would  be  practically  equal,  and  the  optical  thick¬ 
nesses  could  not  be  determined. 


16.10  Exercises 

Exercise  16.1  Two  open  clusters,  which  are 
seen  near  each  other  in  the  galactic  plane,  have 
angular  diameters  a  and  3a ,  and  distance  mod¬ 
uli  16.0  and  11.0,  respectively.  Assuming  their 


actual  diameters  are  equal,  find  their  distances 
and  the  interstellar  extinction  coefficient  a  in 
(16.4). 

Exercise  16.2  Estimate  the  free  fall  velocity  on 
the  surface  of  a  spherical  gas  cloud  contracting 
under  the  influence  of  its  own  gravity.  Assume 
n( H2)  =  103  cm-3  and  R  =  5  pc. 

Exercise  16.3  The  force  F  exerted  by  a  mag¬ 
netic  field  B  on  a  charge  q  moving  with  veloc¬ 
ity  v  is  F  =  qv  x  B.  If  v  is  perpendicular  to  B , 
the  path  of  the  charge  is  circular.  Find  the  radius 
of  the  path  of  an  interstellar  proton  with  a  kinetic 
energy  of  1  MeV.  Use  B  =  0. 1  nT  for  the  galactic 
magnetic  field. 
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Several  collections  of  stars  can  be  picked  out  in 
the  sky,  even  with  the  naked  eye.  Closer  study  re¬ 
veals  that  they  really  do  form  separate  clusters  in 
space.  E.g.  the  Pleiades  in  Taurus  and  the  Hyades 
around  Aldebaran,  the  brightest  star  in  Taurus, 
are  such  open  star  clusters.  Almost  the  whole  of 
the  constellation  Coma  Berenices  is  also  an  open 
star  cluster.  Many  objects  appearing  as  nebulous 
patches  to  the  unaided  eye,  when  looked  at  with 
a  telescope,  turn  out  to  be  star  clusters,  like  Prae- 
sepe  in  the  constellation  Cancer,  or  double  cluster 
in  Perseus  (Fig.  17.1).  In  addition  to  open  clusters 
some  apparently  nebulous  objects  are  very  dense 
globular  clusters ,  such  as  those  in  Hercules  and 
in  Canes  Venatici  (Fig.  17.2). 

The  first  catalogue  of  star  clusters  was  pre¬ 
pared  by  the  French  astronomer  Charles  Messier 
in  1784.  The  first  version  contained  only  45 
objects,  but  later  Messier  himself  and  Pierre 
Mechain  expanded  the  catalogue  to  include  103 
objects.  Eater  seven  more  objects  were  added, 
possibly  observed  by  Messier.  Messier  was  inter¬ 
ested  in  comets,  not  in  those  fuzzy  objects,  and 
the  reason  for  the  catalogue  was  to  avoid  false 
alarms.  The  catalogue  contains  a  mixture  of  very 
different  objects,  like  about  30  globular  clusters 
and  the  same  number  of  open  clusters,  gas  nebu¬ 
lae  and  galaxies. 

A  larger  catalogue,  published  in  1888,  was  the 
New  General  Catalogue  of  Nebulae  and  Clus¬ 
ters  of  Stars  prepared  by  the  Danish  astronomer 
John  Louis  Emil  Dreyer.  The  catalogue  numbers 
of  objects  in  this  list  are  preceded  by  the  initials 
NGC.  For  example,  the  large  globular  cluster  in 


Hercules  is  object  M13  in  the  Messier  catalogue, 
and  it  is  also  known  as  NGC  6205.  The  NGC  cat¬ 
alogue  was  supplemented  with  the  Index  Cata¬ 
logue  in  1895  and  1910.  The  objects  of  this  cata¬ 
logue  are  given  the  initials  IC. 

With  a  small  telescope  stars  of  distant  clus¬ 
ters  and  galaxies  cannot  be  distinguished  as  sep¬ 
arate  objects;  instead  the  target  looks  nebulous. 
Therefore  their  true  nature  was  only  little  by  lit¬ 
tle  with  spectroscopy  and  large  telescopes.  That’s 
why  those  old  catalogues  contain  all  kinds  of  dif¬ 
ferent  objects. 

Even  now  there  is  no  catalogue  of  open  clus¬ 
ter  complete  to  some  limiting  magnitude.  The 
data  are  collected  from  several  catalogues  pub¬ 
lished  by  various  astronomers,  and  therefore  the 
nomenclature  is  not  consistent.  The  problem  is 
that  open  clusters  concentrate  close  the  plane  of 
Milky  Way,  and  there  are  a  lot  of  background 
stars.  Thus  identifying  the  cluster  members  may 
require  rather  detailed  studies  of  the  properties  of 
the  stars.  Also  the  members  may  already  have  dis¬ 
persed  to  a  very  wide  area  making  it  difficult  to 
decide  which  of  them  belong  to  the  same  clus¬ 
ter. 


17.1  Associations 

In  1947  the  Soviet  astronomer  Viktor  Amaza- 
spovich  Ambartsumyan  discovered  that  there  are 
groups  of  young  stars  scattered  over  so  large  re¬ 
gions  of  the  sky  that  they  would  be  very  difficult 
to  identify  merely  on  the  basis  of  their  appear¬ 
ance.  These  associations  may  have  a  few  tens  of 
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members.  One  association  is  found  around  the 
star  £  Persei,  and  in  the  region  of  Orion,  there  are 
several  associations. 

Associations  are  groups  of  very  young  stars. 
They  are  usually  identified  on  the  basis  either 
of  absolutely  bright  main  sequence  stars  or  of 
TTauri  stars.  According  to  the  type,  one  speaks 
of  OB  associations  and  TTauri  associations.  The 
most  massive  stars  of  spectral  class  O  stay  on  the 
main  sequence  for  only  a  few  million  years,  and 
therefore  associations  containing  them  are  neces¬ 
sarily  young.  The  T  Tauri  stars  are  even  younger 
stars  that  are  in  the  process  of  contracting  towards 
the  main  sequence. 

Studies  of  the  internal  motions  in  associations 
show  that  they  are  rapidly  dispersing.  There  are 


so  few  stars  in  an  association  that  their  gravity 
cannot  hold  them  together  for  any  length  of  time. 
The  observed  motions  have  often  confirmed  that 
the  stars  in  an  association  were  very  close  to¬ 
gether  a  few  million  years  ago  (Fig.  17.3). 

Large  amounts  of  interstellar  matter,  gas  and 
dust  nebulae  often  occur  in  connection  with  asso¬ 
ciations,  supplying  information  about  the  connec¬ 
tion  between  star  formation  and  the  interstellar 
medium.  Infrared  observations  have  shown  that 
stars  are  now  forming  or  have  recently  formed  in 
many  dense  interstellar  clouds. 

Associations  are  strongly  concentrated  in  the 
spiral  arms  in  the  plane  of  the  Milky  Way.  Both  in 
the  Orion  region  and  in  the  direction  of  Cepheus, 
three  generations  of  associations  have  been  iden- 


Fig.  17.1  Open  clusters,  (a)  The  Hyades  slightly  to  the 
lower  left  in  the  photograph.  Above  them  to  the  right  the 
Pleiades.  (Photograph  M.  Korpi.)  (b)  The  Pleiades  pho¬ 
tographed  with  the  Metsahovi  Schmidt  camera.  The  di¬ 
ameter  of  the  cluster  is  about  1°.  Reflection  nebulae  are 
visible  around  some  of  the  stars.  (Photograph  M.  Pouta- 


nen  and  H.  Virtanen,  Helsinki  University.)  h  and  x  Persei, 
the  double  cluster  in  Perseus.  The  separation  between 
the  clusters  is  about  25'.  Picture  taken  with  the  Met¬ 
sahovi  60-cm  Ritchey  Chretien  telescope.  (Photograph  T. 
Markkanen,  Helsinki  University) 
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Fig.  1 7.2  The  globular 
cluster  co  Centauri.  The 
picture  was  taken  with  the 
Danish  1.5-m  telescope  at 
La  Silla,  Chile.  Thanks  to 
the  excellent  seeing,  one 
can  see  through  the  entire 
cluster  in  some  places. 
(Photograph  T.  Korhonen, 
Turku  University) 


Fig.  17.3  £  Persei 
association.  O  and  B  stars 
are  shown  as  open  circles. 
The  proper  motion  vectors 
show  the  movements  of  the 
stars  in  the  next  500,000 
years 


tified,  the  oldest  ones  being  most  extended  and 
the  youngest  ones,  most  dense. 


1 7.2  Open  Star  Clusters 

Open  clusters  usually  contain  from  a  few  tens  to 
a  few  hundreds  of  stars.  The  kinetic  energy  of 
the  cluster  members,  the  differential  rotation  of 


the  Milky  Way  (Sect.  17.3)  and  external  gravita¬ 
tional  disturbances  tend  to  gradually  disperse  the 
open  clusters.  Still,  many  of  them  are  fairly  per¬ 
manent;  for  example,  the  Pleiades  is  many  hun¬ 
dreds  of  millions  of  years  old,  but  nevertheless, 
quite  a  dense  cluster. 

The  distances  of  star  clusters — and  also  of 
associations — can  be  obtained  from  the  photo- 


362 


17  Star  Clusters  and  Associations 


metric  or  spectroscopic  distances  of  their  bright¬ 
est  members.  For  the  nearest  clusters,  in  partic¬ 
ular  for  the  Hyades,  one  can  use  the  method  of 
kinematic  parallaxes ,  which  is  based  on  the  fact 
that  the  stars  in  a  cluster  all  have  the  same  average 
space  velocity  with  respect  to  the  Sun.  The  proper 
motions  in  the  Hyades  are  shown  in  Fig.  17.4. 
They  all  appear  to  be  directed  to  the  same  point. 
Figure  17.5  explains  how  this  convergence  can  be 
understood  as  an  effect  of  perspective,  if  all  clus¬ 
ter  members  have  the  same  velocity  vector  with 
respect  to  the  observer.  Let  6  be  the  angular  dis¬ 
tance  of  a  given  star  from  the  convergence  point. 
The  angle  between  the  velocity  of  the  star  and 


the  line  of  sight  will  then  also  be  6.  The  veloc¬ 
ity  components  along  the  line  of  sight  and  at  right 
angles  to  it,  vv  and  ut,  are  therefore  given  by 


vY  =  v  cos  6, 
vt  =  v  sin  0 . 


(17.1) 


The  radial  velocity  vY  can  be  measured  from  the 
Doppler  shift  of  the  stellar  spectrum.  The  tangen¬ 
tial  velocity  is  related  to  the  proper  motion  /i 
and  the  distance  r : 


vt  =  fir.  (17.2) 


Thus  the  distance  can  be  calculated: 
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Fig.  17.4  Proper  motions  of  the  Hyades.  The  vectors 
show  the  movement  of  the  stars  in  about  10,000  years, 
(van  Bueren,  H.G.  (1952):  Bull.  Astr.  Inst.  Neth.  11) 


Fig.  17.5  If  all  stars  move  in  the  same  direction,  their  tan¬ 
gential  velocity  components  appear  to  be  directed  towards 
the  convergence  point  K 


Vt  v  sin  0  Vr 
r  =  —  = - =  —  tan  <9. 

fi  fi  fi 
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By  means  of  this  method,  the  distances  of  the 
individual  stars  can  be  determined  from  the  mo¬ 
tion  of  the  cluster  as  a  whole.  Since  the  method 
of  (ground-based)  trigonometric  parallaxes  is  re¬ 
liable  only  out  to  a  distance  of  30  pc,  the  mov¬ 
ing  cluster  method  is  an  indispensable  way  of  de¬ 
termining  stellar  distances.  The  distance  of  the 
Hyades  obtained  in  this  way  is  about  40  pc.  The 
distance  obtained  from  the  trigonometric  paral¬ 
laxes  measured  directly  by  the  Hipparcos  satellite 
in  the  1990’s  is  46  pc.  The  Hyades  is  the  nearest 
open  cluster. 

The  observed  HR  diagram  or  the  correspond¬ 
ing  colour-magnitude  diagram  of  the  Hyades 
and  other  nearby  star  clusters  show  a  very  well- 
defined  and  narrow  main  sequence  (Fig.  17.6). 
Most  of  the  cluster  members  are  main  sequence 
stars;  there  are  only  a  few  giants.  There  are  quite 
a  few  stars  slightly  less  than  one  magnitude  above 
the  main  sequence.  These  are  apparently  binary 
stars  whose  components  have  not  been  resolved. 
To  see  this,  let  us  consider  a  binary,  where  both 
components  have  the  same  magnitude  m  and  the 
same  colour  index.  If  this  system  is  unresolved, 
the  colour  index  will  still  be  the  same,  but  the  ob¬ 
served  magnitude  will  be  m  -  0.75,  i.e.  slightly 
less  than  one  magnitude  brighter. 

The  main  sequences  of  open  clusters  are  gen¬ 
erally  located  in  the  same  section  of  the  HR  or 
colour-magnitude  diagram  (Fig.  17.7).  This  is  be¬ 
cause  the  material  from  which  the  clusters  formed 
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Fig.  17.6  Colour-magnitude  diagram  of  the  Hyades.  Ap¬ 
parent  visual  magnitude  on  the  left-hand  vertical  axis;  ab¬ 
solute  visual  magnitude  on  the  right-hand  one 


Fig.  17.7  Schematic  colour-magnitude  diagrams  of  star 
clusters.  M3  is  a  globular  cluster;  the  others  are  open  clus¬ 
ters.  Cluster  ages  are  shown  along  the  main  sequence.  The 
age  of  a  cluster  can  be  told  from  the  point  where  its  stars 
begin  to  turn  off  the  main  sequence.  (Sandage,  A.  (1956): 
Publ.  Astron.  Soc.  Pac.  68,  498) 


has  not  varied  much,  i.e.  their  initial  chemical 
composition  has  been  fairly  constant.  In  younger 
clusters  the  main  sequence  extends  to  brighter 
and  hotter  stars  and  earlier  spectral  types.  Usu¬ 
ally  one  can  clearly  see  the  point  in  the  diagram 


where  the  main  sequence  ends  and  bends  over  to¬ 
wards  the  giant  branch.  This  point  will  depend 
very  strongly  on  the  age  of  the  cluster.  It  can 
therefore  be  used  in  determining  the  ages  of  open 
clusters.  Star  clusters  are  of  central  importance  in 
the  study  of  stellar  evolution. 

The  colour-magnitude  diagrams  of  star  clus¬ 
ters  can  also  be  used  to  determine  their  distances. 
The  method  is  called  main  sequence  fitting.  By 
means  of  multicolour  photometry  the  reddening 
due  to  interstellar  dust  can  be  removed  from  the 
observed  colours  B  —  V  of  the  stars,  yielding  the 
intrinsic  colours  ( B  —  V)o.  Most  star  clusters  are 
so  far  away  from  us  that  all  cluster  members  can 
be  taken  to  be  at  the  same  distance.  The  distance 
modulus 


mVo  -  My  =  5  lg 


r 

10  pc 


(17.4) 


will  then  be  the  same  for  all  members.  In  (17.4), 
my0  is  the  apparent,  My  the  absolute  visual  mag¬ 
nitude  of  a  star,  and  r  the  distance.  It  has  been 
assumed  that  the  extinction  due  to  interstellar 
dust  Ay  has  been  determined  from  multicolour 
photometry  and  its  effect  removed  from  the  ob¬ 
served  visual  magnitude  my : 


my()  =  my  —  Ay. 


When  the  observed  colour-magnitude  dia¬ 
gram  of  the  cluster  is  plotted  using  the  apparent 
magnitude  my0  rather  than  the  absolute  magni¬ 
tude  My  on  the  vertical  axis,  the  only  change 
will  be  that  the  position  of  the  main  sequence 
is  shifted  vertically  by  an  amount  correspond¬ 
ing  to  the  distance  modulus.  The  observed  (my0, 
(B  —  V) o)  diagram  may  now  be  compared  with 
the  Hyades  (My,  (B  —  V)o)  diagram  used  as 
a  standard.  By  demanding  that  the  main  se¬ 
quences  of  the  two  diagrams  agree,  the  distance 
modulus  and  hence  the  distance  can  be  deter¬ 
mined.  The  method  is  very  accurate  and  efficient. 
It  can  be  used  to  determine  cluster  distances  out 
to  many  kiloparsecs. 


1 7.3  Globular  Star  Clusters 

Globular  star  clusters  usually  contain  about  105 
stars.  The  distribution  of  the  stars  is  spherically 
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symmetric,  and  the  central  densities  are  about  ten 
times  larger  than  in  open  clusters.  Stars  in  glob¬ 
ular  clusters  are  among  the  oldest  in  the  Milky 
Way,  and  therefore  they  are  of  great  importance 
for  studies  of  stellar  evolution.  There  are  about 
150-200  globular  clusters  in  the  Milky  Way. 

The  colour-magnitude  diagram  of  a  typical 
globular  cluster  is  shown  in  Fig.  17.8.  The  main 
sequence  only  contains  faint  red  stars;  there  is 
a  prominent  giant  branch,  and  the  horizontal  and 
asymptotic  branches  are  clearly  seen.  The  main 
sequence  is  lower  than  that  of  the  open  clusters, 
because  the  metal  abundance  is  much  lower  in  the 
globular  clusters. 

The  horizontal  branch  stars  have  a  known  ab¬ 
solute  magnitude,  which  has  been  calibrated  us¬ 
ing  principally  RRLyrae  type  variables.  Because 
the  horizontal  branch  stars  are  bright,  they  can 
be  observed  even  in  distant  clusters,  and  thus  us¬ 
ing  them  the  distances  of  globular  clusters  can  be 
well  determined. 

Using  the  known  distances,  the  linear  sizes  of 
globular  clusters  can  be  calculated.  It  is  found 
that  most  of  the  mass  is  concentrated  to  a  central 
core  with  a  radius  of  about  0.3-10  pc.  Outside 
this  there  is  an  extended  envelope  with  a  radius 
that  may  be  10-100  times  larger.  At  even  larger 
radii  stars  will  escape  from  the  cluster  because  of 
the  tidal  force  of  the  Galaxy. 


Fig.  17.8  Colour-magnitude  diagram  of  the  globular 
cluster  M5.  In  addition  to  the  main  sequence  one  can  see 
the  giant  branch  bending  to  the  right  and  to  its  left  the  hor¬ 
izontal  brunch.  (Arp,  H.  (1962):  Astrophys.  J.  135,  311) 


The  masses  of  globular  clusters  can  be  roughly 
estimated  from  the  virial  theorem,  if  the  stel¬ 
lar  velocities  in  the  cluster  have  been  measured. 
More  precise  values  are  calculated  by  fitting  theo¬ 
retical  models  to  the  observed  density  and  veloc¬ 
ity  distributions.  In  this  way  masses  in  the  range 
104— 106  Mq  have  been  obtained. 

The  globular  clusters  in  the  Milky  Way  fall 
into  two  classes.  In  the  classification  given  in 
Table  18.1  these  correspond  to  intermediate  and 
halo  population  II.  The  disk  globular  clusters  are 
concentrated  towards  the  centre  and  the  plane  of 
the  Milky  Way  and  they  form  a  system  that  is 
rotating  with  the  general  rotation  of  the  Milky 
Way.  In  contrast,  the  halo  clusters  are  almost 
spherically  distributed  in  an  extensive  distribution 
reaching  out  to  at  least  35  kpc.  The  system  of  halo 
clusters  does  not  rotate,  but  instead  the  velocities 
of  individual  clusters  are  uniformly  distributed  in 
all  directions.  The  abundance  of  heavy  elements 
is  also  different  in  the  two  classes  of  clusters.  For 
disk  clusters  it  is  typically  about  30  %  of  the  so¬ 
lar  value,  for  halo  clusters  it  is  only  about  1  %.  In 
some  clusters  the  heavy  element  abundances  are 
only  10~3  times  the  solar  value.  The  smallest  val¬ 
ues,  as  low  as  10-4-10-5  have  been  detected  in 
some  field  stars  of  the  halo.  They  therefore  give 
important  information  about  the  production  of  el¬ 
ements  in  the  early  Universe  and  during  the  for¬ 
mation  of  the  Milky  Way. 

All  globular  clusters  are  old,  and  the  halo  clus¬ 
ters  are  among  the  oldest  known  astronomical  ob¬ 
jects.  Determining  a  precise  age  is  difficult,  and 
requires  both  accurate  observations  of  the  turn¬ 
off  point  of  the  main  sequence  in  the  HR  dia¬ 
gram,  as  well  detailed  theoretical  stellar  evolu¬ 
tion  models.  The  ages  obtained  have  been  about 
13  x  109  years.  This  age  is  close  to  the  age  of 
the  Universe  calculated  from  its  rate  of  expansion 
(see  Chap.  20). 


17.4  Example 

Example  17.1  Assume  that  a  globular  cluster 
has  a  diameter  of  40  pc  and  contains  100,000  stars 
of  one  solar  mass  each. 

(a)  Use  the  virial  theorem  to  find  the  average  ve¬ 
locity  of  the  stars.  You  can  assume  that  the 
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average  distance  between  stars  equals  the  ra¬ 
dius  of  the  cluster. 

(b)  Find  the  escape  velocity. 

(c)  Comparing  these  velocities,  can  you  tell 
something  about  the  stability  of  the  cluster? 


(a)  First,  we  have  to  estimate  the  potential  en¬ 
ergy.  There  are  n(n  —  l)/2  ^  n2 /2  pairs  of 
stars  in  the  cluster,  and  the  average  distance 
of  each  pair  is  R.  Thus  the  potential  energy  is 
about 

2  2 
m  n 


where  m  =  1  M© .  The  kinetic  energy  is 


(b)  The  escape  velocity  from  the  edge  of  the 
cluster  is 


2  Gmn 


vP  = 


R 


=  y/4v^  =  2v  =  6  kms  1 


(c)  No.  The  average  velocity  seems  to  be  smaller 
than  the  escape  velocity,  but  it  was  derived 
from  the  virial  theorem  assuming  that  the 
cluster  is  stable. 
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T  =  - mv2n , 

2 

where  v  is  the  root  mean  square  velocity.  Ac¬ 
cording  to  the  virial  theorem  we  have  T  = 
—  1/2  U,  whence 

1  9  1  m2  n2 

-mv  n  =  -G - . 

2  2  R  2 

Solving  for  the  velocity  we  get 
v2 

Gmn 

~  2  R 

6.7x10“ 11  m3kg-1  s“2x2.0xl03°  kgxlO5 
~~  40x3.1  x  1016  m 

=  1 . 1  X  107  ITT  S-2, 


Exercise  17.1  A  globular  cluster  consists  of 
100,000  stars  of  the  solar  absolute  magnitude. 
Calculate  the  total  apparent  magnitude  of  the 
cluster,  if  its  distance  is  10  kpc. 

Exercise  17.2  If  the  apparent  magnitude  of  the 
cluster  of  the  previous  exercise  is  10,  what  is  its 
distance  if  the  interstellar  absorption  in  the  direc¬ 
tion  of  the  cluster  is  1.5  mag/kpc? 

Exercise  17.3  The  Pleiades  open  cluster  con¬ 
tains  230  stars  within  4  pc.  Estimate  the  veloci¬ 
ties  of  the  stars  in  the  cluster  using  the  virial  the¬ 
orem.  For  simplicity,  let  the  mass  of  each  star  be 
replaced  by  1  M© . 


which  gives  v  %  3  kms  1 


The  Milky  Way 


On  clear,  moonless  nights  a  nebulous  band  of 
light  can  be  seen  stretching  across  the  sky.  This 
is  the  Milky  Way  (Fig.  18.1).  The  name  is  used 
both  for  the  phenomenon  in  the  sky  and  for  the 
large  stellar  system  causing  it.  The  Milky  Way 
system  is  also  called  the  Galaxy — with  a  capital 
letter.  The  general  term  galaxy  is  used  to  refer  to 
the  countless  stellar  systems  more  or  less  like  our 
Milky  Way. 

The  band  of  the  Milky  Way  extends  round  the 
whole  celestial  sphere.  It  is  a  huge  system  con¬ 
sisting  mostly  of  stars,  among  them  the  Sun.  The 
stars  of  the  Milky  Way  form  a  flattened  disk-like 
system.  In  the  direction  of  the  plane  of  the  disk 
huge  numbers  of  stars  are  visible,  whereas  rela¬ 
tively  few  are  seen  in  the  perpendicular  direction. 
The  faint  light  of  distant  stars  merges  into  a  uni¬ 
form  glow,  and  therefore  the  Milky  Way  appears 
as  a  nebulous  band  to  the  naked  eye.  A  long- 
exposure  photograph  reveals  hundreds  of  thou¬ 
sands  of  stars  (Fig.  18.2). 

In  the  early  17th  century  Galileo  Galilei ,  us¬ 
ing  his  first  telescope,  discovered  that  the  Milky 
Way  consists  of  innumerable  stars.  Already  in 
the  late  1 8th  century  William  Herschel  attempted 
to  determine  the  size  and  shape  of  the  Milky 
Way  by  means  of  star  counts.  Only  early  in  the 
20th  century  did  the  Dutch  astronomer  Jacobus 
Kapteyn  obtain  the  first  estimate  of  the  size  of 
the  Milky  Way.  The  true  size  of  the  Milky  Way 
and  the  Sun’s  position  in  it  became  clear  finally 
in  the  1920’s  from  Harlow  Shapley’s  studies  of 
the  space  distribution  of  globular  clusters. 


In  studying  the  structure  of  the  Milky  Way,  it  is 
convenient  to  choose  a  spherical  coordinate  sys¬ 
tem  so  that  the  fundamental  plane  is  the  symme¬ 
try  plane  of  the  Milky  Way.  This  is  defined  as  the 
symmetry  plane  of  the  distribution  of  neutral  hy¬ 
drogen,  and  it  agrees  quite  closely  with  the  sym¬ 
metry  plane  defined  by  the  distribution  of  stars  in 
the  solar  neighbourhood  (within  a  few  kpc). 

The  basic  direction  in  the  fundamental  plane 
has  been  chosen  to  be  the  direction  of  the  centre 
of  the  Milky  Way.  This  is  located  in  the  constella¬ 
tion  Sagittarius  ( a  =  17  h  45.7  min,  8  =  —  29°  OCf, 
epoch  2000.0)  at  a  distance  of  about  8.5  kpc.  The 
galactic  latitude  is  counted  from  the  plane  of  the 
Galaxy  to  its  pole,  going  from  0°  to  +  90° ,  and 
to  the  galactic  south  pole,  from  0°  to  —  90°.  The 
galactic  coordinate  system  is  shown  in  Fig.  18.3 
(see  also  Sect.  2.8). 


1 8.1  Methods  of  Distance 
Measurement 

In  order  to  study  the  structure  of  the  Milky  Way, 
one  needs  to  know  how  various  kinds  of  objects, 
such  as  stars,  star  clusters  and  interstellar  matter, 
are  distributed  in  space.  The  most  important  ways 
of  measuring  the  distances  will  first  be  consid¬ 
ered. 

Trigonometric  Parallaxes  The  method  of  tri¬ 
gonometric  parallaxes  is  based  on  the  apparent 
yearly  back-and-forth  movement  of  stars  in  the 
sky,  caused  by  the  orbital  motion  of  the  Earth. 
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Fig.  18.1  The  nebulous  band  of  the  Milky  Way  stretches  across  the  entire  sky.  (Photograph  M.  and  T.  Keskiila,  Lund 
Observatory) 


From  Earth-based  observations  the  trigonometric 
parallaxes  can  be  reliably  measured  out  to  a  dis¬ 
tance  of  about  30  pc;  beyond  100  pc  this  method 
is  no  longer  useful.  The  situation  is,  however, 
changing.  The  limit  has  already  been  pushed  to 
a  few  hundred  parsecs  by  the  Hipparcos  satellite, 
and  Gaia  will  mean  another  major  leap  in  the  ac¬ 
curacy. 

The  Motion  of  the  Sun  with  Respect  to  the 
Neighbouring  Stars.  The  Local  Standard  of 
Rest  The  motion  of  the  Sun  with  respect  to  the 
neighbouring  stars  is  reflected  in  their  proper  mo¬ 
tions  and  radial  velocities  (Fig.  18.4).  The  point 
towards  which  the  Sun’s  motion  among  the  stars 
seems  to  be  directed  is  called  the  apex.  The  op¬ 
posite  point  is  the  antapex.  The  stars  near  the 
apex  appear  to  be  approaching;  their  radial  veloc¬ 
ities  have  the  largest  negative  values,  on  the  av¬ 
erage.  In  the  direction  of  the  antapex  the  largest 
(positive)  radial  velocities  are  observed.  On  the 
great  circle  perpendicular  to  the  apex-antapex  di¬ 
rection,  the  radial  velocities  are  zero  on  the  aver¬ 
age,  but  the  proper  motions  are  large.  The  average 
proper  motions  decrease  towards  the  apex  and  the 
antapex,  but  always  point  from  the  apex  towards 
the  antapex. 


In  order  to  study  the  true  motions  of  the  stars, 
one  has  to  define  a  coordinate  system  with  respect 
to  which  the  motions  are  to  be  defined.  The  most 
practical  frame  of  reference  is  defined  so  that  the 
stars  in  the  solar  neighbourhood  are  at  rest,  on 
the  average.  More  precisely,  this  local  standard 
of  rest  (LSR)  is  defined  as  follows. 

Let  us  suppose  the  velocities  of  the  stars  be¬ 
ing  considered  are  distributed  at  random.  Their 
velocities  with  respect  to  the  Sun,  i.e.  their  radial 
velocities,  proper  motions  and  distances,  are  as¬ 
sumed  to  be  known.  The  local  standard  of  rest  is 
then  defined  so  that  the  mean  value  of  the  veloc¬ 
ity  vectors  is  opposite  to  the  velocity  of  the  Sun 
with  respect  to  the  LSR.  Clearly  the  mean  veloc¬ 
ity  of  the  relevant  stars  with  respect  to  the  LSR 
will  then  be  zero.  The  motion  of  the  Sun  with  re¬ 
spect  to  the  LSR  is  found  to  be: 


Apex  coordinates 

a  =  18  h  00  min  =  270° 

1  =  56° 

5  =  +30° 

b  =  +23° 

Solar  velocity 

vq  =  19.7  kms-1 

The  apex  is  located  in  the  constellation  of  Her¬ 
cules.  When  the  sample  of  stars  used  to  deter¬ 
mine  the  LSR  is  restricted  to  a  subset  of  all  the 
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Fig.  18.2  A  section  of 
about  40°  of  the  Milky 
Way  between  the 
constellations  of  Cygnus 
and  Aquila.  The  brightest 
star  at  the  upper  right  is 
Vega  (a  Lyrae). 
(Photograph  Palomar 
Observatory) 


stars  in  the  solar  neighbourhood,  e.g.  to  stars  of 
a  given  spectral  class,  the  sample  will  usually 
have  slightly  different  kinematic  properties,  and 
the  coordinates  of  the  solar  apex  will  change  cor¬ 
respondingly. 

The  velocity  of  an  individual  star  with  respect 
to  the  local  standard  of  rest  is  called  the  pecu¬ 
liar  motion  of  the  star.  The  peculiar  velocity  of 
a  star  is  obtained  by  adding  the  velocity  of  the 
Sun  with  respect  to  the  LSR  to  the  measured  ve¬ 
locity.  Naturally  the  velocities  should  be  treated 
as  vectors. 

The  local  standard  of  rest  is  at  rest  only  with 
respect  to  a  close  neighbourhood  of  the  Sun.  The 
Sun  and  the  nearby  stars,  and  thus  also  the  LSR, 


are  moving  round  the  centre  of  the  Milky  Way  at 
a  speed  that  is  ten  times  greater  than  the  typical 
peculiar  velocities  of  stars  in  the  solar  neighbour¬ 
hood  (Fig.  18.5). 

Statistical  Parallaxes  The  velocity  of  the  Sun 
with  respect  to  neighbouring  stars  is  about 
20  kms-1.  This  means  that  in  one  year,  the  Sun 
moves  about  4  au  with  respect  to  the  stars. 

Let  us  consider  a  star  S  (Fig.  18.6),  whose  an¬ 
gular  distance  from  the  apex  is  &  and  which  is  at 
a  distance  r  from  the  Sun.  In  a  time  interval  t  the 
star  will  move  away  from  the  apex  at  the  angular 
velocity  u/t  =  pc  a  because  of  the  solar  motion. 
In  the  same  time  interval,  the  Sun  will  move  the 
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distance  s .  The  sine  theorem  for  triangles  yields 
,  s  sin  1}  s  sin  ft 

r^r'=— - « - ,  (18.1) 

sinw  u 

because  the  distance  remains  nearly  unchanged 
and  the  angle  u  is  very  small.  In  addition  to  the 
component  /xa  due  to  solar  motion,  the  observed 
proper  motion  has  a  component  due  to  the  pe¬ 
culiar  velocity  of  the  star.  This  can  be  removed 


Fig.  18.3  The  directions  to  the  galactic  centre  and  the 
North  galactic  pole  (NGP)  in  equatorial  coordinates.  The 
galactic  longitude  /  is  measured  from  the  galactic  centre 
along  the  galactic  plane.  The  coordinates  of  the  Galactic 
centre  are  precessed  from  the  defining  equinox  1950  and 
are  not  very  accurate  (see  A.P.  Lane  (1979),  PASP,  91, 
405) 


Fig.  18.4  Because  of  the  motion  of  the  Sun  towards  the 
apex,  the  average  radial  velocity  of  the  nearby  stars  ap¬ 
pears  largest  in  the  apex  and  antapex  directions 


by  taking  an  average  of  (18.1)  for  a  sample  of 
stars,  since  the  peculiar  velocities  of  the  stars  in 
the  solar  neighbourhood  can  be  assumed  to  be 
randomly  distributed.  By  observing  the  average 
proper  motion  of  objects  known  to  be  at  the  same 
distance  one  thus  obtains  their  actual  distance. 
A  similar  statistical  method  can  be  applied  to  ra¬ 
dial  velocities. 

Objects  that  are  at  the  same  distance  can  be 
found  as  follows.  We  know  that  the  distance  mod¬ 
ulus  m  —  M  and  the  distance  r  are  related  accord¬ 
ing  to: 

ra  —  M  =  51g(r/10  pc)  +  A(r),  (18.2) 

where  A  is  the  interstellar  extinction.  Thus  ob¬ 
jects  that  have  the  same  apparent  and  the  same 
absolute  magnitude  will  be  at  the  same  distance. 
It  should  be  noted  that  we  need  not  know  the  ab¬ 
solute  magnitude  as  long  as  it  is  the  same  for  all 
stars  in  the  sample.  Suitable  classes  of  stars  are 


Fig.  18.5  The  local  standard  of  rest,  defined  by  the  stars 
in  the  solar  neighbourhood,  moves  with  respect  to  the 
galactic  centre.  However,  the  average  value  of  the  stellar 
peculiar  velocities  with  respect  to  the  LSR  is  zero 


Fig.  18.6  When  the  Sun  has  moved  the  distance  s  to¬ 
wards  the  apex,  the  direction  to  the  star  S  appears  to  have 
changed  by  the  angle  u 
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main  sequence  A4  stars,  RRLyrae  variables  and 
classical  cepheids  with  some  given  period.  The 
stars  in  a  cluster  are  also  all  at  the  same  distance. 
This  method  has  been  used,  for  example,  to  de¬ 
termine  the  distance  to  the  Hyades  as  explained 
in  Sect.  17.2. 

Parallaxes  based  on  the  peculiar  or  apex  mo¬ 
tion  of  the  Sun  are  called  statistical  or  secular  par¬ 
allaxes. 

Main  Sequence  Fitting  If  the  distance  of  a 
cluster  is  known,  it  is  possible  to  plot  its  HR  dia¬ 
gram  with  the  absolute  magnitude  as  the  vertical 
coordinate.  Another  cluster,  whose  distance  is  to 
be  determined,  can  then  be  plotted  in  the  same 
diagram  using  the  apparent  magnitudes  as  the 
vertical  coordinate.  Now  the  vertical  distance  of 
the  main  sequences  tells  how  much  the  apparent 
magnitudes  differ  from  the  absolute  ones.  Thus 
the  distance  modulus  m  —  M  can  be  measured. 
This  method,  known  as  the  main  sequence  fitting, 
works  for  clusters  whose  stars  are  roughly  at  the 
same  distance;  if  the  distances  vary  too  much, 
a  clear  main  sequence  cannot  be  distinguished. 

Photometric  Parallaxes  The  determination  of 
the  distance  directly  from  (18.2)  is  called  the 
photometric  method  of  distance  determination 
and  the  corresponding  parallax,  the  photomet¬ 
ric  parallax.  The  most  difficult  task  when  using 
this  method  usually  involves  finding  the  absolute 
magnitude;  there  are  many  ways  of  doing  this. 
For  example,  the  two-dimensional  MKK  spec¬ 
tral  classification  allows  one  to  determine  the  ab¬ 
solute  magnitude  from  the  spectrum.  The  abso¬ 
lute  magnitudes  of  cepheids  can  be  obtained  from 
their  periods.  A  specially  useful  method  for  star 
clusters  is  the  procedure  of  main  sequence  fitting. 
A  condition  for  the  photometric  method  is  that 
the  absolute  magnitude  scale  first  be  calibrated  by 
some  other  method. 

Trigonometric  parallaxes  do  not  reach  very 
far.  For  example,  even  with  the  Hipparcos  satel¬ 
lite,  only  a  few  cepheid  distances  have  been  ac¬ 
curately  measured  by  this  method.  The  method 
of  statistical  parallaxes  is  indispensable  for  cal¬ 
ibrating  the  absolute  magnitudes  of  bright  ob¬ 
jects.  When  this  has  been  done,  the  photometric 


Fig.  18.7  The  observed  luminosity  function  0(My)  and 
the  initial  luminosity  function  ^  (My)  for  main  sequence 
stars  in  the  solar  neighbourhood.  The  functions  give  the 
number  of  stars  per  cubic  parsec  in  the  magnitude  inter¬ 
val  [My  —  1/2,  My  +  1/2];  they  are  actually  the  prod¬ 
ucts  D(P  and  D& ,  where  D  is  the  stellar  density  function 
(in  the  solar  neighbourhood) 

method  can  be  used  to  obtain  distances  of  objects 
even  further  away. 

Other  examples  of  indicators  of  brightness,  lu¬ 
minosity  criteria,  are  characteristic  spectral  lines 
or  the  periods  of  cepheids.  Again,  their  use  re¬ 
quires  that  they  first  be  calibrated  by  means  of 
some  other  method.  It  is  a  characteristic  fea¬ 
ture  of  astronomical  distance  determinations  that 
the  measurement  of  large  distances  is  based  on 
knowledge  of  the  distances  to  nearer  objects. 


1 8.2  Stellar  Statistics 

The  Stellar  Luminosity  Function  By  system¬ 
atically  observing  all  stars  in  the  solar  neighbour¬ 
hood,  one  can  find  the  distribution  of  their  abso¬ 
lute  magnitudes.  This  is  given  by  the  luminosity 
function  @(M)  (Fig.  18.7),  which  gives  the  rela¬ 
tive  number  of  main  sequence  stars  with  absolute 
magnitudes  in  the  range  [M  —  1/2,  M  +  1/2].  No 
stars  appear  to  be  forming  at  present  in  the  re¬ 
gion  of  space  where  the  luminosity  function  has 
been  determined.  The  age  of  the  Milky  Way  is 
10-15  Ga,  which  means  that  all  stars  less  massive 
than  0.9  Mq,  will  still  be  on  the  main  sequence. 
On  the  other  hand,  more  massive  stars,  formed 
early  in  the  history  of  the  Milky  Way,  will  have 
completed  their  evolution  and  disappeared.  Low- 
mass  stars  have  accumulated  in  the  luminosity 
function  for  many  generations  of  star  formation, 


372 


18  The  Milky  Way 


whereas  bright,  high-mass  stars  are  the  result  of 
recent  star  formation. 

By  taking  into  account  the  different  main  se¬ 
quence  lifetimes  of  stars  of  different  masses  and 
hence  of  different  magnitudes,  one  can  deter¬ 
mine  the  initial  luminosity  function  0 (AT),  which 
gives  the  brightness  distribution  at  the  time  of  star 
formation,  the  zero  age  main  sequence  luminos¬ 
ity  function.  The  relation  between  the  function  0 
and  the  observed  luminosity  function  is 

0(M)  =  0(M)To/tE(M),  (18.3) 

where  To  is  the  age  of  the  Milky  Way  and 
tE(M)  is  the  main  sequence  lifetime  of  stars  of 
magnitude  M.  Here  we  assume  that  the  birth  rate 
of  stars  of  magnitude  M  has  remained  constant 
during  the  lifetime  of  the  Milky  Way.  The  initial 
luminosity  function  is  shown  in  Fig.  18.7. 


The  Fundamental  Equation  of  Stellar  Statis¬ 
tics.  The  Stellar  Density  A  crucial  problem  for 
studies  of  the  structure  of  the  Milky  Way  is  to  find 
out  how  the  density  of  stars  varies  in  space.  The 
number  of  stars  per  unit  volume  at  a  distance  r  in 
the  direction  (/,  b)  from  the  Sun  is  given  by  the 
stellar  density  D  =  D(r,  /,  b). 

The  stellar  density  cannot  be  directly  observed 
except  in  the  immediate  neighbourhood  of  the 
Sun.  However,  it  can  be  calculated  if  one  knows 
the  luminosity  function  and  the  interstellar  ex¬ 
tinction  as  a  function  of  distance  in  a  given  direc¬ 
tion.  In  addition  the  number  of  stars  per  unit  solid 
angle  (e.g.  per  square  arc  second)  can  be  deter¬ 
mined  as  a  function  of  limiting  apparent  magni¬ 
tude  by  means  of  star  counts  (Fig.  18.8). 

Let  us  consider  the  stars  within  the  solid  an¬ 
gle  co  in  the  direction  (/,  b)  and  in  the  distance 
range  [r,  r  +  dr]  (Fig.  18.9).  We  let  their  lumi¬ 
nosity  function  @(M)  be  the  same  as  in  the  so¬ 
lar  neighbourhood  and  their  unknown  stellar  den¬ 
sity  D.  The  absolute  magnitude  M  of  the  stars  of 
apparent  magnitude  m  is,  as  usual, 

M  —  m  —  5  lg(r/10  pc)  —  A(r). 

The  number  of  stars  in  the  apparent  magnitude 
interval  [m  —  0.5,  ra  +  0.5]  in  the  volume  element 


Fig.  18.8  The  stellar  density  is  determined  by  means  of 
star  counts.  In  practice,  the  counting  is  done  on  photo¬ 
graphic  plates.  (Cartoon  S.  Harris) 


Fig.  18.9  The  size  of  the  volume  element  at  distance  r  in 
the  direction  (/,  b )  is  cor2  dr 


dV  =  cor2  dr  at  distance  r  is  (Fig.  18.9) 
d N(m)  =  D(r,  /,  b) 

dV. 
(18.4) 

The  stars  of  apparent  magnitude  m  in  the  given 
area  of  the  sky  will  in  reality  be  at  many  differ¬ 
ent  distances.  In  order  to  obtain  their  total  num¬ 
ber  N(m ),  one  has  to  integrate  d N(m)  over  all 
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Fig.  18.10  The  stellar 
density  near  the  Sun. 

(a)  The  stellar  density  of 
spectral  classes  A2-A5  in 
the  galactic  plane, 
according  to 

S. W.  McCuskey.  The 
numbers  next  to  the 
isodensity  curves  give  the 
number  of  stars  in 
10,000  pc3,  (b)  The 
distribution  of  different 
spectral  classes 
perpendicularly  to  the 
galactic  plane  according  to 

T.  Elvius.  The  density  in 
the  galactic  plane  has  been 
normalised  to  one 


a)  l  =  180°  b) 


D 


distances  r : 


'OO 


N(m)  =  f  D(r,  /,  b) 

Jo 


X  0 


m  —  51g 


10  pc 


- A(r ) 


cor2  dr. 
(18.5) 


Equation  (18.5)  is  called  the  fundamental  equa¬ 
tion  of  stellar  statistics.  Its  left-hand  side,  the 
number  of  stars  in  the  apparent  magnitude  inter¬ 
val  [m  —  0.5,  m  +  0.5]  in  the  solid  angle  co ,  is 
obtained  from  the  observations:  one  counts  the 
stars  of  different  magnitudes  in  a  chosen  area  of 
a  photographic  plate.  The  luminosity  function  is 
known  from  the  solar  neighbourhood.  The  extinc¬ 
tion  A(r)  can  be  determined  for  the  chosen  areas, 
for  instance,  by  means  of  multicolour  photome¬ 
try.  In  order  to  solve  the  integral  equation  (18.5) 
for  D(r,l,b ),  several  methods  have  been  devel¬ 
oped,  but  we  shall  not  go  into  them  here. 

Figure  18.10(a)  shows  the  stellar  density  in  the 
solar  neighbourhood  in  the  plane  of  the  Milky 
Way,  and  Fig.  18.10(b)  in  the  direction  perpen¬ 
dicular  to  the  plane.  There  are  several  individual 
concentrations,  but  e.g.  spiral  structure  cannot  be 
observed  in  such  a  limited  region  of  space. 


The  Distribution  of  Bright  Objects  Using 
stellar  statistical  methods,  one  can  only  study 
the  close  neighbourhood  of  the  Sun,  out  to  about 
1  kpc  at  the  most.  Absolutely  faint  objects  can¬ 


not  be  observed  very  far.  Since  the  solar  neigh¬ 
bourhood  appears  to  be  fairly  representative  of 
the  general  properties  of  the  Milky  Way,  its  study 
is  naturally  important,  giving  information  e.g.  on 
the  distributions  and  luminosity  functions  of  stars 
of  various  spectral  types.  However,  in  order  to  get 
an  idea  of  the  larger- scale  structure  of  the  Milky 
Way,  one  has  to  make  use  of  objects  that  are  as 
absolutely  bright  as  possible,  and  which  can  be 
observed  even  at  large  distances. 

Examples  of  suitable  objects  are  stars  of  early 
spectral  types,  HII  regions,  OB  associations, 
open  star  clusters,  cepheids,  RRFyrae  stars,  su¬ 
pergiants  and  giants  of  late  spectral  types,  and 
globular  clusters.  Some  of  these  objects  differ 
greatly  in  age,  such  as  the  young  OB  associations, 
on  the  one  hand,  and  the  old  globular  clusters,  on 
the  other.  Any  differences  in  their  space  distribu¬ 
tion  tell  us  about  changes  in  the  general  structure 
of  the  Milky  Way. 

The  young  optical  objects,  the  HII  regions, 
OB  associations  and  open  clusters,  are  strongly 
concentrated  in  the  plane  of  the  Milky  Way  (Ta¬ 
ble  18.1).  Figure  18.11  shows  that  they  also  ap¬ 
pear  to  be  concentrated  in  three  drawn-out  bands, 
at  least  within  the  observed  region.  Since  these 
types  of  objects  in  other  galaxies  are  known  to 
be  part  of  a  spiral  structure,  the  observed  bands 
in  the  Milky  Way  have  been  interpreted  as  por¬ 
tions  of  three  spiral  arms  passing  through  the  so¬ 
lar  neighbourhood.  Stars  of  later  spectral  types 
seem  to  be  much  more  evenly  distributed.  Apart 
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Table  18.1  Populations  of  the  Milky  Way;  z  is  the  vertical  distance  from  the  galactic  plane,  and  vr  the  velocity  com¬ 
ponent  perpendicular  to  the  galactic  plane 


Population 

Typical  objects 

Average  age 
[109  a] 

z  [pc] 

vr  [k/s] 

Metal 

abundance 

Halo  population  II 

Subdwarfs,  globular  clusters  RR  Lyr 
{P  >  0.4  d) 

14-12 

2000 

75 

0.001 

Intermediate  population  II 

Long  period  variables 

12-10 

700 

25 

0.005 

Disc  population 

Planetary  nebulae,  novae  bright  red 
giants 

12-2 

400 

18 

0.01-0.02 

Old  population  I 

A  stars,  Me  dwarfs  classical  cepheids 

2-0.1 

160 

10 

0.02 

Young  population  I 

Gas,  dust,  supergiants,  T  Tau  stars 

0.1 

120 

0.03-0.04 

Fig.  18.11  The 

distribution  of  various 
objects  in  the  galactic 
plane.  Three  condensations 
can  be  discerned:  the 
Sagittarius  arm  {lowest), 
the  local  arm  near  the  Sun 
and  {outermost)  the 
Perseus  arm 


180°  0  O-BO  associations 


from  a  few  special  directions,  interstellar  dust 
limits  observations  in  the  galactic  plane  to  within 
3-4  kpc. 

Old  objects,  particularly  the  globular  clusters, 
have  an  almost  spherical  distribution  about  the 
centre  of  the  Milky  Way  (Fig.  18.12).  The  space 
density  of  old  objects  increases  towards  the  galac¬ 
tic  centre.  They  can  be  used  to  determine  the  dis¬ 
tance  of  the  Sun  from  the  galactic  centre;  the 
value  of  this  distance  is  about  8.5  kpc.  More 
recent  measurements  using  other  methods  have 
given  a  distance  of  8  kpc  (26,000  light  years). 

Stellar  Populations  Studies  of  the  motions  of 
the  stars  in  the  Milky  Way  have  revealed  that  the 


orbits  of  stars  moving  in  the  galactic  plane  are  al¬ 
most  circular.  These  stars  are  also  usually  young, 
a  few  hundred  million  years  at  the  most.  They 
also  contain  a  relatively  large  amount  of  heavy 
elements,  about  2-4  %.  The  interstellar  mate¬ 
rial  similarly  moves  in  the  galactic  plane  in  al¬ 
most  circular  orbits.  On  the  basis  of  their  motions 
and  their  chemical  composition,  the  interstellar 
medium  and  the  youngest  stars  are  collectively 
referred  to  as  population  /. 

Outside  the  plane  of  the  Milky  Way,  an  almost 
spherically  symmetric  halo  extends  out  to  over 
50  kpc  and  even  further  out  there  is  a  corona.  The 
stellar  density  is  largest  near  the  galactic  centre 
and  decreases  outwards.  The  halo  contains  very 
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Fig.  18.12  The  ^qq 

distribution  of  globular 
clusters.  (From  S.R. 
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F.  Sanchez  (Eds.)  Globular 
Clusters,  Cambridge 
University  Press,  1999) 
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little  interstellar  matter,  and  its  stars  are  old,  per¬ 
haps  up  to  13  x  109  years.  These  stars  are  also 
very  metal-poor.  Their  orbits  may  be  very  ec¬ 
centric  and  show  no  preference  for  the  galactic 
plane.  On  the  basis  of  these  criteria,  one  defines 
stars  of  population  II.  Typical  population  II  ob¬ 
jects  are  the  globular  clusters,  and  the  RRLyrae 
and  W  Virginis  stars. 

While  the  Sun  and  the  closest  stars  move  much 
like  the  stars  of  population  I,  the  stars  of  popula¬ 
tion  II  have  large  velocities  with  respect  to  the  lo¬ 
cal  standard  of  rest,  up  to  more  than  300  kms-1. 
In  reality  their  velocities  at  the  solar  distance 
from  the  galactic  centre  are  quite  small  and  may 
sometimes  be  opposite  to  the  direction  of  mo¬ 
tion  of  the  LSR.  The  large  relative  velocities 
only  reflect  the  motion  of  the  LSR  with  a  veloc¬ 
ity  of  about  220  kms-1  round  the  galactic  cen¬ 
tre. 

Between  these  two  extremes,  there  is  a  se¬ 
quence  of  intermediate  populations.  In  addition 
to  populations  I  and  II,  one  generally  also  speaks 
of  a  disk  population,  including  the  Sun,  for  in¬ 
stance.  The  typical  motions,  chemical  compo¬ 
sition  and  age  of  the  various  populations  (Ta¬ 
ble  18.1)  contain  information  about  the  evolu¬ 
tion  of  our  Galaxy  and  about  the  formation  of  its 
stars. 


1 8.3  The  Rotation  of  the  Milky  Way 

Differential  Rotation.  Oort’s  Formulas  The 

flatness  of  the  Milky  Way  is  already  suggestive 
of  a  general  rotation  about  an  axis  normal  to  the 
galactic  plane.  Observations  of  the  motions  both 
of  stars  and  of  interstellar  gas  have  confirmed 
this  rotation  and  shown  it  to  be  differential.  This 
means  that  the  angular  velocity  of  rotation  de¬ 
pends  on  the  distance  from  the  galactic  centre 
(Fig.  18.13).  Thus  the  Milky  Way  does  not  ro¬ 
tate  like  a  rigid  body.  Near  the  Sun,  the  rotational 
velocity  decreases  with  radius. 

The  observable  effects  of  the  galactic  rotation 
were  derived  by  the  Dutch  astronomer  Jan  H. 
Oort.  Let  us  suppose  the  stars  are  moving  in  cir¬ 
cular  orbits  about  the  galactic  centre  (Fig.  18.14). 
This  approximation  is  acceptable  for  population  I 
stars  and  gas.  The  star  S ,  seen  from  the  Sun  O 
at  galactic  longitude  l  at  distance  r,  has  circular 
velocity  V  at  a  distance  R  from  the  centre.  Sim¬ 
ilarly  for  the  Sun  the  galactic  radius  and  velocity 
are  Ro  and  Vo-  The  relative  radial  velocity  vT  of 
the  star  with  respect  to  the  Sun  is  the  difference 
between  the  projections  of  the  circular  velocities 
on  the  line  of  sight: 

vT  =  V  cos  a  —  Vq  sin/,  (18.6) 
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a)  Velocity  w.r.t  the  Milky  Way 


d)  Tangential  velocity 


Fig.  18.13  The  effect  of  differential  rotation  on  the  ra¬ 
dial  velocities  and  proper  motions  of  stars,  (a)  Near  the 
Sun  the  orbital  velocities  of  stars  decrease  outwards  in  the 
Galaxy,  (b)  The  relative  velocity  with  respect  to  the  Sun 
is  obtained  by  subtracting  the  solar  velocity  from  the  ve¬ 


locity  vectors  in  (a),  (c)  The  radial  components  of  the  ve¬ 
locities  with  respect  to  the  Sun.  This  component  vanishes 
for  stars  on  the  same  orbit  as  the  Sun.  (d)  The  tangential 
components  of  the  velocities 


Fig.  18.14  In  order  to  derive  Oort’s  formulas,  the  veloc¬ 
ity  vectors  of  the  Sun  and  the  star  S  are  divided  into  com¬ 
ponents  along  the  line  O  S  and  normal  to  it 


where  a  is  the  angle  between  the  velocity  vector 
of  the  star  and  the  line  of  sight.  From  Fig.  18.14 
the  angle  CSQ  =  a  +  90°.  By  applying  the  sine 


theorem  to  the  triangle  CSQ  one  obtains 

sin  (a  +  90°)  Ro 

sin  l  R 

or 

R0 

cos  a  =  — sin/.  (18.7) 

R 

Denoting  the  angular  velocity  of  the  star  by 
co  =  V/R  and  that  of  the  Sun  by  coo  =  Vo/Ro, 
one  obtains  the  observable  radial  velocity  in  the 
form 

vv  =  Ro(co  —  coo)  sin/.  (18.8) 

The  tangential  component  of  the  relative  ve¬ 
locity  of  the  Sun  and  the  star  is  obtained  as  fol¬ 
lows.  From  Fig.  18.14, 

vt  =  V  sin  a  —  Vo  cos  l  =  Rco  sin  a  —  .Rococos/. 
The  triangle  QCP  gives 

R  sin  a  =  Ro  cos  /  —  r, 

and  hence 

V{  =  Ro(oj  —  ojo)  cos  /  —  cor.  (18.9) 
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Oort  noted  that  in  the  close  neighbourhood  of 
the  Sun  (r  <<C  ^o),  the  difference  of  the  angular 
velocities  will  be  very  small.  Therefore  a  good 
approximation  for  the  exact  equations  (18.8)  and 
(1 8.9)  is  obtained  by  keeping  only  the  first  term  of 
the  Taylor  series  of  co  —  coo  in  the  neighbourhood 
of  R  =  R0  : 


CO  —  COQ  = 


d  co 

dR 


(R  ~  Ro)  + 


R=R0 


Using  co=V/R  and  V (Ro)  =  Vq ,  one  finds 


CO  —  COQ  ^ 


1 


R 


0  L 


dV\ 

dR  /  R=R0 


Vo 


(R 


For  R  &  Ro  r,  the  difference  R  —  Ro  ^ 
— r  cos/.  One  thus  obtains  an  approximate  form 


r  cos  /  sin  l 


r=r0-\ 


or 


vr^Ar  sin  21,  (18.10) 

where  A  is  a  characteristic  parameter  of  the  solar 
neighbourhood  of  the  Galaxy,  the  first  Oort  con¬ 
stant: 


1 

A  =  - 
2 


Vo 


dV 

dR 


r=r0A 


(18.11) 


For  the  tangential  relative  velocity,  one  simi¬ 
larly  obtains,  since  cor  ~  coor: 


vt 


U0 


dV 

dR 


r  cos2  /  —  coor. 


RO  \^/R=R0  J 

Because  2  cos2  /  =  1  +  cos  2/,  this  may  be  written 


U  %  Ar  cos  2 1  +  Br , 


(18.12) 


where  A  is  the  same  as  before  and  B ,  the  second 
Oort  constant ,  is 


1 

B  =  — 
2 


y0  (dv 
+ 


dR 


r=r0A 


(18.13) 


Fig.  18.15  The  velocity  components  due  to  differen¬ 
tial  rotation  according  to  Oort’s  formulas  as  functions 
of  galactic  longitude,  (a)  Radial  velocities  for  objects  at 
a  distance  of  1  and  2  kpc.  (Compare  with  Fig.  18.13.) 
Strictly,  the  longitude  at  which  the  radial  velocity  vanishes 
depends  on  the  distance.  Oort’s  formulas  are  valid  only  in 
the  close  vicinity  of  the  Sun.  (b)  Proper  motions 


Equation  (18.10)  says  that  the  observed  radial 
velocities  of  stars  at  the  same  distance  should 
be  a  double  sine  curve  as  a  function  of  galactic 
longitude.  This  has  been  confirmed  by  observa¬ 
tions  (Fig.  18.15(a)).  If  the  distance  to  the  stars 
involved  is  known,  the  amplitude  of  the  curve  de¬ 
termines  the  value  of  the  Oort  constant  A. 

Independently  of  distance,  the  proper  motions 
of  the  stars  form  a  double  sine  wave  as  a  function 
of  galactic  longitude,  as  seen  in  Fig.  18.15(b). 
The  amplitude  of  the  curve  is  A  and  its  mean 
value,  B. 

In  1927  on  the  basis  of  this  kind  of  analy¬ 
sis,  Oort  established  that  the  observed  motions 
of  the  stars  indicated  a  differential  rotation  of 
the  Milky  Way.  Taking  into  account  an  extensive 
set  of  observational  data,  the  International  As¬ 
tronomical  Union  IAU  has  confirmed  the  present 
recommended  values  for  the  Oort  constants: 


The  proper  motion  n  =  i \/r  is  then  given  by  the  A  =  15kms-1  kpc-1,  B  =  -lOkms-1  kpc-1. 
expression 

The  Oort  constants  obey  some  interesting  re- 
(18.14)  lations.  By  subtracting  (18.13)  from  (18.11),  one 


/a  ^  A  cos  2/  +  B. 
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obtains 

Vo 

A-B  =  —  =oj0.  (18.15) 

Ro 

Adding  (18.13)  and  (18.11)  gives 

/ dV\ 

A+s  =  -{ (18l6) 

Knowing  the  values  of  A  and  B ,  one  can  cal¬ 
culate  the  angular  velocity  coo  =  0. 005 3" /year, 
which  is  the  angular  velocity  of  the  local  standard 
of  rest  around  the  galactic  centre. 

The  circular  velocity  of  the  Sun  and  the  LSR 
can  be  measured  in  an  independent  way  by  us¬ 
ing  extragalactic  objects  as  a  reference.  In  this 
way  a  value  of  about  220  kms-1  has  been  ob¬ 
tained  for  Vo-  Using  (18.15)  one  can  now  calcu¬ 
late  the  distance  of  the  galactic  centre  Ro.  The 
result  is  about  8.5  kpc,  in  good  agreement  with 
the  distance  to  the  centre  of  the  globular  clus¬ 
ter  system.  The  direction  to  the  galactic  centre 
obtained  from  the  distribution  of  radial  veloci¬ 
ties  and  proper  motions  by  means  of  (18.10)  and 
(18.14)  also  agrees  with  other  measurements. 

The  orbital  period  of  the  Sun  in  the  Galaxy  ac¬ 
cording  to  these  results  is  about  2.5  x  108  years. 
Since  the  Sun’s  age  is  nearly  5  x  109  years,  it 
has  made  about  20  revolutions  around  the  galac¬ 
tic  centre.  At  the  end  of  the  previous  revolution, 
the  Carboniferous  period  had  ended  on  Earth  and 
the  first  mammals  would  soon  appear. 

The  Distribution  of  Interstellar  Matter  Ra¬ 
dio  emission  from  interstellar  gas,  in  particular 
that  of  neutral  hydrogen,  is  not  strongly  absorbed 
or  scattered  by  interstellar  dust.  It  can  therefore 
be  used  to  map  the  structure  of  the  Milky  Way  on 
large  scales.  Radio  signals  can  be  detected  even 
from  the  opposite  edge  of  the  Milky  Way. 

The  position  of  a  radio  source,  for  example  an 
H I  cloud,  in  the  Galaxy  cannot  be  directly  deter¬ 
mined.  However,  an  indirect  method  exists,  based 
on  the  differential  rotation  of  the  Galaxy. 

Figure  18.16  is  a  schematic  view  of  a  situation 
in  which  gas  clouds  on  the  circles  P\ ,  P2 , . . .  are 
observed  in  the  direction  /  (—90°  <  l  <  90°).  The 
angular  velocity  increases  inwards,  and  therefore 
the  greatest  angular  velocity  along  the  line  of 


Fig.  18.16  Clouds  P\ ,  P2,  ...  seen  in  the  same  direction 
at  various  distances 

sight  is  obtained  at  the  point  /\,  where  the  line 
of  sight  is  tangent  to  a  circle.  This  means  that  the 
radial  velocity  of  the  clouds  in  a  fixed  direction 
grows  with  distance  up  to  the  maximum  velocity 
at  cloud  Pk : 

^r,max  =  R&  (^  ^o)  ?  (18.17) 

where  Rk  =  Rosin/.  The  distance  of  cloud  Pk 
from  the  Sun  is  r  =  Rocos/.  When  r  increases 
further,  vr  decreases  monotonically.  Figure  18.17 
shows  how  the  observed  radial  velocity  in  a  given 
direction  varies  with  distance  r,  if  the  gas  moves 
in  circular  orbits  and  the  angular  velocity  de¬ 
creases  outwards. 

The  neutral  hydrogen  21  cm  line  has  been  par¬ 
ticularly  important  for  mapping  the  Milky  Way. 
Figure  18.18  gives  a  schematic  view  of  how  the 
hydrogen  spectral  line  is  made  up  of  the  radia¬ 
tion  of  many  individual  concentrations  of  neutral 
hydrogen,  clouds  or  spiral  arms.  The  line  com¬ 
ponent  produced  by  each  cloud  has  a  wavelength 
which  depends  on  the  radial  velocity  of  the  cloud 
and  a  strength  depending  on  its  mass  and  density. 
The  total  emission  is  the  sum  of  these  contribu¬ 
tions. 

By  making  observations  at  various  galactic 
longitudes  and  assuming  that  the  clouds  form  at 
least  partly  continuous  spiral  arms,  the  distribu- 
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Fig.  18.17  The  radial  velocity  as  a  function  of  distance 
(shown  schematically) 

tion  of  neutral  hydrogen  in  the  galactic  plane 
can  be  mapped.  Figure  16.17  shows  a  map  of 
the  Milky  Way  obtained  from  21  cm  line  obser¬ 
vations  of  neutral  hydrogen.  It  appears  that  the 
neutral  hydrogen  is  concentrated  in  spiral  arms. 
However,  interpretation  of  the  details  is  difficult 
because  of  the  uncertainties  of  the  map.  In  order 
to  obtain  the  distances  to  the  gas  clouds,  one  has 
to  know  the  rotation  curve ,  the  circular  velocity 
as  a  function  of  the  galactic  radius.  This  is  deter¬ 
mined  from  the  same  radial  velocity  observations 
and  involves  assumptions  concerning  the  density 
and  rotation  of  the  gas.  The  interpretation  of  the 
spiral  structure  obtained  from  radio  observations 
is  also  still  uncertain.  For  example,  it  is  difficult 
to  fit  the  radio  spiral  structure  to  the  one  obtained 
near  the  Sun  from  optical  observations  of  young 
stars  and  associations. 

The  Rotation,  Mass  Distribution  and  Total 

Mass  of  the  Milky  Way  In  (18.17)  the  galac¬ 
tic  longitude  /  gives  the  galactic  radius  74  of  the 
clouds  with  maximum  radial  velocity.  By  mak¬ 
ing  observations  at  different  longitudes,  one  can 
therefore  use  (18.17)  to  determine  the  angular  ve¬ 
locity  of  the  gas  for  various  distances  from  the 
galactic  centre.  (Circular  motions  must  be  as¬ 
sumed.)  In  this  way,  the  rotation  curve  co  =  co(R) 


Fig.  18.18  Clouds  at  different  distances  have  different 
velocities  and  therefore  give  rise  to  emission  lines  with 
different  Doppler  shifts.  The  observed  flux  density  profile 
0 continuous  curve )  is  the  sum  of  the  line  profiles  of  all 
the  individual  line  profiles  (« dashed  curves).  The  numbers 
of  the  line  profiles  correspond  to  the  clouds  in  the  upper 
picture 

and  the  corresponding  velocity  curve  V  =  V  (R) 
(=  coR)  are  obtained. 

Figure  18.19  shows  the  rotation  curve  of  the 
Milky  Way.  Its  central  part  rotates  like  a  rigid 
body,  i.e.  the  angular  velocity  is  independent  of 
the  radius.  Outside  this  region,  the  velocity  first 
drops  and  then  begins  to  rise  gradually.  A  maxi¬ 
mum  velocity  is  reached  at  about  8  kpc  from  the 
centre.  Near  the  Sun,  a  little  further  from  the  cen¬ 
tre,  the  rotational  velocity  is  about  220  kms-1. 
According  to  earlier  opinions,  the  velocity  con¬ 
tinues  to  decrease  outwards.  This  would  mean 
that  most  of  the  mass  is  inside  the  solar  radius. 
This  mass  could  then  be  determined  from  Ke¬ 
pler’s  third  law.  According  to  (6.34), 

M  =  R0V^/G. 

Using  the  values  Rq  =  8.5  kpc  and  Vo  = 
220  km  s- 1 ,  one  obtains 

AT  =  1.9  x  1041  kg  =  1.0  x  1011  Mq. 
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Fig.  18.19  Rotation  curve 
of  the  Milky  Way  based  on 
the  motions  of  hydrogen 
clouds.  Each  point 
represents  one  cloud.  The 
thick  line  represents  the 
rotation  curve  determined 
by  Maarten  Schmidt  in 
1965.  If  all  mass  were 
concentrated  within  the 
radius  20  kpc,  the  curve 
would  continue  according 
to  Kepler’s  third  law 
( broken  line).  The  rotation 
curve  determined  by  Leo 
Blitz  on  the  basis  of  more 
recent  observations  begins 
to  rise  again  at  12  kpc 


V 


The  escape  velocity  at  radius  R  is 

[2GM  r 

Ve  =  y  — =  vV2.  (18.18) 

This  gives  an  escape  velocity  near  the  Sun  Ve  = 
310  kms-1.  One  therefore  should  not  see  many 
stars  moving  in  the  direction  of  galactic  rotation, 
l  =  90°,  with  velocities  larger  than  90  kms-1 
with  respect  to  the  local  standard  of  rest,  since  the 
velocity  of  such  stars  would  exceed  the  escape  ve¬ 
locity.  This  has  been  confirmed  by  observations. 

The  preceding  considerations  have  been  based 
on  the  assumption  that  near  the  Sun,  the  whole 
mass  of  the  Galaxy  can  be  taken  to  be  concen¬ 
trated  in  a  central  point.  If  this  were  true,  the 
rotation  curve  should  be  of  the  Keplerian  form, 
V  oc  R~1/2.  That  this  is  not  the  case  can  be  estab¬ 
lished  from  the  values  of  the  Oort  constants. 

The  derivative  of  the  Keplerian  relation 

v  =  =  Vgmr~1/2 

yields 

d  V  1  r -  o/?  1  V 

—  =  — Vgmr~3/2  = - . 

dR  2  2  R 

Using  the  properties  (18.15)  and  (18.16)  of  the 
Oort  constants,  one  finds 

(18.19) 


for  a  Keplerian  rotation  curve.  This  disagrees 
with  the  observed  value  and  thus  the  assumed  Ke¬ 
plerian  law  does  not  apply. 

The  mass  distribution  in  the  Milky  Way  can 
be  studied  on  the  basis  of  the  rotation  curve.  One 
looks  for  a  suitable  mass  distribution,  such  that 
the  correct  rotation  curve  is  reproduced.  Recently 
distant  globular  clusters  have  been  discovered, 
showing  that  the  Milky  Way  is  larger  than  ex¬ 
pected.  Also,  observations  of  the  rotation  curve 
outside  the  solar  circle  suggest  that  the  rotational 
velocity  might  begin  to  rise  again.  These  results 
suggest  that  the  mass  of  the  Galaxy  might  be  as 
much  as  ten  times  larger  than  had  been  thought. 


1 8.4  Structural  Components  of  the 
Milky  Way 

We  have  seen  how  the  structure  of  the  Milky  Way 
can  be  globally  described  by  means  of  an  almost 
spherical  halo  of  old  stars  and  a  disk  of  gas  and 
young  and  middle-aged  stars.  Due  to  its  structure 
the  Milky  Way  belongs  to  disk  galaxies,  which 
are  discussed  in  the  next  chapter.  In  a  more  de¬ 
tailed  picture  more  small-scale  features  can  also 
be  distinguished. 

The  Thick  Disk  In  the  traditional  scheme  di¬ 
viding  the  stars  of  the  Milky  Way  into  a  se¬ 
ries  of  populations  it  was  left  undecided  whether 


(A-B)/{A  +  B)  =  2 
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the  populations  should  be  considered  as  qualita¬ 
tively  different  classes  or  merely  steps  along  a 
continuous  sequence.  As  the  quality  and  quan¬ 
tity  of  observations  have  improved,  it  has  become 
clear  that  what  was  defined  as  the  intermediate 
population  II  really  represents  a  separate  compo¬ 
nent  of  the  Milky  Way  with  a  pattern  of  element 
abundances  stellar  motions  that  separate  it  clearly 
from  the  old  (thin)  disk.  This  population  is  now 
referred  to  as  the  thick  galactic  disk.  A  thick  disk 
has  also  been  detected  in  some  other  galaxies,  but 
it  does  not  appear  to  be  a  universal  feature  of  all 
disk  galaxies. 

The  Galactic  Bar  As  will  be  seen  in  the  next 
chapter,  Sect.  19.1,  a  large  fraction  of  all  disk 
galaxies  are  barred ,  with  an  elongated  light  distri¬ 
bution  at  the  centre.  The  first  indication  that  this 
might  also  be  the  case  for  the  Milky  Way  was 
found  in  velocity  measurements  of  neutral  hydro¬ 
gen,  which  were  incompatible  with  gas  moving 
along  circular  orbits.  In  1971  W.W.  Shane  showed 
that  the  motions  of  the  gas  could  be  explained, 
if  there  is  a  central  bar  pointing  about  20°  away 
from  the  direction  of  the  Galactic  centre. 

It  is  more  difficult  to  detect  a  bar  in  observa¬ 
tions  of  the  stars.  This  was  first  done  using  the 
COBE  satellite,  which  (apart  from  mapping  the 
cosmic  microwave  background,  see  Sect.  19.7) 
also  made  a  map  of  the  sky  at  the  infrared  wave¬ 
lengths  dominated  by  the  light  of  old  stars.  Be¬ 
cause  of  perspective,  the  nearer  end  of  the  bar  at 
positive  galactic  longitude  will  look  slightly  dif¬ 
ferent  than  the  farther  end.  Such  an  asymmetry 
was  present  in  the  infrared  map,  consistent  with 
a  bar  with  an  axial  ratio  of  0.6. 

Later  confirmation  of  the  existence  of  the  bar 
has  come  from  mapping  of  the  central  distribu¬ 
tion  of  old  stars  using  near-infrared  photometric 
distances,  see  Fig.  18.20. 

Spiral  Structure  As  mentioned  earlier,  the 
Milky  Way  appears  to  be  a  spiral  galaxy 
(Fig.  18.21),  but  there  is  no  general  agreement  on 
the  detailed  form  of  the  spiral  pattern.  For  exam¬ 
ple,  in  1976  Y.M.  Georgelin  and  Y.P.  Georgelin 
determined  the  distances  of  HII  regions  by  ra¬ 
dio  and  optical  observations.  In  the  optical  region 


Fig.  18.20  The  Milky  Way  bar  viewed  from  the  North 
Galactic  pole.  Solid  symbols  indicate  the  mean  positions 
of  red  clump  giants  in  given  directions,  and  the  thick 
grey  lines  represent  the  range  of  their  distances.  The  line 
through  the  mean  distances  illustrates  a  bar  of  3  kpc  length 
inclined  22.5°  to  the  direction  to  the  Galactic  centre. 
The  contour  map  shows  a  model  of  the  bar  derived  from 
infrared  observations.  (C.  Babusiaux,  G.  Gilmore  2005, 
MNRAS  358,  1309,  Fig.  6) 


-10  -5  o  h  10 

X  (kpc) 

Fig.  18.21  General  view  of  the  spiral  pattern  of  the  Milky 
Way.  Different  tracers  of  spiral  arms  lead  to  somewhat  dif¬ 
ferent  patterns,  but  they  tend  to  agree  that  a  four-armed 
pattern  like  the  one  indicated  here  gives  the  best  over¬ 
all  representation.  The  names  of  the  arms  are  those  most 
commonly  used.  See  also  Fig.  18.11.  (Y.  Xu  et  al.  2006, 
Science  311,  54) 
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Fig.  1 8.22  Left :  The  central  parts  of  the  Milky  Way  in  a 
radio  picture.  The  observations  were  made  with  the  VLA 
telescope.  (Image  Kassim  et  al.,  Naval  Research  Labora¬ 
tory.)  Right :  The  image  by  the  Chandra  satellite  shows  the 

their  method  is  independent  of  assumptions  about 
the  galactic  rotation  law.  They  then  fitted  four  spi¬ 
ral  arms  through  the  H II  regions. 

Later  investigations  using  a  variety  of  meth¬ 
ods,  both  optical  and  radio  have  confirmed  that 
a  four-armed  pattern  gives  the  best  descrip¬ 
tion  of  the  spiral  structure  in  the  Sun’s  vicinity 
(Fig.  18.22).  The  pitch  angle  of  the  spiral  in  this 
model  is  about  11.3°.  Three  of  the  arms  start  at 
the  position  of  the  galactic  bar. 

The  cause  of  the  spiral  structure  is  a  long¬ 
standing  problem.  A  small  perturbation  in  the 
disk  will  quickly  be  stretched  into  a  spiral  shape 
by  differential  rotation.  However,  such  a  spiral 
would  disappear  in  a  few  galactic  revolutions, 
a  few  hundred  million  years. 

An  important  step  forward  in  the  study  of  the 
spiral  structure  was  the  density  wave  theory  de¬ 
veloped  by  Chia-Chiao  Lin  and  Frank  H.  Shu 
in  the  1960’s.  The  spiral  structure  is  taken  to  be 
a  wavelike  variation  of  the  density  of  the  disk. 
The  spiral  pattern  rotates  as  a  solid  body  with  an 
angular  velocity  smaller  than  that  of  the  galactic 
rotation,  while  the  stars  and  gas  in  the  disk  pass 
through  the  wave. 


surroundings  of  the  supermassive  black  hole  SgraA*  at  the 
centre  of  the  Milky  Way.  The  width  of  the  picture  is  8.4r, 
corresponding  20  parsecs  at  the  distance  of  the  centre  of 
the  Milky  Way.  (NASA/CXC/MIT/F.K.  Baganoff  et  al.) 

The  density  wave  theory  explains  in  a  natural 
way  why  young  objects,  like  molecular  clouds, 
H II  regions  and  bright  young  stars  are  found  in 
the  spiral  arms.  As  gas  passes  through  the  wave, 
it  is  strongly  compressed.  The  internal  gravity  of 
the  gas  clouds  then  becomes  more  important  and 
causes  them  to  collapse  and  form  stars. 

It  takes  about  107  years  for  the  material  to  pass 
through  a  spiral  arm.  By  that  time,  the  hot,  bright 
stars  have  finished  their  evolution,  their  ultravio¬ 
let  radiation  has  ceased  and  the  H II  regions  have 
disappeared.  The  less  massive  stars  formed  in  the 
spiral  arms  are  spread  out  in  the  disk  by  their  pe¬ 
culiar  velocities. 

It  is  not  yet  clear  what  gives  rise  to  the  spiral 
wave.  For  some  further  discussion  of  spiral  struc¬ 
ture,  see  Sect.  19.4. 

The  Galactic  Centre  Our  knowledge  of  the 
centre  of  the  Milky  Way  is  mostly  based  on  radio 
and  infrared  observations  (Fig.  18.22).  In  the  op¬ 
tical  region  the  view  towards  the  centre  is  blocked 
already  by  the  dark  clouds  in  the  Sagittarius  spiral 
arm  about  2  kpc  from  us.  The  galactic  centre  is  in¬ 
teresting  because  it  may  be  a  small-scale  version 
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of  the  much  more  violently  active  nuclei  of  some 
external  galaxies  (see  Sect.  19.7).  It  therefore  pro¬ 
vides  opportunities  to  study  at  close  hand  phe¬ 
nomena  related  to  active  galaxies.  At  the  galac¬ 
tic  centre  there  us  supermassive  black  hole  with  a 
mass  of  nearly  5  x  10 6  Mo 

As  one  approaches  the  galactic  centre  the  stel¬ 
lar  density  continues  to  rise  towards  a  sharp  cen¬ 
tral  peak.  In  contrast,  the  galactic  gas  disk  has 
a  central  hole  of  radius  about  3  kpc.  This  may 
be  due  to  the  galactic  bar,  which  will  channel  gas 
into  the  galactic  nucleus  leaving  a  gas-free  zone 
at  larger  radii. 

Inside  the  central  hole  there  is  a  dense  gas 
disk.  Its  radius  is  about  1.5  kpc  in  neutral  hy¬ 
drogen,  but  most  of  its  mass  is  molecular  and 
concentrated  within  300  pc  of  the  nucleus.  In 
this  region  the  mass  of  molecular  gas  is  about 
108  Mq,  or  5  %  of  the  total  molecular  mass  of 
the  Milky  Way.  The  molecular  clouds  are  prob¬ 
ably  confined  by  the  pressure  from  surrounding 
very  hot  (T  ~  108  K)  gas.  This  hot  gas  may  then 
expand  vertically,  forming  a  galactic  wind.  Gas 
lost  to  a  wind  or  to  star  formation  is  replenished 
with  infalling  gas  from  larger  radii. 

The  central  10  pc  are  dominated  by  the  ra¬ 
dio  continuum  source  SgrA  and  a  dense  star 
cluster  observed  in  the  infrared.  There  is  also 
molecular  gas  with  complex  motions  and  signs 
of  star  formation  activity.  Within  SgrA  there 
is  a  unique  point-like  radio  continuum  source 
known  as  SgrA*.  The  position  of  SgrA*  agrees 
to  within  i"  with  the  centre  of  a  cluster  of  stars 
that  is  much  denser  than  anything  observed  in  the 
galactic  disk. 

The  stellar  density  of  the  central  star  cluster 
can  be  as  high  as  3  x  1014  stars  per  cubic  parsec, 
while  around  the  Sun  it  is  just  one  star  per  cubic 
parsec.  The  stars  of  the  cluster  orbit  the  central 
black  hole.  Motions  of  the  innermost  stars  have 
been  followed  since  the  end  of  the  1990’s.  A  cou¬ 
ple  of  stars  have  already  managed  to  make  a  full 
revolution  around  the  centre  (Fig.  18.23),  the  pe¬ 
riods  being  16  and  1 1.5  years.  When  the  period  is 
known  and  the  size  of  the  orbit  is  seen  the  mass  of 
the  central  body  can  be  calculated.  The  researches 
working  with  the  Keck  telescope  got  in  2014  a 
value  of  4.6  db  0.7  x  106  solar  masses. 


Fig.  18.23  Motions  of  stars  near  the  centre  of  the  Milky 
Way  during  over  15  years.  The  background  image  was 
taken  in  2014  with  the  Keck  telescope.  (Kuva  Andrea 
Ghez/UCLAAV.M.  Keck  Telescopes) 

According  to  VLB  I  measurements  the  diam¬ 
eter  of  the  accretion  disk  surrounding  the  black 
hole  is  less  than  10  au.  Eruptions  observed  at 
infrared  wavelengths,  lasting  only  about  twenty 
minutes,  point  to  the  same  size.  It  can  be  calcu¬ 
lated  that  the  diameter  of  the  black  hole  is  about 
0.2  au. 


1 8.5  The  Formation  and  Evolution 
of  the  Milky  Way 

Like  all  galaxies  the  Milky  Way  is  supposed  to 
have  formed  through  the  collapse  of  a  region  of 
higher  than  average  density  in  the  Universe.  The 
general  view  of  how  this  happened  will  be  pre¬ 
sented  in  the  next  two  chapters, in  particular  in 
Sects.  19.8  and  20.7.  Some  traces  of  the  forma¬ 
tion  process  will  be  preserved  in  the  properties  of 
local  stars  of  different  ages.  These  represent  in¬ 
formation  about  the  formation  of  the  Milky  Way 
that  is  not  available  for  other  galaxies. 

The  Ages  of  Stars  The  most  direct  way  of 
studying  the  evolution  of  the  Milky  Way  is  by 
means  of  the  observed  ages  of  stars.  The  tra¬ 
ditional  sequence  of  populations  described  in 
Sect.  18.2  is  also  a  sequence  corresponding  to 


384 


18  The  Milky  Way 


Fig.  18.24  The 

age-metallicity  relation  of 
the  Milky  Way  for  different 
components.  TDO:  thin 
disc  open  clusters;  TDG: 
thick  disc  globulars; 

B:  bulge;  YHG:  young 
halo  globulars;  OHG:  old 
halo  globulars.  The  shaded 
areas  correspond  to  thin 
disc  field  stars,  thick  disc 
field  stars,  and  halo  field 
stars,  in  order  of  increasing 
age.  (K.  Freeman, 

J.  Bland-Hawthom  2002, 
ARAA  40,  487,  Fig.  2) 


stars  of  different  ages.  The  oldest  component,  the 
stellar  halo  forms  an  almost  spherical  distribu¬ 
tion  of  stars  with  ages  12-14  Ga,  representing  the 
oldest  part  of  the  Milky  Way. 

In  contrast,  the  old  and  young  population  I 
consists  of  stars  with  ages  less  than  10  Ga,  which 
were  initially  formed  in  a  very  thin  layer  that  has 
been  thickened  by  encounters  with  spiral  arms 
and  molecular  cloud  complexes. 

Intermediate  between  these  two,  as  we  have 
noted,  is  the  thick  disk  with  an  age  about  10- 
12  Ga.  Another  intermediate  population  in  the 
Milky  Way  is  the  central  bulge  including  the 
galactic  bar,  and  containing  stars  of  ages  7-11  Ga. 

Chemical  Enrichment  The  history  of  the  for¬ 
mation  of  the  Milky  Way  is  preserved  in  the  prop¬ 
erties  of  its  older  stars,  above  all  in  their  “chemi¬ 
cal”  composition,  i.e.  the  abundances  of  elements 
and  isotopes  heavier  than  helium.  (All  these  heav¬ 
ier  elements  are  often  referred  to  as  “metals”  in 
the  astrophysical  jargon.) 

Only  hydrogen  and  helium  were  present  as  the 
first  stars  formed.  As  successive  generations  of 
stars  evolved,  nuclear  reactions  produced  heavy 
elements,  some  of  which  were  returned  to  the  in¬ 
terstellar  gas  in  supernova  explosions  or  stellar 
winds.  These  heavy  elements  were  then  incorpo¬ 
rated  into  the  following  generation  of  stars,  grad¬ 


ually  increasing  the  metal  abundance  in  the  inter¬ 
stellar  medium. 

Some  of  the  stars  formed  were  of  low  mass, 
and  had  life-times  long  enough  that  they  are  still 
present.  Their  chemical  composition  will  reflect 
the  abundances  in  the  interstellar  medium  at  the 
time  they  were  formed.  Studying  the  chemical 
abundances  of  stars  of  different  ages  therefore 
gives  information  on  the  star  formation  history  of 
the  Milky  Way,  both  on  the  rate  at  which  stars 
have  formed  at  a  given  epoch  and  on  the  masses 
and  other  characteristics  of  those  stars  when  they 
formed. 

One  commonly  used  indicator  of  the  metal 
abundance  in  stars  is  the  mass  of  iron  relative 
to  hydrogen,  in  logarithmic  units  and  measured 
relative  to  the  solar  values,  [Fe/H].  Figure  18.24 
shows  the  value  of  [FE/H]  for  different  types  of 
objects  of  various  ages.  The  general  picture  is 
that  the  metal  abundance  rose  rapidly  during  the 
first  billion  years  and  has  afterwards  grown  only 
slowly.  The  lowest  values  of  [Fe/H]  discovered  in 
old  halo  stars  are  about  —5. 

Many  models  have  been  developed  describ¬ 
ing  the  chemical  evolution  of  the  Milky  Way 
and  other  galaxies,  incorporating  prescriptions 
for  star  formation,  and  infall  of  gas  from  the 
outside.  In  particular  the  rapid  early  rise  of  the 
metal  abundance  is  difficult  to  explain  in  the  sim- 
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plest  models.  This  difficulty,  the  scarcity  of  old 
metal-poor  disk  stars,  has  become  known  as  the 
“G  dwarf  problem”,  since  the  oldest  stars  still 
on  the  main  series  are  of  spectral  class  G.  The 
most  direct  way  that  the  G  dwarf  problem  can  be 
avoided  is  to  suppose  that  a  large  portion  of  the 
interstellar  gas  was  accreted  after  the  formation 
of  the  oldest  stars. 

Formation  of  the  Milky  Way  Galaxies  are 
thought  to  have  formed  when  gas  clouds  of  higher 
than  average  density  in  the  Universe  collapsed 
under  their  own  gravity.  As  the  gas  is  compressed 
stars  are  born  in  it.  After  the  end  of  the  collapse 
the  cloud  settles  into  a  quasi- stationary  state. 
Evolution  continues  at  a  slower  pace.  Evolving 
stars  return  chemically  enriched  gas  to  interstel¬ 
lar  space,  where  it  is  mixed  with  remaining,  un¬ 
processed  gas  and  star  formation  continues. 

There  are  two  competing  pictures  of  how 
galaxy  formation  proceeds.  In  the  monolithic  col¬ 
lapse  model  it  is  assumed  that  large  galaxies  form 
coherently  in  the  collapse  of  a  massive  cloud  con¬ 
taining  the  bulk  of  the  material  constituting  the 
galaxy.  In  the  hierarchical  picture  most  of  the 
stars  form  in  much  smaller  clouds,  which  later  ag¬ 
glomerate  together  to  form  the  galaxies  we  now 
observe.  The  evidence  for  these  pictures  in  other 
galaxies  will  be  considered  in  the  next  chapter. 

The  case  of  the  Milky  Way  already  shows  that 
the  contrast  of  the  monolithic  and  the  hierarchical 
picture  is  oversimplified.  Some  aspects  of  galac¬ 
tic  structure  fit  more  naturally  into  a  monolithic 
theory.  For  example,  the  rapid  collapse  of  the 
halo,  followed  by  a  more  gradual  build-up  of  the 
disk  fits  with  this  picture.  The  pattern  of  chemical 
abundances  are  also  homogeneous  in  a  way  that  is 
more  compatible  with  a  pattern  of  star  formation 
that  was  uniform  throughout  galactic  history. 

Other  observations  are  indicative  of  a  hier¬ 
archical  formation  history.  For  example,  stellar 
abundance  patterns  in  the  thick  disk  differ  from 
those  in  the  old  thin  disk.  The  most  natural  way  of 
explaining  this  is  that  the  thick  disk  formed  by  the 
accretion  of  one  or  more  small  satellite  galaxies 
with  different  histories  of  star  formation.  Another 
sign  of  the  importance  of  the  infall  of  satellite 
galaxies  is  the  existence  of  systems  such  as  the 


Sagittarius  dwarf  galaxy,  which  appears  to  be  in 
the  course  of  being  disrupted  by  the  Milky  Way. 

There  is  about  five  times  as  much  dark  matter 
as  visible  matter.  Recent  simulations  have  con¬ 
firmed  that  this  dark  matter  has  an  essential  role 
in  the  formation  of  galaxies.  During  the  first  bil¬ 
lion  years  of  the  universe  the  dark  matter  formed 
gravitational  condensates  to  which  the  visible 
matter  was  concentrated  to  form  initial  galax¬ 
ies.  Formation  of  the  current  galaxies  has  been 
a  complicated  process  that  involved  effects  of  the 
dark  matter,  interactions,  collisions  and  mergers 
of  galaxies. 


18.6  Examples 

Example  18.1  Show  that  if  the  stars  are  uni¬ 
formly  distributed  in  space  and  there  is  no  in¬ 
terstellar  extinction,  the  number  of  stars  brighter 
than  apparent  magnitude  m  is 

No(m)  =  N0( 0)  x  10°'6m. 

Fet  us  suppose  first  that  all  stars  have  the  same 
absolute  magnitude  M.  The  distance  in  parsecs  of 
those  stars  having  the  apparent  magnitude  m  is 

r  =  10  x  10a2(m-M). 

In  order  to  appear  brighter  than  m,  a  star  has  to 
be  within  a  sphere  of  radius  r.  Because  the  stellar 
density  is  constant,  the  number  of  such  stars  is 
proportional  to  the  volume: 

No(m)  oc  r3  oc  10°'6m. 

The  result  does  not  depend  on  the  absolute 
magnitudes  of  the  stars,  so  that  the  same  result 
still  applies  when  the  magnitudes  are  not  equal, 
as  long  as  the  luminosity  function  does  not  de¬ 
pend  on  distance.  Thus  the  equation  is  generally 
valid  under  the  stated  conditions. 

Example  18.2  (The  Estimation  of  Distances  by 
Means  of  Oort’s  Formulas)  An  object  in  the 
galactic  plane  at  longitude  /  =  45°  has  the  radial 
velocity  of  30  kms-1  with  respect  to  the  FSR. 
What  is  its  distance? 
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According  to  (18.10), 

vY  =  Ar  sin  2/. 


Thus 

vr  30kms-1 

r  =  — ; - = - t-  =  2  kpc. 

A  sin  2/  15  km  s-1  kpc  1 

In  practice,  the  peculiar  velocities  are  so  large  that 
this  method  cannot  be  used  for  distance  determi¬ 
nation.  Oort’s  formulas  are  mostly  suitable  only 
for  statistical  studies. 

Example  18.3  (Discussion  of  the  Gravitational 
Field  of  a  Uniform  Disk)  It  can  be  shown  that  the 
gravitational  field  of  a  homogeneous  infinite  thin 
disk  is  constant  and  directed  towards  the  plane  of 
the  disk.  If  the  mass  per  unit  area  of  the  disk  is  a , 
the  gravitational  field  is 

g  ~  ItxGo. 

A  test  particle  located  outside  the  plane  will 
therefore  get  a  constant  acceleration  towards 
the  plane.  Taking  a  numerical  example,  assume 
a  mass  of  1011  M©  distributed  uniformly  on  a  cir¬ 
cular  disk  20  kpc  in  diameter.  The  mass  per  unit 
area  will  be  a 

1011  x  2  x  1030  kg 
°  ~~  7T (104  x  3.086  x  1016  m)2 

=  0.67  kgm-2. 

The  corresponding  gravitational  field  is 
g  =  2.8  x  10-10  ms-2. 

Let  a  star  be  located  at  d  =  1  kpc  above  the  plane, 
initially  at  rest  (not  very  near  the  edge  of  the  disk, 
in  order  to  keep  our  approximation  valid).  The 
disk  will  pull  it  towards  the  plane,  and  when  the 
star  crosses  the  plane,  it  has  acquired  a  velocity 
given  by 


The  time  required  to  reach  the  plane  is 
t  —  v/g  =  15  x  106  a. 


18.7  Exercises 

Exercise  18.1  Assume  that  the  Sun  and  a  star 
move  around  the  Galaxy  at  the  same  speed  in  the 
same  circular  orbit  in  the  galactic  plane.  Show 
that  the  proper  motion  of  the  star  is  independent 
of  its  distance.  How  big  is  this  proper  motion? 

Exercise  18.2  (a)  A  cepheid  has  a  radial  velocity 
of  80  kms-1,  and  its  galactic  longitude  is  145°. 
What  is  the  distance  of  the  cepheid? 

(b)  The  period  of  the  cepheid  is  3.16  d,  and 
the  apparent  visual  magnitude  is  12.3.  What  is  the 
distance  derived  from  this  information?  Are  the 
distances  consistent? 

Exercise  18.3  (a)  How  many  of  the  nearest  stars 
(Table  C.  15)  are  also  among  the  brightest  stars 
(Table  C.16)?  Explain. 

(b)  If  the  stellar  density  were  constant,  how 
many  stars  would  there  be  within  the  distance  of 
Canopus? 

Exercise  18.4  (a)  Assume  that  the  Galaxy  is 

a  homogeneous  disk  and  the  Sun  lies  in  the  cen¬ 
tral  plane  of  the  disk.  The  absolute  magnitude  of 
a  star  is  M ,  galactic  latitude  b ,  and  distance  from 
the  central  plane  z.  What  is  the  apparent  magni¬ 
tude  of  the  star,  if  the  extinction  inside  the  Galaxy 
is  a  mag  kpc-1? 

(b)  Assume  that  the  thickness  of  the  galactic 
disk  is  200  pc.  Find  the  apparent  magnitude  of 
a  star  with  M  =  0.0,  b  =  30°,  distance  r  =  1  kpc, 
and  a  =  1  mag  kpc-1. 


v  —  yjlgd  =  1 30  km  s  1 . 
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The  galaxies  are  the  fundamental  building  blocks 
of  the  Universe.  Some  of  them  are  very  simple  in 
structure,  containing  only  normal  stars  and  show¬ 
ing  no  particular  individual  features.  There  are 
also  galaxies  that  are  almost  entirely  made  of  neu¬ 
tral  gas.  On  the  other  hand,  others  are  complex 
systems,  built  up  from  many  separate  interact¬ 
ing  components — stars,  neutral  and  ionised  gas, 
dust,  molecular  clouds,  magnetic  fields,  cosmic 
rays. . . .  The  galaxies  may  form  small  groups  or 
large  clusters  in  space.  At  the  centre  of  many 
galaxies,  there  is  a  compact  nucleus  that  may 
sometimes  be  so  bright  that  it  overwhelms  all  the 
normal  radiation  of  the  galaxy. 

The  luminosity  of  the  brightest  normal  galax¬ 
ies  may  correspond  to  10 12  solar  luminosities, 
but  most  of  them  are  much  fainter — the  smallest 
ones  that  have  been  discovered  are  about  105  L©. 
Since  galaxies  do  not  have  a  sharp  outer  edge, 
to  some  extent  their  masses  and  radii  depend  on 
how  these  quantities  are  defined.  If  only  the  bright 
central  parts  are  included,  a  giant  galaxy  may  typ¬ 
ically  have  a  mass  of  about  1013  M©,  and  a  radius 
of  30  kpc,  and  a  dwarf,  correspondingly,  107  M©, 
and  0.5  kpc. 

When  the  existence  of  dark  matter  was  con¬ 
firmed  the  whole  field  of  galactic  research  was 
changed.  It  was  only  the  dark  matter  that  helped 
to  give  a  consistent  model  for  the  origin  and  evo¬ 
lution  of  galaxies. 

The  fraction  of  dark  matter  of  the  total  mass  of 
galaxies  varies  considerably.  In  big  galaxies  there 
is  about  five  times  as  much  dark  matter  as  ordi¬ 
nary  visible  matter.  In  the  smallest  galaxies  the 


ratio  of  the  dark  matter  to  the  visible  matter  can 
be  hundreds  or  even  thousands. 

When  smaller  and  smaller  galaxies  have  been 
found  the  boundary  between  star  clusters  and 
galaxies  has  become  obscure.  The  smallest  galax¬ 
ies  are  similar  to  dense  star  clusters.  Therefore 
some  researches  have  redefined  galaxies:  a  star 
system  is  a  galaxy  if  in  addition  to  stars  it  con¬ 
tains  also  dark  matter. 


1 9.1  The  Classification  of  Galaxies 

A  useful  first  step  towards  an  understanding  of 
galaxies  is  a  classification  based  on  their  various 
forms.  Although  such  a  morphological  classifica¬ 
tion  must  always  be  to  some  extent  subjective,  it 
provides  a  framework  within  which  the  quanti¬ 
tative  properties  of  galaxies  can  be  discussed  in 
a  systematic  fashion.  However,  it  should  always 
be  remembered  that  the  picture  thus  obtained  will 
be  limited  to  those  galaxies  that  are  large  and 
bright  enough  to  be  easily  visible  in  the  sky.  An 
idea  of  the  consequent  limitations  can  be  obtained 
from  Fig.  19.1,  showing  the  radii  and  magnitudes 
of  normal  galaxies.  One  sees  that  only  within 
a  narrow  region  of  this  diagram  can  galaxies  be 
easily  found.  If  a  galaxy  has  too  large  a  radius  for 
its  magnitude  (small  surface  brightness),  it  will 
disappear  in  the  background  light  from  the  night 
sky.  On  the  other  hand,  if  its  radius  is  too  small, 
it  looks  like  a  star  and  is  not  noticed  on  a  photo¬ 
graphic  plate.  In  the  following,  we  shall  mainly 
be  concerned  with  bright  galaxies  that  fit  within 
these  limits. 

387 


©  Springer- Verlag  Berlin  Heidelberg  2017 

H.  Karttunen  et  al.  (eds.),  Fundamental  Astronomy,  DOI  10.1007/978-3-662-53045-0_19 


388 


19  Galaxies 


If  a  classification  is  to  be  useful,  it  should 
at  least  roughly  correspond  to  important  phys- 


Fig.  1 9.1  Magnitudes  and  diameters  of  observable  extra- 
galactic  objects.  Objects  to  the  upper  left  look  like  stars. 
The  quasars  in  this  region  have  been  discovered  on  the  ba¬ 
sis  of  their  spectra.  Objects  to  the  lower  right  have  a  sur¬ 
face  brightness  much  smaller  than  that  of  the  night  sky. 
In  recent  years  large  numbers  of  low  surface  brightness 
galaxies  have  been  discovered  in  this  region.  (Arp,  H. 
(1965):  Astrophys.  J.  142,  402) 


ical  properties  of  the  galaxies.  Most  classifica¬ 
tions  accord  in  their  main  features  with  the  one 
put  forward  by  Edwin  Hubble  in  1926.  Hubble’s 
own  version  of  the  Hubble  sequence  is  shown 
in  Fig.  19.2.  The  various  types  of  galaxies  are 
ordered  in  a  sequence  from  early  to  late  types. 
There  are  three  main  types:  elliptical ,  lenticu¬ 
lar ,  and  spiral  galaxies.  The  spirals  are  divided 
into  two  sequences,  normal  and  barred  spirals. 
In  addition,  Hubble  included  a  class  of  irregular 
galaxies.  In  lenticular  and  spiral  galaxies  the  mat¬ 
ter  is  concentrated  in  a  flat  disk  and  thus  they  can 
be  called  collectively  as  disk  galaxies. 

The  elliptical  galaxies  (Fig.  19.6)  appear  in  the 
sky  as  elliptical  concentrations  of  stars,  in  which 
the  density  falls  off  in  a  regular  fashion  as  one 
goes  outwards.  Usually  there  are  no  signs  of  in¬ 
terstellar  matter  (dark  bands  of  dust,  bright  young 
stars).  The  ellipticals  differ  from  each  other  only 
in  shape  and  on  this  basis  they  are  classified  as 
E0,  El, . . . ,  E7.  If  the  major  and  minor  axes  of  an 
elliptical  galaxy  are  a  and  b ,  its  type  is  defined  to 
be  Eft,  where 

n  =  Ul-b-\  (19.1) 

An  E0  galaxy  thus  looks  circular  in  the  sky.  The 
apparent  shape  of  an  E  galaxy  depends  on  the 
direction  from  which  it  is  seen.  In  reality  an 
E0  galaxy  may  therefore  be  truly  spherical  or 
it  may  be  a  circular  disk  viewed  directly  from 
above. 


Fig.  19.2  The  Hubble  sequence  in  Hubble’s  1936  ver-  (S  and  Irr  II);  18.5  (Irr  I  and  dE).  (Hubble,  E.P.  (1936): 
sion.  At  this  stage  the  existence  of  type  SO  was  still  The  Realm  of  the  Nebulae  (Yale  University  Press,  New 
doubtful.  Photographs  of  the  Hubble  types  are  shown  in  Haven)) 

Figs.  18.6  and  18.15  (E);  18.3  and  18.4  (SO  and  S);  18.12 
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NGC488  Type  Sab  NGC  628  M74  Type  Sc 


Fig.  19.3  The  classification  of  normal  spiral  and  SO  galaxies.  (Mt.  Wilson  Observatory) 


A  later  addition  to  the  Hubble  sequence  is 
a  class  of  giant  elliptical  galaxies  denoted  cD. 
These  are  generally  found  in  the  middle  of  clus¬ 
ters  of  galaxies.  They  consist  of  a  central  part 
looking  like  a  normal  elliptical  surrounded  by  an 
extended  fainter  halo  of  stars. 

In  the  Hubble  sequence  the  lenticulars  or 
SO  galaxies  are  placed  between  the  elliptical  and 


the  spiral  types.  Like  the  ellipticals  they  contain 
only  little  interstellar  matter  and  show  no  signs 
of  spiral  structure.  However,  in  addition  to  the 
usual  elliptical  stellar  component,  they  also  con¬ 
tain  a  flat  disk  made  up  of  stars.  In  this  respect 
they  are  like  spiral  galaxies  (Figs.  19.3,  19.4). 

The  characteristic  feature  of  spiral  galaxies  is 
a  more  or  less  well-defined  spiral  pattern  in  the 
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NGC  2859 


Type  SBO 


NGC  175 


Type  SBab(s) 


NGC  2523 


Type  SBb(r) 


NGC  1073 


Type  SBc(sr) 


NGC  1300 


Type  SBb(s) 


NGC  2525 


Type  SBc(s) 


Fig.  19.4  Different  types  of  SBO  and  SB  galaxies.  The  type  (r)  or  (s)  depends  on  whether  the  galaxy  has  a  central  ring 
or  not.  (Mt.  Wilson  Observatory) 


disk.  Spiral  galaxies  consist  of  a  central  bulge , 
which  is  structurally  similar  to  an  E  galaxy,  and 
of  a  stellar  disk,  like  in  an  SO  galaxy.  In  addi¬ 
tion  to  these,  there  is  a  thin  disk  of  gas  and  other 
interstellar  matter,  where  young  stars  are  being 
born,  forming  the  spiral  pattern.  There  are  two  se¬ 
quences  of  spirals,  normal  Sa-Sb-Sc,  and  barred 


SBa-SBb-SBc  spirals.  In  the  barred  spirals  the 
spiral  pattern  ends  at  a  central  bar,  whereas  in 
the  normal  spirals  the  spiral  pattern  may  end  at 
an  inner  ring  or  continue  all  the  way  to  the  cen¬ 
tre.  The  position  of  a  galaxy  within  the  spiral  se¬ 
quence  is  determined  on  the  basis  of  three  criteria 
(which  are  not  always  in  agreement):  later  types 
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Fig.  19.5  Above :  The 
Small  Magellanic  Cloud 
(Hubble  type  Irr  I),  a  dwarf 
companion  of  the  Milky 
Way  (Royal  Observatory, 
Edinburgh).  Below.  The 
Sculptor  Galaxy,  a  dE 
dwarf  spheroidal.  (ESO) 


have  a  smaller  central  bulge,  more  narrow  spiral 
arms  and  a  more  open  spiral  pattern.  The  Milky 
Way  Galaxy  is  thought  to  be  of  type  SABbc  (in¬ 
termediate  between  Sb  and  Sc,  and  between  nor¬ 
mal  and  barred  spirals). 

The  classical  Hubble  sequence  is  essentially 
based  on  bright  galaxies;  faint  galaxies  have  been 
less  easy  to  fit  into  it  (Fig.  19.5).  For  example, 
the  irregular  galaxies  of  the  original  Hubble  se¬ 
quence  can  be  divided  into  the  classes  Irr  I  and 


Irr  II.  The  Irr  I  galaxies  form  a  continuation  of 
the  Hubble  sequence  towards  later  types  beyond 
the  Sc  galaxies.  They  are  rich  in  gas  and  contain 
many  young  stars.  Type  Irr  II  are  somewhat  irreg¬ 
ular  small  ellipticals  containing  only  few  young 
stars  but  a  lot  of  interstellar  dust. 

The  smallest  irregular  galaxies  are  classified 
as  dlrr  (dwarf  irregulars).  Another  large  group 
of  dwarf  galaxies  are  the  spherical  or  elliptic 
dwarf  galaxies  dSph  (dwarf  spheroidals)  and  dE 
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Fig.  19.6  M32  (type  E2), 
a  small  elliptical 
companion  of  the 
Andromeda  Galaxy. 
(NOAO/Kitt  Peak 
National  Observatory) 


(dwarf  ellipsoids).  They  are  much  smaller  than 
ordinary  elliptic  galaxies  or  the  stellar  density  is 
much  lower.  Recently  new  kinds  of  even  smaller 
galaxies  have  been  found.  The  include  ultrafaint 
dwarfs  (uFd),  blue  compact  dwarfs  (BCD)  and 
ultracompact  dwarfs  (UCD)  (Fig.  19.7). 

In  addition  to  the  precise  morphological  clas¬ 
sification  a  simpler  classification  based  on  colours 
was  introduced  at  the  beginning  of  the  2000’s. 
The  idea  to  classify  fast  the  millions  of  new 
galaxies  found  by  modern  digital  survey  pro¬ 
grams  as  well  as  the  very  faint  galaxies  whose 
shape  is  difficult  to  distinguish.  When  galaxies 
are  plotted  like  stars  in  a  colour-magnitude  dia¬ 
gram  they  are  seen  to  form  two  distinct  regions: 
red  galaxies  (red  sequence)  and  blue  galaxies 
(blue  sequence)  (Fig.  19.8).  The  red  galaxies  are 
mainly  ellipticals  with  old  stars  and  blue  galaxies 
mainly  spirals  with  younger  stars. 
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Fig.  19.7  The  ultracompact  dwarf  M60-UCD1  is  close  to 
the  galaxy  M60.  The  mass  of  the  dwarf  galaxy  is  200  x  106 
solar  masses  but  the  radius  is  only  100  lightyears.  (Photo 
NASA,  ESA,  CXC,  J.  Strader  (Michigan  State  Univer¬ 
sity)) 


Distances  In  order  to  determine  the  absolute  lu¬ 
minosities  and  linear  dimensions  of  galaxies  one 
needs  to  know  their  distances.  Distances  are  also 
needed  in  order  to  estimate  the  masses  of  galax¬ 
ies,  because  these  estimates  depend  on  the  abso¬ 
lute  linear  size.  Distances  within  the  Local  Group 


can  be  measured  by  the  same  methods  as  in¬ 
side  the  Milky  Way,  most  importantly  by  means 
of  variable  stars.  On  the  very  large  scale  (be¬ 
yond  50  Mpc),  the  distances  can  be  deduced  on 
the  basis  of  the  expansion  of  the  Universe  (see 
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Fig.  19.8  A  colour-magnitude  graph  contains  144,000 
galaxies  from  the  Sloan  digital  sky  survey  (SDSS).  Bright¬ 
ness  increases  from  left  to  right.  Galaxies  are  clearly  di¬ 
vided  into  two  regions,  red  and  blue.  (Blanton  et  al.,  2003, 
ApJ  592,819) 

Sect.  20.1).  In  order  to  connect  these  two  re¬ 
gions  one  needs  methods  of  distance  determina¬ 
tion  based  on  the  properties  of  individual  galax¬ 
ies. 

To  some  extent  local  distances  can  be  deter¬ 
mined  using  structural  components  of  galaxies, 
such  as  the  sizes  of  HII  regions  or  the  mag¬ 
nitudes  of  globular  clusters.  However,  to  mea¬ 
sure  distances  of  tens  of  megaparsecs,  one  needs 
a  distance-independent  method  to  determine  the 
absolute  luminosities  of  entire  galaxies.  Several 
such  methods  have  been  proposed.  For  example, 
a  luminosity  classification  has  been  introduced 
for  late  spiral  types  by  Sidney  van  den  Bergh.  This 
is  based  on  a  correlation  between  the  luminosity 
of  a  galaxy  and  the  prominence  of  its  spiral  pat¬ 
tern.  Nowadays  this  method  is  not  considered  ac¬ 
curate  enough. 

Other  distance  indicators  are  obtained  if  there 
is  some  intrinsic  property  of  the  galaxy,  which  is 
correlated  with  its  total  luminosity,  and  which  can 
be  measured  independently  of  the  distance.  Such 
properties  are  the  colour,  the  surface  brightness 
and  the  internal  velocities  in  galaxies.  All  of  these 
have  been  used  to  measure  distances  to  both  spi¬ 
ral  and  elliptical  galaxies. 


For  example,  the  absolute  luminosity  of  a  gala¬ 
xy  should  depend  on  its  mass.  The  mass,  in  turn, 
will  be  reflected  in  the  velocities  of  stars  and  gas 
in  the  galaxy.  Accordingly  there  is  a  relationship 
between  the  absolute  luminosity  and  the  velocity 
dispersion  (in  ellipticals)  and  the  rotational  ve¬ 
locity  (in  spirals).  Since  rotational  velocities  can 
be  measured  very  accurately  from  the  width  of 
the  hydrogen  21 -cm  line,  the  latter  relationship 
(known  as  the  Tully-Fisher  relation)  is  perhaps 
the  best  distance  indicator  currently  available. 

The  luminosity  of  the  brightest  galaxies  in 
clusters  has  been  found  to  be  reasonably  constant. 
This  fact  can  be  used  to  measure  even  larger  dis¬ 
tances,  providing  a  method  which  is  important  in 
cosmology. 

Luminosities  The  definition  of  the  total  lumi¬ 
nosity  of  a  galaxy  is  to  some  extent  arbitrary, 
since  galaxies  do  not  have  a  sharp  outer  edge.  The 
usual  convention  is  to  measure  the  luminosity  of 
a  galaxy  out  to  a  given  value  of  the  surface  bright¬ 
ness,  e.g.  to  26.5  mag/sq.arcsec.  For  a  given  Hub¬ 
ble  type,  the  total  luminosity  L  may  vary  widely. 

As  in  the  case  of  stars,  the  distribution  of 
galaxy  luminosities  is  described  by  the  luminos¬ 
ity  function  <P(L)  (Fig.  19.9).  This  is  defined  so 
that  the  space  density  of  galaxies  with  luminosi¬ 
ties  between  L  and  L  +  dL  is  @(L)dL.  It  can 
be  determined  from  the  observed  magnitudes  of 
galaxies,  once  their  distances  have  been  estimated 
in  some  way.  In  practice,  one  assumes  some  suit¬ 
able  functional  form  for  @(L ),  which  is  then  fit¬ 
ted  to  the  observations.  One  common  form  is 
Schechter’s  luminosity  function, 

0(L)dL  =  0*(Aye-^*d(A).  (19.2) 

The  values  of  the  parameters  <£*,  L*,  a  are  obser- 
vationally  determined  for  different  types  of  ob¬ 
jects;  in  general,  they  will  be  functions  of  posi¬ 
tion. 

The  shape  of  the  luminosity  function  is  de¬ 
scribed  by  the  parameters  a  and  L*.  The  relative 
number  of  faint  galaxies  is  described  by  a .  Since 
its  observed  value  is  about  —1.1,  the  density  of 
galaxies  grows  monotonically  as  one  goes  to¬ 
wards  fainter  luminosities.  The  luminosity  func¬ 
tion  falls  off  steeply  above  the  luminosity  L*, 
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Fig.  19.9  Compound  luminosity  function  of  thirteen 
clusters  of  galaxies.  The  open  symbols  have  been  ob¬ 
tained  by  omitting  the  cD  galaxies.  The  distribution  is  then 
well  described  by  (19.2).  The  cD  galaxies  (filled  symbols ) 
cause  a  deviation  at  the  bright  end.  (Schechter,  P.  (1976): 
Astrophys.  J.  203,  297) 

which  therefore  represents  a  characteristic  lu¬ 
minosity  of  bright  galaxies.  The  observed  L* 
corresponds  to  an  absolute  magnitude  M*  = 
—21.0  mag.  The  corresponding  magnitude  for  the 
Milky  Way  Galaxy  is  probably  —20.2  mag.  The 
cD  giant  galaxies  do  not  obey  this  brightness  dis¬ 
tribution;  their  magnitudes  may  be  —24  mag  and 
even  brighter. 

The  parameter  0*  is  proportional  to  the  space 
density  of  galaxies  and  is  therefore  a  strong  func¬ 
tion  of  position.  Since  the  total  number  density 
of  galaxies  predicted  by  relation  (19.2)  is  infi¬ 
nite,  we  define  n *  =  density  of  galaxies  with 
luminosity  >  L*.  The  observed  average  value 
of  ft*  over  a  large  volume  of  space  is  n*  = 
3.5  x  10-3  Mpc-3.  The  mean  separation  between 
galaxies  corresponding  to  this  density  is  4  Mpc. 
Since  most  galaxies  are  fainter  than  L*,  and 
since,  in  addition,  they  often  belong  to  groups, 
we  see  that  the  distances  between  normal  galax¬ 
ies  are  generally  not  much  larger  than  their  diam¬ 
eters. 


Masses  The  distribution  of  mass  in  galaxies  is 
a  crucial  quantity,  both  for  cosmology  and  for 
theories  of  the  origin  and  evolution  of  galax¬ 
ies.  Observationally  it  is  determined  from  the 
velocities  of  the  stars  and  interstellar  gas.  To¬ 
tal  masses  of  galaxies  can  also  be  derived  from 
their  motions  in  clusters  of  galaxies.  The  results 
are  usually  given  in  terms  of  the  corresponding 
mass-luminosity  ratio  M/L,  using  the  solar  mass 
and  luminosity  as  units.  The  value  measured  in 
the  solar  neighbourhood  of  the  Milky  Way  is 
M/L  =  3.  If  M/L  were  constant,  the  mass  distri¬ 
bution  could  be  determined  from  the  observed  lu¬ 
minosity  distribution  by  multiplying  with  M/L. 
Unfortunately,  there  is  no  universally  valid  value 
for  the  ratio  M/L. 

The  masses  of  elliptical  galaxies  may  be  ob¬ 
tained  from  the  stellar  velocity  dispersion  given 
by  the  broadening  of  spectral  lines.  The  method 
is  based  on  the  virial  theorem  (see  Sect.  6.10), 
which  says  that  in  a  system  in  equilibrium,  the 
kinetic  energy  T  and  the  potential  energy  U  are 
related  according  to  the  equation 

2T  +  U  =  0.  (19.3) 

Since  ellipticals  rotate  slowly,  the  kinetic  energy 
of  the  stars  may  be  written 

T  =  Mv2/2,  (19.4) 

where  M  is  the  total  mass  of  the  galaxy  and  v  the 
velocity  width  of  the  spectral  lines.  The  potential 
energy  is 

U  =  -GM2/2R,  (19.5) 

where  R  is  a  suitable  average  radius  of  the  galaxy 
that  can  be  estimated  or  calculated  from  the  light 
distribution.  Introducing  (19.4)  and  (19.5)  into 
(19.3)  we  obtain: 

M  =  2v2R/G.  (19.6) 

From  this  formula  the  mass  of  an  elliptical 
galaxy  can  be  calculated  when  v2  and  R  are 
known.  Some  observations  of  velocities  in  ellipti- 
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Fig.  19.10  Velocity  of  rotation  V (R)  [kms-1]  and  ve-  the  former  is  not.  (Davies,  R.L.  (1981):  Mon.  Not.  R.  As- 
locity  dispersion  <j(R )  [kms-1]  as  functions  of  radius  tron.  Soc.  194,  879) 

[kpc]  for  types  E2  and  E5.  The  latter  galaxy  is  rotating, 


Fig.  19.11  Rotation 
curves  for  seven  spiral 
galaxies.  (Rubin,  V.C., 
Ford,  W.K.,  Thonnard,  N. 
(1978):  Astrophys.  J. 
(Lett.)  225,  L107) 


cal  galaxies  are  given  in  Fig.  19.10.  These  will  be 
further  discussed  in  Sect.  19.4.  The  value  of  M/L 
derived  from  such  observations  is  about  10  within 
a  radius  of  10  kpc.  The  mass  of  a  bright  elliptical 
might  thus  be  up  to  10 13  AT©. 

The  masses  of  spiral  galaxies  are  obtained 
from  their  rotation  curve  v(R ),  which  gives  the 
variation  of  their  rotational  velocity  with  radius. 
Assuming  that  most  of  the  mass  is  in  the  almost 
spherical  bulge,  the  mass  within  radius  R ,  M(R ), 
can  be  estimated  from  Kepler’s  third  law: 

M(R)  =  Rv(R)2/G.  (19.7) 

Some  typical  rotation  curves  are  shown  in 
Fig.  19.11.  In  the  outer  parts  of  many  spirals, 


v(R)  does  not  depend  on  R.  This  means  that 
M(R)  is  directly  proportional  to  the  radius — the 
further  out  one  goes,  the  larger  the  interior  mass 
is.  Since  the  outer  parts  of  spirals  are  very  faint, 
at  large  radii  the  value  of  M/L  is  directly  pro¬ 
portional  to  the  radius.  For  the  disk,  one  finds 
that  M/L  =  8  for  early  and  M/L  =  4  for  late 
spiral  types.  The  largest  measured  total  mass  is 
2  x  10 12  Mq. 

In  order  to  measure  the  mass  at  even  larger 
radii  where  no  emission  can  be  detected,  motions 
in  systems  of  galaxies  have  to  be  used.  One  pos¬ 
sibility  is  to  use  pairs  of  galaxies.  In  principle,  the 
method  is  the  same  as  for  binary  stars.  However, 
because  the  orbital  period  of  a  binary  galaxy  is 
about  109  years,  only  statistical  information  can 


396 


19  Galaxies 


Fig.  19.12  Stars  and  hot  gas  in  the  Hydra  galaxy  clus¬ 
ter.  Left :  Optical  image  of  the  centre  of  the  cluster.  Right : 
An  X-ray  image  of  the  same  area  in  the  same  scale  by 

be  obtained  in  this  way.  The  results  are  still  uncer¬ 
tain,  but  seem  to  indicate  values  of  M/L  =  20-30 
at  pair  separations  of  about  50  kpc. 

A  fourth  method  to  determine  galaxy  masses 
is  to  apply  the  virial  theorem  to  clusters  of  galax¬ 
ies,  assuming  that  these  are  in  equilibrium.  The 
kinetic  energy  T  in  (19.4)  can  then  be  calcu¬ 
lated  from  the  observed  redshifts  and  the  poten¬ 
tial  energy  U ,  from  the  separations  between  clus¬ 
ter  galaxies.  If  it  is  assumed  that  the  masses  of 
galaxies  are  proportional  to  their  luminosities,  it 
is  found  that  M/L  is  about  200  within  1  Mpc  of 
the  cluster  centre.  However,  there  is  a  large  vari¬ 
ation  from  cluster  to  cluster. 

There  are  also  two  newer  and  more  accu¬ 
rate  methods  for  mass  determination.  The  fifth 
method  utilises  the  X-ray  radiation  coming  from 
the  hot  gas  around  galaxies  or  galaxy  clusters 
(Fig.  19.12).  The  temperature  of  the  gas  around 
galaxy  clusters  is  as  high  as  107,  even  E8  K.  Since 
cloud  seems  to  be  more  or  less  stable  there  must 
be  a  massive  gravitational  well  preventing  the  gas 
from  escaping.  The  dependence  between  the  total 
mass  of  the  galaxy  cluster  and  the  gas  tempera¬ 
ture  is  almost  linear:  the  hotter  the  gas  the  more 
mass  there  is.  The  total  mass  of  the  richest  galaxy 


the  Chandra  satellite.  The  temperature  of  the  gas  is  about 
4  x  107  K.  (Optical  image  UK  Schmidt/DSSl/ESO,  SRC, 
X-ray  image  NASA/CXC/SAO) 

clusters  is  over  1015  solar  masses,  and  only  about 
1  %  of  this  is  in  the  form  of  visible  matter. 

Gravitational  lenses  (Sect.  19.7)  are  the  most 
accurate  method.  A  light  ray  passing  by  a  galaxy 
is  bent  and  the  amount  of  bending  depends  on  the 
mass  of  the  galaxy.  The  method  is  generally  used 
to  determine  masses  of  galaxy  clusters  but  it  can 
also  be  used  for  individual  galaxies  if  the  lensing 
galaxy  forms  several  images  of  the  background 
object  or  clearly  distorts  its  shape.  Gravitational 
lenses  have  finally  confirmed  that  only  about  one 
sixth  of  the  mass  of  galaxies  is  ordinary  matter 
(like  stars  or  gas)  and  the  rest  dark  matter. 

Dark  Matter  the  most  precise  value  for  the 
dark  matter  has  been  obtained  by  the  WM  AP  and 
Planck  satellites  which  have  studied  the  details 
of  the  cosmic  background  radiation.  As  it  will  be 
mentioned  in  Chap.  20,  about  31  %  of  the  total 
content  (mass  +  energy)  of  the  universe  is  mat¬ 
ter.  The  final  results  of  the  Planck  satellite  pub¬ 
lished  in  2015  show  that  the  contribution  of  ordi¬ 
nary  matter  is  5  %  and  of  cold  dark  matter  26  %. 

The  distribution  of  the  dark  matter  can  nowa¬ 
days  be  determined  by  weak  and  strong  gravita¬ 
tional  lenses.  The  result  is  that  the  dark  matter 
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Fig.  19.13  Distribution  of  dark  matter  around  the  galaxy 
cluster  MACS  J07 17.5+3745.  The  background  image  was 
taken  by  the  Hubble  telescope.  Superimposed  is  the  distri¬ 


bution  of  dark  matter  calculated  using  strong  and  weak 
gravitational  lenses.  (NASA,  ESA,  Harald  Ebeling  (Uni¬ 
versity  of  Hawaii  at  Manoa)  &  Jean-Paul  Kneib  (LAM)) 


appears  as  wide  halos  around  galaxies  and  galaxy 
clusters  (Fig.  19.13). 

The  nature  of  the  dark  matter  is  not  yet  known. 
It  interacts  with  baryons  almost  only  by  its  grav¬ 
ity.  It  is  known  to  be  “cold”,  which  means  that 
the  velocities  of  the  particles  are  low  unlike  with 
neutrinos  that  always  travel  almost  at  the  speed  of 
light. 

It  is  assumed  that  the  dark  matter  consists 
of  unknown  elementary  particles.  Possible  candi¬ 
dates  are  light  axions  or  heavy  particles  predicted 
by  supersymmetry  theories  (WIMP,  weakly  in¬ 
teracting  massive  particle)  like  neutralinos.  Their 
mass  might  be  of  the  order  100  GeV-1  TeV,  or 
hundreds  of  proton  masses.  Underground  scintil¬ 
lation  detectors  and  the  LHC  accelerator  at  the 
CERN  are  used  to  search  for  particles  of  the  dark 
matter. 


1 9.3  Galactic  Structures 

Ellipticals  and  Bulges  In  all  galaxies  the  oldest 
stars  have  a  more  or  less  round  distribution.  In  the 
Milky  Way  this  component  is  represented  by  the 
population  II  stars.  Its  inner  parts  are  called  the 


bulge,  and  its  outer  parts  are  often  referred  to  as 
the  halo.  There  does  not  appear  to  be  any  phys¬ 
ically  significant  difference  between  the  bulge 
and  the  halo.  The  population  of  old  stars  can  be 
best  studied  in  ellipticals,  which  only  contain  this 
component.  The  bulges  of  spiral  and  SO  galaxies 
are  very  similar  to  ellipticals  of  the  same  size. 

The  surface  brightness  distribution  in  elliptical 
galaxies  essentially  depends  only  on  the  distance 
from  the  centre  and  the  orientation  of  the  major 
and  minor  axis.  If  r  is  the  radius  along  the  major 
axis,  the  surface  brightness  7(r)  is  well  described 
by  de  Vaucouleurs  ’  law : 


(19.8) 


The  constants  in  (19.8)  have  been  chosen  so 
that  half  of  the  total  light  of  the  galaxy  is  ra¬ 
diated  from  within  the  radius  re  and  the  sur¬ 
face  brightness  at  that  radius  is  7e.  The  parame¬ 
ters  re  and  7e  are  determined  by  fitting  (19.8)  to 
observed  brightness  profiles.  Typical  values  for 
elliptical,  normal  spiral  and  SO  galaxies  are  in 
the  ranges  re  =  1-10  kpc  and  7e  corresponds  to 
20-23  magnitudes  per  square  arc  second. 
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Fig.  19.1 4  The  distribution  of  surface  brightness  in  E  and 
cD  galaxies.  Ordinate :  surface  magnitude,  mag/sq.arcsec; 
abscissa :  (radius  [kpc])1/4.  Equation  (18.8)  corresponds 
to  a  straight  line  in  this  representation.  It  fits  well  with 
an  E  galaxy,  but  for  type  cD  the  luminosity  falls  off  more 


slowly  in  the  outer  regions.  Comparison  with  Fig.  19.15 
shows  that  the  brightness  distribution  in  SO  galaxies  be¬ 
haves  in  a  similar  fashion.  cD  galaxies  have  often  been 
erroneously  classified  as  SO.  (Thuan,  T.X.,  Romanishin, 
W.  (1981):  Astrophys.  J.  248,  439) 


Fig.  19.15  The 

distribution  of  surface 
brightness  in  types  SO 
and  Sb.  Ordinate : 
mag/sq.arc  sec;  abscissa : 
radius  [arc  sec] .  The 
observed  surface  brightness 
has  been  decomposed  into 
a  sum  of  bulge  and  disc 
contributions.  Note  the 
larger  disc  component  in 
type  Sb.  (Boroson,  T. 
(1981):  Astrophys.  J. 

Suppl.  46,  177) 


Although  de  Vaucouleurs’  law  is  a  purely 
empirical  relation,  it  still  gives  a  remarkably 
good  representation  of  the  observed  light  distri¬ 
bution.  However,  in  the  outer  regions  of  ellipti¬ 
cal  galaxies,  departures  may  often  occur:  the  sur¬ 
face  brightness  of  dwarf  spheroidals  often  falls 
off  more  rapidly  than  (19.8),  perhaps  because 
the  outer  parts  of  these  galaxies  have  been  tom 
off  in  tidal  encounters  with  other  galaxies.  In 
the  giant  galaxies  of  type  cD,  the  surface  bright¬ 
ness  falls  off  more  slowly  (see  Fig.  19.14).  Such 
galaxies  are  usually  at  the  centre  of  dense  galaxy 


clusters,  and  during  billions  of  years  they  have 
absorbed  several  smaller  galaxies.  The  stars  of 
the  disrupted  galaxies  usually  remain  orbiting 
the  giant  galaxy  in  its  outskirts  increasing  the 
brightness  of  the  outer  parts.  cD  galaxies  have 
been  called  the  cannibals  of  the  world  of  galax¬ 
ies. 

Although  the  isophotes  in  elliptical  galaxies 
are  ellipses  to  a  good  approximation,  their  el- 
lip  ticities  and  the  orientation  of  their  major  axes 
may  vary  as  a  function  of  radius.  Different  galax¬ 
ies  differ  widely  in  this  respect,  indicating  that 
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the  structure  of  ellipticals  is  not  as  simple  as  it 
might  appear.  In  particular,  the  fact  that  the  direc¬ 
tion  of  the  major  axis  sometimes  changes  within 
a  galaxy  suggests  that  some  ellipticals  may  not 
be  axially  symmetric  in  shape.  A  probable  reason 
is  that  many  galaxies  of  different  shapes  rotating 
in  different  directions  have  assimilated  to  the  cur¬ 
rent  galaxies. 

From  the  distribution  of  surface  brightness, 
the  three-dimensional  structure  of  a  galaxy  may 
be  inferred  as  explained  in  Box  19.1.  The  rela¬ 
tion  (19.8)  gives  a  brightness  profile  which  is  very 
strongly  peaked  towards  the  centre.  The  real  dis¬ 
tribution  of  axial  ratios  for  ellipticals  can  be  sta¬ 
tistically  inferred  from  the  observed  one.  On  the 
(questionable)  assumption  that  they  are  rotation- 
ally  symmetric,  one  obtains  a  broad  distribution 
with  a  maximum  corresponding  to  types  E3-E4. 
If  the  true  shape  is  not  axisymmetric,  it  cannot 
even  statistically  be  uniquely  determined  from 
the  observations. 

Disks  A  bright,  massive  stellar  disk  is  char¬ 
acteristic  for  SO  and  spiral  galaxies,  which  are 
therefore  called  disk  galaxies.  There  are  indica¬ 
tions  that  in  some  ellipticals  there  is  also  a  faint 
disk  hidden  behind  the  bright  bulge.  In  the  Milky 
Way  the  disk  is  formed  by  population  I  stars. 

The  distribution  of  surface  brightness  in  the 
disk  is  described  by  the  expression 

I  (r)  =  Iq  e~r/r°.  (19.9) 

Figure  19.11  shows  how  the  observed  radial 
brightness  distribution  can  be  decomposed  into 
a  sum  of  two  components:  a  centrally  domi¬ 
nant  bulge  and  a  disk  contributing  significantly  at 
larger  radii.  The  central  surface  brightness  /o  typ¬ 
ically  corresponds  to  21-22  mag./sq.arcsec,  and 
the  radial  scale  length  ro  =  1-5  kpc.  In  Sc  galax¬ 
ies  the  total  brightness  of  the  bulge  is  generally 
slightly  smaller  than  that  of  the  disk,  whereas  in 
earlier  Hubble  types  the  bulge  has  a  larger  total 
brightness.  The  thickness  of  the  disk,  measured 
in  galaxies  that  are  seen  edge-on,  may  typically 
be  about  1.2  kpc.  Sometimes  the  disk  has  a  sharp 
outer  edge  at  about  4  ro. 


The  Interstellar  Medium  Elliptical  and  SO  ga¬ 
laxies  contain  very  little  interstellar  gas.  How¬ 
ever,  in  some  ellipticals  neutral  hydrogen  amount¬ 
ing  to  about  0.1  %  of  the  total  mass  has  been  de¬ 
tected,  and  in  the  same  galaxies  there  are  also 
often  signs  of  recent  star  formation.  In  some 
SO  galaxies  much  larger  gas  masses  have  been 
observed,  but  the  relative  amount  of  gas  is  very 
variable  from  one  galaxy  to  another.  The  lack  of 
gas  in  these  galaxies  is  rather  unexpected,  since 
during  their  evolution  the  stars  release  much  more 
gas  than  is  observed. 

The  relative  amount  of  neutral  hydrogen  in 
spiral  galaxies  is  correlated  with  their  Hubble 
type.  Thus  Sa  spirals  contain  about  2  %,  Sc  spi¬ 
rals  10  %,  and  Irr  I  galaxies  up  to  30  %  or  more. 

The  distribution  of  neutral  atomic  hydrogen 
has  been  mapped  in  detail  in  nearby  galaxies  by 
means  of  radio  observations.  In  the  inner  parts  of 
galaxies  the  gas  forms  a  thin  disk  with  a  fairly 
constant  thickness  of  about  200  pc,  sometimes 
with  a  central  hole  of  a  few  kpc  diameter.  The  gas 
disk  may  continue  far  outside  the  optical  disk,  be¬ 
coming  thicker  and  often  warped  from  the  central 
disk  plane. 

Most  of  the  interstellar  gas  in  spiral  galaxies  is 
in  the  form  of  molecular  hydrogen.  The  hydrogen 
molecule  cannot  be  observed  directly,  but  the  dis¬ 
tribution  of  carbon  monoxide  has  been  mapped 
by  radio  observations.  The  distribution  of  molec¬ 
ular  hydrogen  can  then  be  derived  by  assuming 
that  the  ratio  between  the  densities  of  CO  and  H2 
is  everywhere  the  same,  although  this  may  not 
always  be  true.  It  is  found  that  the  distribution 
obeys  a  similar  exponential  law  as  the  young  stars 
and  H II  regions,  although  in  some  galaxies  (such 
as  the  Milky  Way)  there  is  a  central  density  min¬ 
imum.  The  surface  density  of  molecular  gas  may 
be  five  times  larger  than  that  of  H  I,  but  because 
of  its  strong  central  concentration  its  total  mass  is 
only  perhaps  two  times  larger. 

The  distribution  of  cosmic  rays  and  magnetic 
fields  in  galaxies  can  be  mapped  by  means  of 
radio  observations  of  the  synchrotron  radiation 
from  relativistic  electrons.  The  strength  of  the 
magnetic  field  deduced  in  this  way  is  typically 
0.5-1  nT.  The  observed  emission  is  polarised, 
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Fig.  19.16  A  spiral  galaxy  NGC  5907  seen  sideways.  The  galaxy  is  surrounded  by  star  streams  that  are  remnants  of 
small  dwarf  galaxies  merged  to  the  big  galaxy.  (Photo  Gabany,  Martinez-Delgado  et  al.,  2010,  AJ  140,  962) 


showing  that  the  magnetic  field  is  fairly  well- 
ordered  on  large  scales.  Since  the  plane  of  po¬ 
larisation  is  perpendicular  to  the  magnetic  field, 
the  large-scale  structure  of  the  magnetic  field  can 
be  mapped.  However,  the  plane  of  polarisation  is 
changed  by  Faraday  rotation,  and  for  this  reason 
observations  at  several  wavelengths  are  needed  in 
order  to  determine  the  direction  of  the  field.  The 
results  show  that  the  field  is  generally  strongest 
in  the  plane  of  the  disk,  and  is  directed  along 
the  spiral  arms  in  the  plane.  The  field  is  thought 
to  have  been  produced  by  the  combined  action 
of  rising  elements  of  gas,  perhaps  produced  by 
supernova  explosions,  and  the  differential  rota¬ 
tion,  in  principle  in  the  same  way  as  the  pro¬ 
duction  of  solar  magnetic  fields  was  explained  in 
Chap.  13. 

The  Outer  Parts  of  Galaxies  Even  the  outer¬ 
most  parts  outside  the  visible  disk  are  not  empty. 


Recently  they  have  been  seen  to  contain  several 
different  kinds  of  matter.  The  most  abundant  is 
the  dark  matter  extending  around  the  galaxy  as  a 
sphere  or  slightly  flattened  ellipsoid  to  a  distance 
of  tens  of  times  further  than  the  visible  matter. 

Hot  gas  is  the  second  largest  component.  Its 
high  temperature  and  great  abundance  were  sur¬ 
prising.  For  example,  the  Milky  Way  is  sur¬ 
rounded  by  a  region  of  hot  gas  extending  as  a 
spherical  cloud  as  far  as  to  150  kiloparsecs.  The 
temperature  of  the  gas  is  1-2.5  x  106  K  and  the 
total  mass  about  the  same  order  of  magnitude  as 
the  total  mass  of  the  stars  in  the  Milky  Way. 

The  third  component  comprises  of  separate 
gas  clouds  having  greater  velocities  than  other 
matter  of  the  galaxy.  Some  of  the  clouds  come 
from  the  centre  of  the  Milky  Way,  some  from 
other  galaxies. 

Yet  another  component  consists  of  star  clusters 
and  star  streams.  In  addition  to  globular  clusters 
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galaxies  are  accompanied  by  star  concentrations 
that  are  remnants  of  dwarf  galaxies  assimilated 
to  the  galaxy.  For  example,  the  globular  cluster 
M54  is  not  an  original  Milky  Way  cluster  but  a 
remnant  of  the  Sagittarius  dwarf  ellipsoid  moving 
through  the  central  part  of  the  Milky  Way.  Dis¬ 
rupting  dwarf  galaxies  are  sometimes  seen  as  star 
streams  around  the  galaxy  (Fig.  19.16). 


Box  19.1  (Three-Dimensional  Shape  of  Galax¬ 
ies)  Equations  (19.8)  and  (19.9)  describe  the 
distribution  of  galactic  light  projected  on  the 
plane  of  the  sky.  The  actual  three-dimensional 
luminosity  distribution  in  a  galaxy  is  obtained 
by  inverting  the  projection.  This  is  easiest  for 
spherical  galaxies. 


Let  us  suppose  that  a  spherical  galaxy  has 
the  projected  luminosity  distribution  I  (r)  (e.g. 
as  in  (19.8)).  With  coordinates  chosen  ac¬ 
cording  to  the  figure,  I  (r)  is  given  in  terms 
of  the  three-dimensional  luminosity  distribu¬ 
tion  p(R)  by 


p(R)dz. 


Since  z2  =  R2  —  r2,  a  change  of  the  variable  of 
integration  yields 


Hr)  =2 


p(R)RdR 
Hr2  -  r2 


This  is  known  as  an  Abel  integral  equation 
for  p(R),  and  has  the  solution 


P(R)  = 


1  d  r°°  /(r)rdr 
TtRdRjR  Hr2  -  R2 

1  r°°  (d//dr)dr 
n  Jr  Hr2  R2  ’ 


Introducing  the  observed  /(r)  into  this  expres¬ 
sion,  one  obtains  the  actual  luminosity  dis¬ 
tribution  p(R).  In  the  figure  the  solid  curve 
shows  the  three-dimensional  luminosity  distri¬ 
bution  obtained  from  the  Vaucouleurs’  law  (the 
dashed  line). 


If  the  galaxy  is  not  spherical,  its  three- 
dimensional  shape  can  only  be  determined  if 
its  inclination  with  respect  to  the  line  of  sight 
is  known.  Since  galactic  disks  are  thin  and  of 
constant  thickness,  the  inclination  i  of  a  disk 
galaxy  is  obtained  directly  from  the  axis  ratio 
of  its  projected  image:  sin  i  =  b/a. 

When  the  inclination  is  known,  the  real  axis 
ratio  of  the  bulge  qo  can  be  determined  from  the 
projected  value  q.  For  a  rotationally  symmetric 
bulge  the  relation  between  q  and  qo  is 


1  -% 

The  flattenings  of  disk  galaxy  bulges  obtained 
from  this  relation  lie  in  the  range  qo  =  0.3-0. 5. 
Since  the  inclinations  of  ellipticals  are  gener¬ 
ally  unknown,  only  the  statistical  distribution 
of  q  can  be  determined  from  that  of  qo. 


1 9.4  Dynamics  of  Galaxies 

We  have  seen  how  the  masses  of  galaxies  can  be 
derived  from  observed  velocities  of  stars  and  gas. 
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The  same  observations  can  be  used  to  study  the 
internal  distribution  of  mass  in  more  detail. 

Slowly  Rotating  Systems  The  dynamics  of  el¬ 
liptical  galaxies  and  disk  galaxy  bulges  are  stud¬ 
ied  by  means  of  the  Doppler  shifts  and  broad- 
enings  of  stellar  absorption  lines.  Since  a  given 
absorption  line  is  the  sum  of  contributions  from 
many  individual  stars,  its  Doppler  shift  gives  their 
mean  velocity,  while  its  broadening  is  increased 
by  an  amount  depending  on  the  dispersion  of  stel¬ 
lar  velocities  around  the  mean.  By  observing  how 
the  wavelengths  and  widths  of  spectral  lines  be¬ 
have  as  functions  of  the  radius,  one  can  get  some 
insight  into  the  distribution  of  mass  in  the  galaxy. 

Examples  of  the  observed  radial  dependence 
of  the  rotational  velocity  and  velocity  disper¬ 
sion  derived  for  some  ellipticals  were  given  in 
Fig.  19.10.  The  observed  rotational  velocities  are 
often  small  (<100  kms-1),  while  the  velocity 
dispersion  may  typically  be  about  200  km  s-1 .  If 
elliptical  galaxies  were  in  fact  ellipsoids  of  revo¬ 
lution,  there  should  be  a  statistical  relation  (when 
projection  effects  have  been  taken  into  account) 
between  flatness,  rotational  velocity  and  velocity 
dispersion.  Such  a  relationship  has  been  observed 
for  fainter  ellipticals  and  for  disk  galaxy  bulges. 
However,  some  of  the  brightest  ellipticals  rotate 
very  slowly.  Therefore  their  flattening  cannot  be 
due  to  rotation. 

The  radial  dependence  of  the  velocity  disper¬ 
sion  gives  information  on  the  distribution  of  mass 
within  the  galaxy.  Since  it  also  depends  on  how 
the  shapes  of  stellar  orbits  in  the  galaxy  are  dis¬ 
tributed,  its  interpretation  requires  detailed  dy¬ 
namical  models. 

Some  ellipticals  have  different  velocity  sys¬ 
tems.  An  extreme  case  is  NGC  4550  classified  as 
E7/S0.  Half  of  its  stars  orbit  the  centre  in  one  di¬ 
rection  and  the  rest  in  the  opposite  direction.  This 
indicates  that  the  galaxy  is  a  merger  of  two  spiral 
galaxies  that  originally  rotated  in  opposite  direc¬ 
tions.  Mutual  distances  of  stars  in  ellipticals  are 
so  large  that  collisions  do  not  occur  and  the  cur¬ 
rent  system  is  very  stable. 

Rotation  Curves  In  spiral  galaxies  the  distri¬ 
bution  of  mass  can  be  studied  directly  using  the 


observed  rotational  velocities  of  the  interstellar 
gas.  This  can  be  observed  either  at  optical  wave¬ 
lengths  from  the  emission  lines  of  ionised  gas 
in  H II  regions  or  at  radio  wavelengths  from  the 
hydrogen  21  cm  line.  Typical  galactic  rotation 
curves  were  shown  in  Fig.  19.11. 

The  qualitative  behaviour  of  the  rotation  curve 
in  all  spiral  galaxies  is  similar  to  the  rotation 
curve  of  the  Milky  Way:  there  is  a  central  por¬ 
tion,  where  the  rotational  velocity  is  directly  pro¬ 
portional  to  the  radius,  corresponding  to  rigid 
body  rotation.  At  a  few  kpc  radius  the  curve  turns 
over  and  becomes  flat,  i.e.  the  rotational  veloc¬ 
ity  does  not  depend  on  the  radius.  In  early  Hub¬ 
ble  types,  the  rotation  curve  rises  more  steeply 
near  the  centre  and  reaches  larger  velocities  in 
the  flat  region  (Sa  about  300  kms-1,  Sc  about 
200  kms-1).  A  higher  rotational  velocity  indi¬ 
cates  a  larger  mass  according  to  (19.7),  and  thus 
Sa  types  must  have  a  larger  mass  density  near 
the  centre.  This  is  not  unexpected,  since  a  more 
massive  bulge  is  one  of  the  defining  properties  of 
early  type  spirals. 

A  decrease  of  the  rotational  velocity  at  large 
radii  would  be  an  indication  that  most  of  the  mass 
is  inside  that  radius.  In  some  galaxies  such  a  de¬ 
crease  has  been  detected,  in  others  the  rotational 
velocity  remains  constant  as  far  out  as  the  obser¬ 
vations  can  reach. 

Spiral  Structure  Spiral  galaxies  (Fig.  19.17) 
are  relatively  bright  objects.  Some  have  a  well- 
defined,  large-scale  two-armed  spiral  pattern, 
whereas  in  others  the  spiral  structure  is  made  up 
of  a  large  number  of  short  filamentary  arms.  From 
galaxies  where  the  pattern  is  seen  in  front  of  the 
central  bulge,  it  has  been  deduced  that  the  sense 
of  winding  of  the  spiral  is  trailing  with  respect 
to  the  rotation  of  the  galaxy.  However,  the  rule  is 
not  absolute:  there  are  also  galaxies  with  leading 
arms. 

The  spiral  structure  is  most  clearly  seen  in  the 
interstellar  dust,  HII  regions,  and  the  OB  asso¬ 
ciations  formed  by  young  stars.  The  dust  often 
forms  thin  lanes  along  the  inner  edge  of  the  spiral 
arms,  with  star  forming  regions  on  their  outside. 
Enhanced  synchrotron  radio  emission  associated 
with  spiral  arms  has  also  been  detected. 
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Fig.  19.17  Above : 

A  spiral  galaxy  from 
above:  M51  (type  Sc).  The 
interacting  companion  is 
NGC5195  (type  Irr  II). 
(Lick  Observatory).  Below. 
A  spiral  galaxy  from  the 
side:  the  Sb  spiral 
NGC4565.  (NOAO/Kitt 
Peak  National 
Observatory) 


A  well  formed  wide  spiral  pattern  is  relatively 
common,  and  hence  it  must  be  a  long  lasting  phe¬ 
nomenon.  The  spiral  pattern  is  generally  thought 
to  be  a  wave  in  the  density  of  the  stellar  disk, 
as  discussed  in  Sect.  18.4.  As  the  interstellar 
gas  streams  through  the  density  wave  a  shock, 
marked  by  the  dust  lanes,  is  formed  as  the  in¬ 
terstellar  gas  is  compressed,  leading  to  the  col¬ 


lapse  of  molecular  clouds  and  the  formation  of 
stars.  The  density  wave  theory  predicts  charac¬ 
teristic  streaming  motions  within  the  arm,  which 
have  been  detected  in  some  galaxies  by  observa¬ 
tions  of  the  H I  21  cm  line. 

There  may  be  several  reasons  producing  spiral 
arms.  The  most  common  is  an  external  perturba¬ 
tion  when  another  galaxy  passes  by  at  a  close  dis- 
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tance  creating  a  wave  in  the  galaxy.  Other  sources 
of  perturbation  could  be  the  bar  of  the  galaxy  or 
the  gravitational  field  of  a  temporary  “traffic  jam” 
of  stars. 

A  single  close  encounter  or  a  traffic  jam  can 
explain  temporary  spiral  arms  but  not  more  per¬ 
manent  features.  The  differential  rotation  of  the 
galaxy  tends  to  wipe  out  the  arms  during  a  few 
revolutions  or  in  a  few  hundreds  of  millions  of 
years.  Some  renewal  mechanism  is  needed  to 
keep  the  arms  permanent. 

Standing  waves  can  explain  the  permanence. 
Birth  of  spiral  arms  and  their  motion  gener¬ 
ate  density  waves  travelling  through  the  whole 
galaxy.  The  waves  can  be  reflected  or  refracted 
by  the  central  bulge.  When  wave  fronts  moving 
in  different  directions  meet  they  can  either  damp 
or  amplify  each  others.  Spiral  arms  are  standing 
waves,  fronts  of  amplified  waves,  seen  as  concen¬ 
trates  of  stars  and  gas  clouds.  The  spiral  forms 
move  around  the  galaxy  at  constant  speed. 


1 9.5  Stellar  Ages  and  Element 
Abundances  in  Galaxies 

From  the  Milky  Way  we  know  that  stars  of  pop¬ 
ulations  I  and  II  are  different  not  only  in  respect 
to  their  spatial  distribution,  but  also  in  respect  to 
their  ages  and  heavy  element  abundances.  This 
fact  gives  important  evidence  about  the  formation 
of  the  Milky  Way,  and  it  is  therefore  of  interest  if 
a  similar  connection  can  be  found  in  other  galax¬ 
ies. 

The  indicators  of  composition  most  easily 
measured  are  the  variations  of  colour  indices  in¬ 
side  galaxies  and  between  different  galaxies.  Two 
regularities  have  been  discovered  in  these  varia¬ 
tions:  First,  according  to  the  colour-luminosity 
relation  for  elliptical  and  SO  galaxies,  brighter 
galaxies  are  redder.  Secondly,  there  is  a  colour- 
aperture  effect ,  so  that  the  central  parts  of  galax¬ 
ies  are  redder.  For  spirals  this  relationship  is  due 
to  the  presence  of  young,  massive  stars  in  the 
disk,  but  it  has  also  been  observed  for  elliptical 
and  SO  galaxies. 

Galactic  spectra  are  composed  of  the  spectra 
of  all  their  stars  added  together.  Thus  the  colours 
depend  both  on  the  ages  of  the  stars  (young  stars 


are  bluer)  and  on  the  heavy  element  abundance  Z 
(stars  with  larger  Z  are  redder).  The  interpreta¬ 
tion  of  the  observational  results  thus  has  to  be 
based  on  detailed  modelling  of  the  stellar  com¬ 
position  of  galaxies  or  population  synthesis. 

Stars  of  different  spectral  classes  contribute 
different  characteristic  absorption  features  to  the 
galaxy  spectrum.  By  observing  the  strength  of 
various  spectral  features,  one  can  find  out  about 
the  masses,  ages  and  chemical  composition  of 
the  stars  that  make  up  the  galaxy.  For  this  pur¬ 
pose,  a  large  number  of  characteristic  properties 
of  the  spectrum,  strengths  of  absorption  lines  and 
broad-band  colours  are  measured.  One  then  at¬ 
tempts  to  reproduce  these  data,  using  a  represen¬ 
tative  collection  of  stellar  spectra.  If  no  satisfac¬ 
tory  solution  can  be  found,  more  stars  have  to 
be  added  to  the  model.  The  final  result  is  a  pop¬ 
ulation  model,  giving  the  stellar  composition  of 
the  galaxy.  Combining  this  with  theoretical  stellar 
evolution  calculations,  the  evolution  of  the  light 
of  the  galaxy  can  also  be  computed. 

Population  synthesis  of  E  galaxies  show  that 
practically  all  their  stars  were  formed  simulta¬ 
neously  about  13-14  x  109  years  ago,  which  is 
close  to  the  age  of  the  universe  Most  of  their  light 
comes  from  red  giants,  whereas  most  of  their 
mass  resides  in  lower  main  sequence  stars  of  less 
than  one  solar  mass. 

Since  all  stars  have  roughly  the  same  age,  the 
colours  of  elliptical  galaxies  are  directly  related 
to  their  metallicities.  Thus  the  colour-luminosity 
relation  indicates  that  Z  in  giant  ellipticals  may 
be  double  that  in  the  solar  neighbourhood,  while 
it  may  be  smaller  by  a  factor  100  in  dwarfs.  Simi¬ 
larly,  the  radial  dependence  of  the  colours  can  be 
explained  if  the  value  of  Z  at  the  centre  is  an  or¬ 
der  of  magnitude  larger  than  it  is  at  larger  radii. 

The  stellar  composition  of  disk  galaxy  bulges 
is  generally  similar  to  that  of  ellipticals.  The  ele¬ 
ment  abundances  in  the  gas  in  spirals  can  be  stud¬ 
ied  by  means  of  the  emission  lines  from  H II  re¬ 
gions  ionised  by  newly  formed  stars.  In  this  case 
too,  the  metallicity  increases  towards  the  centre. 

The  birth  histories  of  galaxies  explain  partly 
the  different  metallicities  and  age  distributions. 
As  we  will  see  later,  galaxies  collide  and  merge 
in  the  span  of  billions  of  years.  Most  clearly  this 
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is  seen  in  dwarf  galaxies  having  sow  few  stars 
that  even  a  single  merger  leaves  a  distinct  signa¬ 
ture.  Figure  19.18  shows  the  age  distribution  of 
two  Milky  Way’s  neighbour  galaxies.  A  few  of 
the  stars  of  Leo  A  were  born  immediately  after 
the  origin  of  the  universe  but  most  of  the  stars 
were  born  during  the  last  5-6  billion  years.  The 
stars  of  the  Carina  dwarf  galaxy  were  born  dur¬ 
ing  three  separate  periods,  at  the  age  of  1-2,  6-8 
and  about  1 1  billion  years. 


1 9.6  Systems  of  Galaxies 

The  galaxies  are  not  smoothly  distributed  in 
space;  rather,  they  form  systems  of  all  sizes: 
galaxy  pairs,  small  groups ,  large  clusters  and  su¬ 
perclusters  formed  from  several  groups  and  clus¬ 
ters.  The  larger  a  given  system,  the  less  its  density 
exceeds  the  mean  density  of  the  Universe.  On  the 
average,  the  density  is  twice  the  background  den¬ 
sity  for  systems  of  radius  5  Mpc  and  10  %  above 
the  background  at  radius  20  Mpc. 

Interactions  of  Galaxies  Galaxies  do  not 
evolve  in  isolation;  instead,  they  interact  continu¬ 
ously  with  other  galaxies.  Each  large  galaxy  has 
experienced  encounters  or  collisions  with  several 
other  large  galaxies  and  tens  or  even  hundreds 
encounters  with  much  smaller  galaxies. 

Among  the  most  common  distortions  are 
“bridges”  and  “tails”  caused  by  tidal  forces  dur¬ 
ing  an  encounter  (Fig.  19.19).  Sometimes  a  small 
galaxy  breaks  down  and  forms  a  ring  in  the  equa¬ 
torial  plane  of  the  larger  galaxy  or  sometimes  on 
a  polar  orbit.  The  most  striking  ring  galaxies  are 
born  when  a  dense  galaxy  passes  straight  through 
another  one  (Fig.  19.20).  In  many  cases  the  ef¬ 
fects  of  collisions  are  seen  as  starbursts  when  the 
gas  and  dust  clouds  of  the  galaxies  meet,  con¬ 
dense  and  start  to  produce  new  stars.  Sometimes 
image  processing  is  needed  to  show  faint  but 
sharp  rings  around  a  galaxy  (Fig.  19.21). 

The  interactions  between  galaxies  are  not  al¬ 
ways  dramatic.  For  example,  the  Milky  Way 
has  two  satellites,  the  Large  and  Small  Mag¬ 
ellanic  Clouds  (see  Fig.  19.5),  which  are  Irr  I 
type  dwarf  galaxies  at  about  60  kpc  distance. 
It  is  thought  that  approximately  5  x  108  years 


Fig.  19.18  The  age  distribution  of  the  dwarf  galaxies  in 
Leo  and  Carina  shows  that  the  stars  of  the  galaxies  were 
formed  in  separate  collision  episodes.  (Tolstoy,  Hill,  Tosi, 
2009,  Ann.  Rev.  Astron.  Astrophys.  47,  371) 

ago,  these  passed  the  Milky  Way  at  a  distance  of 
about  10-15  kpc,  leaving  behind  the  Magellanic 
Stream ,  a  180°  long  thin  stream  of  neutral  hy¬ 
drogen  clouds.  Systems  of  this  type,  where  a  gi¬ 
ant  galaxy  is  surrounded  by  a  few  small  compan¬ 
ions,  are  quite  common.  Computations  show  that 
in  many  such  cases  the  tidal  interactions  are  so 
strong  that  the  companions  will  merge  with  the 
parent  galaxy  at  the  next  close  approach.  This  is 
likely  to  happen  to  the  Magellanic  Clouds. 

During  earlier  epochs  in  the  Universe,  when 
the  density  was  larger,  interactions  between  galax¬ 
ies  must  have  been  much  more  common  than  at 
present.  Thus  it  has  been  proposed  that  a  large 
fraction  of  bright  galaxies  have  undergone  ma¬ 
jor  mergers  at  some  stage  in  their  history.  In 
particular,  there  are  good  reasons  to  believe  that 
the  slowly  rotating,  non-axisymmetric  giant  el¬ 
lipticals  may  have  formed  by  the  merger  of  disk 
galaxies. 
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Fig.  1 9.1 9  The  galaxy 
pair  NGC  4676  is  known  as 
the  Mice.  The  galaxies  are 
about  to  collide  and  a  long 
gas  and  star  tail  protrudes 
from  each  of  them. 
(Hubble/NASA,  H.  Ford 
(JHU),  G.  Illingworth 
(UCSC/LO),  M.  Clampin 
(STScI),  G.  Hartig 
(STScI),  the  ACS  Science 
Team,  j a  ESA) 


Fig.  1 9.20  The  Cartwheel 
is  the  best  known  ring 
galaxy.  It  was  born  when  a 
small  galaxy  passed 
through  the  centre  of  the 
big  galaxy.  (Photo  Hubble, 
K.  Borne  (StScI)  and  Nasa) 


Groups  The  most  common  type  of  galaxy  sys¬ 
tems  are  small,  irregular  groups  of  a  few  tens  of 
galaxies.  A  typical  example  is  the  Local  Group , 
which  contains  two  larger  galaxies  in  addition  to 
the  Milky  Way — the  Andromeda  Galaxy  M31, 
an  Sb  spiral  of  about  the  same  size  as  the  Milky 
Way  with  two  dwarf  companions,  and  the  smaller 
Sc  spiral  M3 3.  The  rest  of  the  about  50  members 


of  the  Local  Group  are  dwarfs;  about  20  are  of 
type  dE  and  10  of  type  Irr  I.  The  diameter  of  the 
Local  Group  is  about  1.2  Mpc. 

Clusters  A  system  of  galaxies  may  be  defined 
to  be  a  cluster  if  it  contains  a  larger  number  (at 
least  50)  of  bright  galaxies.  The  number  of  mem¬ 
bers  and  the  size  of  a  cluster  depend  on  how  they 
are  defined.  One  way  of  doing  this  is  to  fit  the 
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Fig.  19.21  Rings  of  stars 
with  sharp  boundaries  in 
the  outer  parts  of  the 
galaxy  NGC  474.  The  rings 
were  born  when  a 
lenticular  galaxy  tore  a 
rotating  spiral  galaxy  layer 
by  layer. 

(CFHT/Coelum— J.-C. 
Cuillandre  &  G.  Anselmi) 


observed  distribution  of  galaxies  within  a  cluster 
with  an  expression  of  the  form  (19.8).  In  this  way 
a  characteristic  cluster  radius  of  about  2-5  Mpc  is 
obtained.  The  number  of  members  depends  both 
on  the  cluster  radius  and  on  the  limiting  magni¬ 
tude.  A  large  cluster  may  contain  several  hundred 
galaxies  that  are  less  than  two  magnitudes  fainter 
than  the  characteristic  luminosity  L*  of  (19.2). 

Clusters  of  galaxies  can  be  ordered  in  a  se¬ 
quence  from  extended,  low-density,  irregular  sys¬ 
tems  (sometimes  called  clouds  of  galaxies)  to 
denser  and  more  regular  structures  (Fig.  19.23). 
The  galaxy  type  composition  also  varies  along 
this  sequence  in  the  sense  that  in  the  loose  irregu¬ 
lar  clusters,  the  bright  galaxies  are  predominantly 
spirals,  whereas  the  members  of  dense  clusters 
are  almost  exclusively  type  E  and  SO.  The  nearest 
cluster  of  galaxies  is  the  Virgo  Cluster  at  a  dis¬ 
tance  of  about  15  Mpc.  It  is  a  relatively  irregu¬ 
lar  cluster,  where  a  denser  central  region  contain¬ 
ing  early  galaxy  types  is  surrounded  by  a  more 
extended  distribution  of  mainly  spiral  galaxies. 
The  nearest  regular  cluster  is  the  Coma  Cluster, 


Fig.  19.22  A  theoretical  X-ray  spectrum  for  plasma  with 
a  temperature  of  108  K.  Most  of  the  radiation  comes 
from  the  bremsstrahlung  of  electrons,  peaking  at  8.6  kilo- 
electronvolts  ( thick  line).  Superimposed  are  spectral  lines 
produced  by  different  ions.  (Bohringer  &  Hensler,  1989, 
A&A  215,  147) 

roughly  90  Mpc  away.  In  the  Coma  Cluster  a  cen¬ 
tral  pair  of  giant  ellipticals  is  surrounded  by  a  flat- 
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Fig.  19.23  Above :  The 
irregular  Virgo  Cluster  of 
galaxies.  Below :  The 
regular  Coma  Cluster  (ESO 
and  Karl-Schwarzschild- 
Observatorium) 


I 

♦  h  t 


tened  (axis  ratio  about  2:1)  system  of  early  type 
galaxies. 

Earlier  galaxy  groups  were  studied  as  con¬ 
glomerates  of  visible  galaxies.  Recently  two 
other,  more  massive,  components  have  been  in¬ 
cluded. 

First  of  all,  galaxy  clusters  are  surrounded 
by  large  clouds  of  hot  gas,  as  mentioned  in 
Sect.  19.2.  The  gas  can  be  observed  by  its  X- 


ray  emission.  The  spectrum  shows  features  typ¬ 
ical  for  bremsstrahlung  of  electrons  and  lines  of 
totally  ionised  metals  (like  FeXXIV).  Tempera¬ 
ture  measured  from  X-ray  spectra  are  107— 108  K. 
The  total  mass  of  the  gas  in  the  poorest  clusters 
is  about  5  times  and  in  the  richest  clusters  even 
20  times  as  big  as  the  mass  inside  the  galaxies. 
Part  of  the  gas  comes  from  the  galaxies  (jets  of 
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quasars,  supernova  explosions),  but  most  of  it  is 
hydrogen  and  helium  from  the  big  bang. 

Besides  in  X-ray  images  the  existence  of  the 
hot  gas  is  seen  also  as  extra  spots  in  the  cosmic 
background  radiation,  caused  by  the  Sunyayev- 
Zel’dovich  effect.  Photons  of  the  background  ra¬ 
diation  hit  electrons  of  the  hot  gas  and  receive 
some  extra  energy  (inverse  Compton  scattering). 
Below  218  GHz  the  radiation  is  slightly  cooler 
and  above  218  GHz  a  little  brighter.  The  diameter 
of  the  spot  is  about  one  arc  minute  and  the  tem¬ 
perature  difference  from  the  surroundings  0.1-1 
millikelvins  (Fig.  19.25).  One  of  the  main  objec¬ 
tives  of  the  Planck  satellite  was  to  find  gas  clouds 
around  galaxy  clusters  using  this  phenomenon. 
The  final  catalogue  of  Planck  (2015)  contains 
over  1600  objects  most  of  which  are  previously 
unknown  galaxies  with  redshifts  z  =  0  —  1.5. 

Without  a  strong  source  of  gravitation  the  hot 
gas  would  quickly  disperse  into  space.  The  grav¬ 
itation  is  caused  by  dark  matter;  its  mass  is  about 
five  times  as  big  as  that  of  the  ordinary  matter 
(hot  gas  +  visible  galaxies)  in  the  cluster.  Grav¬ 
itational  lenses  have  been  used  to  investigate  the 
dark  matter  in  galaxy  clusters  (Sect.  19.7). 

Superclusters  Groups  and  clusters  of  galaxies 
may  form  even  larger  systems,  superclusters.  For 
example,  the  Local  Group  belongs  to  the  Lo¬ 
cal  Supercluster ,  a  flattened  system  whose  centre 
is  the  Virgo  Cluster,  containing  tens  of  smaller 
groups  and  clouds  of  galaxies.  The  Coma  Clus¬ 
ter  is  part  of  another  supercluster.  The  diame¬ 
ters  of  superclusters  are  10-20  Mpc.  However, 
on  this  scale,  it  is  no  longer  clear  whether  one 
can  reasonably  speak  of  individual  systems.  Per¬ 
haps  it  would  be  more  accurate  to  think  of  the 
distribution  of  galaxies  as  a  continuous  network, 
where  the  large  clusters  are  connected  by  walls 
and  strings  formed  by  smaller  systems.  Between 
these  there  remain  empty  regions  containing  very 
few  galaxies,  which  can  be  up  to  50  Mpc  in  di¬ 
ameter  (Figs.  19.24,  20.8). 


1 9.7  Active  Galaxies  and  Quasars 

So  far  in  this  chapter  we  have  been  concerned 
with  the  properties  of  normal  galaxies.  In  some 


Fig.  19.24  The  large-scale  space  distribution  of  245,591 
galaxies.  The  radial  co-ordinate  is  the  redshift,  which  can 
be  translated  into  a  distance  using  a  value  for  the  Hub¬ 
ble  constant.  The  thickness  of  the  slices  is  about  10°. 
(2dFGRS  Team,  http://www2.aao.gov.au/~TDFgg/) 

galaxies,  however,  the  normal  galaxy  is  overshad¬ 
owed  by  violent  activity.  This  activity  is  produced 
in  the  nucleus,  which  is  then  called  an  active 
galactic  nucleus  (AGN). 

The  luminosities  of  active  galactic  nuclei  may 
be  extremely  large,  sometimes  much  larger  than 
that  of  the  rest  of  the  galaxy.  It  seems  unlikely 
that  a  galaxy  could  maintain  such  a  large  power 
output  for  long.  For  this  reason  it  is  thought  that 
active  galaxies  do  not  form  a  separate  class  of 
galaxies,  but  rather  represent  a  passing  stage  in 
the  evolution  of  normal  galaxies. 

Activity  appears  in  many  different  forms. 
Some  galaxies  have  an  exceptionally  bright  nu¬ 
cleus  similar  to  a  large  region  of  ionised  hydro¬ 
gen.  These  may  be  young  galaxies,  where  near 
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Fig.  19.25  A  brightening  of  the  cosmic  background  ra¬ 
diation  caused  by  the  Sunyayev-Zel’dovits  effect.  The  ra¬ 
dio  observation  in  the  millimetre  band  were  made  with 
the  South  pole  telescope.  The  amplitude  of  the  brightening 
tells  that  the  temperature  of  the  gas  cloud  is  1.2  x  108  K 
and  the  total  mass  of  the  galaxy  cluster  about  1015  solar 
masses.  Optical  images  of  the  same  location  show  a  dis¬ 
tant  galaxy  cluster  SPT-CL  J2 106-5844,  which  has  a  red- 
shift  z  =  1.13.  (Foley,  Andersson,  Bazin,  de  Haan,  Ruel  et 
al.,  2011,  ApJ  731,  p.  89) 

the  centre  large  numbers  of  stars  are  forming  and 
evolving  into  supemovae  (starburst  nuclei).  In 
other  nuclei  the  radiation  cannot  have  been  pro¬ 
duced  by  stars,  and  the  most  plausible  source  of 
energy  in  these  nuclei  is  the  gravitational  energy 
of  a  supermassive  black  hole  (mass  >  108  Mq). 
In  some  galaxies,  the  spectral  lines  are  unusually 
broad,  indicating  large  internal  velocities.  These 
may  be  either  rotational  velocities  near  a  black 
hole  or  due  to  explosive  events  in  the  nucleus. 
In  some  galaxies,  jets  are  seen  coming  out  of  the 
nucleus.  Many  active  galaxies  radiate  a  nonther- 
mal  spectrum,  apparently  synchrotron  radiation 
produced  by  fast  electrons  in  a  magnetic  field. 

The  classification  of  active  galaxies  has  been 
developed  rather  unsystematically,  since  many 
of  them  have  been  discovered  only  recently,  and 
have  not  been  completely  studied.  For  example, 
the  Markarian  galaxies  catalogued  by  Benyamin 
Yerishevich  Markarian  in  the  early  1970’s  are 


defined  by  strong  ultraviolet  emission.  Many 
Markarian  galaxies  are  Seyfert  galaxies;  others 
are  galaxies  undergoing  a  burst  of  star  formation. 
The  N  galaxies  form  another  class  closely  similar 
to  the  Seyfert  galaxies. 

Two  natural  basic  classes  of  active  galaxies  are 
the  Seyfert  galaxies  and  the  radio  galaxies.  The 
former  are  spirals;  the  latter  are  ellipticals.  Some 
astronomers  think  that  the  Seyfert  galaxies  repre¬ 
sent  the  active  stage  of  normal  spiral  galaxies  and 
the  radio  galaxies  that  of  ellipticals. 

Seyfert  Galaxies  The  Seyfert  galaxies  are 
named  after  Carl  Seyfert ,  who  discovered  them 
in  1943.  Their  most  important  characteristics  are 
a  bright,  pointlike  central  nucleus  and  a  spectrum 
showing  broad  emission  lines.  The  continuous 
spectrum  has  a  nonthermal  component,  which  is 
most  prominent  in  the  ultraviolet.  The  emission 
lines  are  thought  to  be  produced  in  gas  clouds 
moving  close  to  the  nucleus  with  large  velocities. 

On  the  basis  of  the  spectrum,  Seyfert  galaxies 
are  classified  as  type  1  or  2.  In  a  type  1  spec¬ 
trum,  the  allowed  lines  are  broad  (correspond¬ 
ing  to  a  velocity  of  104  kms-1),  much  broader 
than  the  forbidden  lines.  In  type  2,  all  lines  are 
similar  and  narrower  (<103  kms-1).  Transitions 
between  these  types  and  intermediate  cases  have 
sometimes  been  observed.  The  reason  for  the  dif¬ 
ference  is  thought  to  be  that  the  allowed  lines  are 
formed  in  denser  gas  near  the  nucleus,  and  the 
forbidden  lines  in  more  diffuse  gas  further  out.  In 
type  2  Seyfert  galaxies,  the  denser  gas  is  missing 
or  obscured. 

Almost  all  Seyfert  galaxies  with  known  Hub¬ 
ble  types  are  spirals;  the  possible  exceptions 
are  of  type  2.  They  are  strong  infrared  sources. 
Type  1  galaxies  often  show  strong  X-ray  emis¬ 
sion. 

The  true  Seyfert  galaxies  are  relatively  weak 
radio  sources.  However,  there  are  compact  ra¬ 
dio  galaxies  with  an  optical  spectrum  that  is  es¬ 
sentially  the  same  as  for  Seyfert  galaxies.  These 
should  probably  be  classified  with  the  Seyfert 
galaxies.  In  general,  the  stronger  radio  emission 
seems  to  come  with  a  type  2  spectrum. 

It  is  estimated  that  about  1  %  of  all  bright  spi¬ 
ral  galaxies  are  Seyfert  galaxies.  The  luminosi¬ 
ties  of  their  nuclei  are  about  1036-1039  W,  of  the 


1 9.7  Active  Galaxies  and  Quasars 


411 


same  order  as  all  the  rest  of  the  galaxy.  Brightness 
variations  are  common. 

Radio  Galaxies  By  definition,  radio  galaxies 
are  galaxies  that  are  powerful  radio  sources.  The 
radio  emission  of  a  radio  galaxy  is  non-thermal 
synchrotron  radiation.  The  radio  luminosity  of  ra¬ 
dio  galaxies  is  typically  1033-1038  W,  and  may 
thus  be  as  large  as  the  total  luminosity  of  a  nor¬ 
mal  galaxy.  The  main  problem  in  explaining  ra¬ 
dio  emission  is  to  understand  how  the  electrons 
and  magnetic  fields  are  produced,  and  above  all, 
where  the  electrons  get  their  energy. 

The  forms  and  sizes  of  the  radio  emitting  re¬ 
gions  of  radio  galaxies  have  been  studied  ever 
since  the  1950’s,  when  radio  interferometers 
achieved  the  resolution  of  optical  telescopes.  The 
characteristic  feature  of  a  strong  radio  galaxy  is 
a  double  structure:  there  are  two  large  radio  emit¬ 
ting  regions  on  opposite  sides  of  the  observed 
galaxy.  The  radio  emitting  regions  of  some  ra¬ 
dio  galaxies  are  as  far  apart  as  6  Mpc,  almost  ten 
times  the  distance  between  the  Milky  Way  and 
Andromeda  galaxies.  One  of  the  smallest  dou¬ 
ble  radio  sources  is  the  galaxy  M87  (Fig.  19.26), 
whose  two  components  are  only  a  few  kpc  distant 
from  each  other. 

The  double  structure  of  radio  galaxies  appears 
to  be  produced  by  ejections  from  the  nucleus. 
However,  the  electrons  in  the  radio  lobes  can¬ 
not  be  coming  from  the  centre  of  the  galaxy,  be¬ 
cause  they  would  lose  all  their  energy  during  such 
a  long  transit.  Therefore  electrons  have  to  be  con¬ 
tinuously  accelerated  within  the  radio-emitting 
regions.  Within  the  radio  lobes  there  are  almost 
point-like  regions,  hot  spots.  These  are  generally 
symmetrically  placed  with  respect  to  the  nucleus, 
and  are  apparently  consequences  of  nuclear  ejec¬ 
tions. 

“Tailed”  radio  sources  also  exist.  Their  ra¬ 
dio  emission  mainly  comes  from  one  side  of  the 
galaxy,  forming  a  curved  tail,  which  is  often  tens 
of  times  longer  than  the  diameter  of  the  galaxy. 
The  best  examples  are  NGC  1265  in  the  Perseus 
cluster  of  galaxies  and  3C129,  which  appears 
to  be  in  an  elliptical  orbit  around  a  companion 
galaxy.  The  tail  is  interpreted  as  the  trail  left  by 
the  radio  galaxy  in  intergalactic  space. 


Another  special  feature  revealed  by  the  radio 
maps  is  the  presence  of  jets ,  narrow  lines  of  ra¬ 
dio  emission,  usually  starting  in  the  nucleus  and 
stretching  far  outside  the  galaxy.  The  best  known 
may  be  the  M87  jet,  which  has  also  been  observed 
as  an  optical  and  X-ray  jet.  The  optically  ob¬ 
served  jet  is  surrounded  by  a  radio  source.  A  simi¬ 
lar  radio  source  is  seen  on  the  opposite  side  of  the 
nucleus,  where  no  optical  jet  is  seen.  Our  near¬ 
est  radio  galaxy  Centaurus  A  also  has  a  jet  ex¬ 
tending  from  the  nucleus  to  near  the  edge  of  the 
galaxy. 

VLB  I  observations  of  radio  jets  have  also  re¬ 
vealed  superluminal  motions :  in  many  compact 
sources  the  components  appear  to  be  separating 
faster  than  the  speed  of  light.  Since  such  veloci¬ 
ties  are  impossible  according  to  the  theory  of  rel¬ 
ativity,  the  observed  velocities  can  only  be  appar¬ 
ent,  and  several  models  have  been  proposed  to  ac¬ 
count  for  them. 

Quasars  The  first  quasar  was  discovered  in 
1963,  when  Maarten  Schmidt  interpreted  the  op¬ 
tical  emission  lines  of  the  known  radio  source 
3C273  as  hydrogen  B  aimer  lines  redshifted  by 
16  %.  Such  large  redshifts  are  the  most  re¬ 
markable  characteristics  of  the  quasars.  Properly 
speaking,  the  word  quasar  is  an  abbreviation  for 
quasistellar  radio  source,  and  some  astronomers 
prefer  to  use  the  designation  QSO  (quasistellar 
object),  since  not  all  quasars  emit  radio  radia¬ 
tion. 

Optically  the  quasars  appear  almost  as  point 
sources,  although  improved  observational  tech¬ 
niques  have  revealed  an  increasing  number  of 
quasars  located  inside  more  or  less  normal  galax¬ 
ies  (Fig.  19.27).  Although  the  first  quasars  where 
discovered  by  radio  observations,  only  a  small 
fraction  of  all  optically  identified  quasars  are 
bright  radio  sources.  Most  radio  quasars  are  point 
sources,  but  some  have  a  double  structure  like  the 
radio  galaxies.  Satellite  X-ray  pictures  also  show 
the  quasars  to  be  pointlike. 

In  the  visible  region  the  quasar  spectra  are 
dominated  by  spectral  lines  with  rest  wavelengths 
in  the  ultraviolet.  The  first  observed  quasar  red- 
shifts  were  z  =  0.16  and  0.37,  and  later  searches 
have  continued  to  turn  up  ever  larger  redshifts. 
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Fig.  19.26  Above :  The  active  galaxy  M87.  In  the  lower 
right-hand  comer  a  short  exposure  of  the  core  region  has 
been  inserted  (same  scale  as  in  the  main  photograph). 
One  sees  a  blue  jet  coming  out  of  the  nucleus  of  a  nor¬ 
mal  EO  galaxy  (NOAO/Kitt  Peak  National  Observatory). 

The  present  record  is  8.7,  but  three  objects  have 
redshift  10.  If  these  are  quasars  their  light  must 
have  been  emitted  when  the  age  of  the  universe 
was  only  500  million  years.  The  large  inferred 
distances  of  the  quasars  mean  that  their  luminosi¬ 
ties  have  to  be  extremely  large.  Typical  values  lie 
in  the  range  of  1038— 1041  W.  The  brightness  of 


Below.  In  the  radio  map  made  using  the  VLA  the  jet 
is  observed  to  be  two-sided.  The  area  shown  is  much 
smaller  than  in  the  upper  picture.  (Owen,  F.N.,  Hardee, 
P.E.,  Bignell,  R.C.  (1980):  Astrophys.  J.  (Lett.)  239,  Lll) 


quasars  may  vary  rapidly,  within  a  few  days  or 
less.  Thus  the  emitting  region  can  be  no  larger 
than  a  few  light-days,  i.e.  about  100  au. 

The  quasars  often  have  both  emission  and  ab¬ 
sorption  lines  in  their  spectra.  The  emission  lines 
are  very  broad  and  are  probably  produced  in 
the  quasar  itself.  Much  of  the  absorption  spec- 


1 9.7  Active  Galaxies  and  Quasars 


413 


Fig.  19.27  One  of  the  nearest  quasars,  3C  273,  pho¬ 
tographed  with  two  cameras  aboard  the  Hubble  Space 
Telescope.  On  the  left ,  the  Wide  Field  Planetary  Camera 
sees  a  bright  point-like  source,  with  a  jet  blasted  out  from 
the  quasar  (towards  5  o’clock).  On  the  right ,  a  corona- 


graph  in  the  Advanced  Camera  for  Surveys  blocks  out  the 
brightest  parts  of  the  quasar.  Spiral  arms  in  the  host  galaxy 
can  be  seen,  with  dark  dust  lanes,  as  well  as  new  details  in 
the  path  of  the  jet.  (Photos  Hubble/NASA/ESA) 


Fig.  19.28  A  section  of  the  spectrum  of  the  quasar 
Q 1422+2309  observed  with  the  Keck  10  metre  telescope. 
The  Lyman  a  line  of  the  quasar  is  near  the  right  edge.  Its 
redshift  is  3.62.  To  the  left  of  the  line  is  a  Lyman  a  for- 

trum  consists  of  densely  distributed  narrow  lines 
that  are  thought  to  be  hydrogen  Lyman  a  lines 
formed  in  gas  clouds  along  the  line  of  sight  to 
the  quasar.  The  clouds  producing  this  “Lyman 
a  forest”  (Fig.  19.28)  are  young  galaxies  or  pro¬ 
togalaxies,  and  they  therefore  provide  important 
evidence  about  the  formation  of  galaxies.  Lyman 
of  forests  can  be  used  to  map  intergalactic  gas 
clouds  up  to  redshift  z  =  6  (Fig.  19.29).  Fur¬ 


est,  where  each  absorption  line  corresponds  to  a  hydrogen 
cloud  that  the  radiation  had  to  penetrate.  Wavelengths  are 
given  in  angstroms.  (Kuva  Womble,  Sargent,  Lyons) 


ther  out  Lyman  a  absorption  lines  cannot  be  seen 
since  most  of  the  hydrogen  is  neutral. 

Unified  Models  Although  the  forms  of  galac¬ 
tic  activity  may  at  first  sight  appear  diverse,  they 
can  be  unified  within  a  fairly  widely  accepted 
schematic  model  in  such  a  way  that  the  properties 
of  each  individual  object  depend  on  a  few  param¬ 
eters  only. 
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According  to  this  model,  most  galaxies  con¬ 
tain  a  compact  central  nucleus,  which  is  a  super- 
massive  black  hole,  with  mass  107-1010  M0,  sur¬ 
rounded  by  a  disk  or  ring  of  gas.  The  source  of 
energy  is  the  gravitational  energy  released  as  gas 
is  accreted  into  the  black  hole.  The  disk  may  also 
give  rise  to  a  jet,  where  some  of  the  energy  is  con¬ 
verted  into  perpendicular  motions  along  the  rota¬ 
tional  axis.  Thus  active  galactic  nuclei  are  simi¬ 
lar  to  the  nucleus  of  the  Milky  Way,  although  the 
masses  of  both  the  black  hole  and  the  gas  disk 
may  be  much  larger. 

Deducing  the  mass  of  the  central  black  hole 
is  difficult  and  uncertain.  However,  using  a  vari¬ 
ety  of  methods  involving  the  motions  of  stars  and 
gas  at  the  centre  of  nearby  galaxies  black  hole 
masses  for  about  30  galaxies  have  been  deter¬ 
mined.  The  most  important  result  of  these  studies 
is  that  there  is  a  close  relationship  between  the 
black  hole  mass  and  the  central  velocity  disper¬ 
sion  of  the  galaxy.  According  to  the  virial  theo¬ 
rem  the  velocity  dispersion  is  a  measure  of  the 
bulge  mass,  and  therefore  the  conclusion  is  that 
there  is  a  close  relationship  between  the  mass  of 
the  bulge  and  the  mass  of  the  central  black  hole. 

The  first  characteristic  parameter  of  the  uni¬ 
fied  model  is  obviously  the  total  luminosity.  For 


Fig.  19.29  Simulation  of  hydrogen  distribution  in  space. 
The  light  emitted  by  a  quasar  goes  through  different  gas 
clouds  creating  a  Lyman  a  forest.  (Renyue  Cen,  Princeton 
University) 


example,  the  only  essential  difference  between 
Seyfert  1  galaxies  and  radio-quiet  quasars  is  the 
larger  luminosity  of  quasars.  Another  basic  pa¬ 
rameter  is  the  radio  brightness,  which  may  be  re¬ 
lated  to  the  strength  of  a  jet.  On  the  basis  of  their 
radio  luminosity  one  can  connect  Seyfert  galaxies 
and  radio-quiet  quasars  on  one  hand,  and  radio 
galaxies  and  radio  quasars  on  the  other. 

The  third  important  parameter  of  unified  mod¬ 
els  is  the  angle  from  which  we  happen  to  view 
the  nuclear  disk.  For  example,  if  the  disk  is  seen 
edge-on,  the  actual  nucleus  is  obscured  by  the 
disk.  This  could  explain  the  difference  between 
Seyfert  types  1  and  2:  in  type  2  we  do  not  see 
the  broad  emission  lines  formed  near  the  black 
hole,  but  only  the  narrower  lines  from  the  disk. 
Similarly  a  galaxy  that  looks  like  a  double  radio 
source  when  seen  edge-on,  would  look  like  a  ra¬ 
dio  quasar  if  the  disk  were  seen  face-on.  In  the 
latter  case  there  is  a  possibility  that  we  may  be 
seeing  an  object  directly  along  the  jet.  It  will  then 
appear  as  a  blazar ,  an  object  with  rapid  and  vio¬ 
lent  variations  in  brightness  and  polarisation,  and 
very  weak  or  invisible  emission  lines.  If  the  jet 
is  almost  relativistic,  its  transverse  velocity  may 
appear  larger  than  the  speed  of  light,  and  thus  su¬ 
perluminal  motions  can  also  be  understood. 

One  prediction  of  the  unified  model  is  that 
there  should  be  a  large  number  of  quasars  where 
the  nucleus  is  obscured  by  the  disk  as  in  the 
Seyfert  2  galaxies.  In  analogy  with  the  Seyfert 
galaxies  these  objects  are  referred  to  as  type  2 
AGN  or  quasars.  Because  of  the  obscuration 
such  sources  would  not  be  included  in  surveys 
at  optical,  UV,  or  soft  X-ray  wavelengths.  In 
hard  X-rays  the  obscuration  is  less,  and  in  the 
far  infrared  the  absorbed  energy  is  re-radiated. 
Searches  for  type  2  quasars  have  been  made  at 
these  wavelengths  with  the  Chandra  X-ray  satel¬ 
lite  and  with  the  Spitzer  Space  Telescope.  The 
indications  from  these  searches  are  that  at  least 
3/4  of  all  supermassive  black  holes  are  heavily 
obscured. 

Gravitational  Lenses  An  interesting  pheno¬ 
menon  first  discovered  in  connection  with  quasars 
are  gravitational  lenses.  Since  light  rays  are  bent 
by  gravitational  fields,  a  mass  (e.g.  a  galaxy) 
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Fig.  19.30  (a)  The 

components  of  the  Einstein 
cross  are  gravitationally 
lensed  images  of  the  same 
quasar  (ESA/NASA). 

(b)  The  massive  galaxy 
cluster  Abell  2218  deflects 
light  rays  passing  through 
it  and  acts  as  a  giant 
gravitational  lens.  In  this 
Hubble  picture  from 
January  2000,  dozens  of 
arc-shaped  images  of 
distant  galaxies  can  be 
seen.  (Photo  A.  Fruchter, 

S.  Baggett,  R.  Hook  and 
Z.  Levay,  NASA/STScI) 


placed  between  a  distant  quasar  and  the  observer 
will  distort  the  image  of  the  quasar.  The  first  ex¬ 
ample  of  this  effect  was  discovered  in  1979,  when 
it  was  found  that  two  quasars,  5.7"  apart  in  the 
sky,  had  essentially  identical  spectra.  It  was  con¬ 
cluded  that  the  “pair”  was  really  a  double  image 
of  a  single  quasar.  Since  then  several  other  grav¬ 
itationally  lensed  quasars  have  been  discovered 
(Fig.  19.30). 


Gravitational  lenses  have  also  been  discov¬ 
ered  in  clusters  of  galaxies.  Here  the  gravitational 
field  of  the  cluster  distorts  the  images  of  dis¬ 
tant  galaxies  into  arcs  around  the  cluster  centre. 
Currently  some  two  hundred  individual  galaxies 
and  twenty  galaxy  clusters  are  known  to  pro¬ 
duce  lensed  images  of  background  galaxies  or 
quasars.  Such  systems  are  called  strong  gravita¬ 
tional  lenses  (Fig.  19.31). 
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Fig.  19.31  A  strong  gravitational  lens.  The  light  of  a  quasar  is  bent  by  the  gravity  of  a  lensing  galaxy.  The  quasar 
appears  as  two  images  on  both  sides  of  the  lensing  galaxy 
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Fig.  19.32  A  weak  gravitational  lens.  Light  rays  coming  from  the  upper  and  lower  edges  of  a  distant  galaxy  are  bent 
by  different  amounts.  The  image  of  the  galaxy  becomes  slightly  flattened 


Recently  emphasis  has  moved  towards  weak 
gravitational  lenses ,  which  are  much  more  com¬ 
mon  (Fig.  19.32).  The  lens  does  not  produce  sep¬ 
arate  images  or  arcs  from  a  background  but  only 
flattens  its  shape.  Since  the  axial  ratio  changes 
only  by  1-2  %,  the  weak  lenses  cannot  be  used 
to  study  shapes  of  individual  galaxies  because 
their  original  shapes  are  not  known.  However,  if 
hundreds  or  thousands  of  galaxies  around  a  mass 
concentration  are  measured,  statistical  analysis 
shows  the  flattening.  In  the  2010’s  weak  lenses 
are  routinely  used  to  observe  the  distribution  of 
the  invisible  matters. 

Figure  19.13  showed  an  example  about  dark 
matter  clouds  around  a  galaxy  cluster. 


1 9.8  The  Origin  and  Evolution  of 
Galaxies 

Because  the  speed  of  light  is  finite  we  see  distant 
galaxies  at  earlier  stages  in  their  life.  In  the  next 
chapter  we  show  how  the  age  of  a  galaxy  with 
given  redshift  can  be  calculated  based  on  the  rate 
of  expansion  of  the  Universe.  However,  this  rela¬ 
tionship  will  depend  on  the  cosmological  model, 


which  therefore  always  has  to  be  specified  when 
studying  the  evolution  of  galaxies. 

Evolution  of  galaxies  began  when  the  first 
stars  were  born  at  redshifts  10-20.  The  universe 
consisted  of  neutral  hydrogen  and  helium  only, 
and  the  temperature  of  the  gas  had  dropped  be¬ 
low  100  kelvins.  The  first  stars  were  very  mas¬ 
sive  and  shortlived.  Radiation  from  the  first  stars, 
galaxies  and  quasars  rapidly  ionised  the  interstel¬ 
lar  and  intergalactic  gas.  This  era  is  called  the 
time  of  ionisation.  Ionisation  was  almost  com¬ 
plete  when  the  redshift  reached  z  =  6,  and  ever 
since  99.9  %  of  the  hydrogen  has  been  ionised. 
Towards  the  end  of  the  reionisation,  at  redshifts 
6-15,  massive  galaxies  (M  >  1010M©)  were  al¬ 
ready  born.  Current  observations  of  galaxies  go 
back  to  redshifts  10-12. 

According  to  the  currently  widely  accepted 
cosmological  models  most  of  the  matter  in  the 
Universe  is  in  a  form  that  emits  no  radiation, 
and  is  only  observable  from  its  gravitational  ef¬ 
fects.  In  this  Cold  Dark  Matter  (CDM)  theory 
(see  Sect.  20.7)  the  first  systems  to  collapse  and 
start  forming  stars  were  small,  with  masses  like 
those  of  dwarf  galaxies.  Larger  galaxies  were 
formed  later  as  these  smaller  fragments  collected 
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into  larger  clumps.  This  model,  where  most  stars 
are  formed  in  small  galaxies  is  usually  described 
as  the  hierarchical  model. 

A  large  part  of  our  theoretical  ideas  on  galaxy 
evolution  is  based  on  numerical  simulations  of 
the  collapse  of  gas  clouds  and  star  formation  in 
them.  Using  some  prescription  for  star  formation 
one  can  try  to  compute  the  evolution  of  the  spec¬ 
tral  energy  distribution  and  the  chemical  abun¬ 
dances  in  the  resulting  galaxies.  The  results  of 
the  models  can  be  compared  with  the  observa¬ 
tional  data  presented  in  the  previous  sections  of 
this  chapter. 

The  density  distribution  of  dark  matter  is  ex¬ 
pected  to  be  very  irregular,  containing  numerous 
small-scale  clumps.  The  collapse  will  therefore 
be  highly  inhomogeneous,  both  in  the  hierarchi¬ 
cal  and  in  the  monolithic  picture,  and  subsequent 
mergers  between  smaller  systems  should  be  com¬ 
mon.  There  are  additional  complicating  factors. 
Gas  may  be  expelled  from  the  galaxy,  or  there 
may  be  an  influx  of  fresh  gas.  Interactions  with 
the  surroundings  may  radically  alter  the  course  of 
evolution — in  dense  systems  they  may  lead  to  the 
complete  merging  of  the  individual  galaxies  into 
one  giant  elliptical.  Much  remains  to  be  learned 
about  how  the  formation  of  stars  is  affected  by 
the  general  dynamical  state  of  the  galaxy  and  of 
how  an  active  nucleus  may  influence  the  forma¬ 
tion  process. 

Our  observational  knowledge  of  galaxy  evo¬ 
lution  is  advancing  very  rapidly.  Essentially  all 
the  relationships  described  earlier  in  this  chap¬ 
ter  have  been  studied  as  functions  of  time.  Still, 
a  complete  generally  accepted  description  of  the 
way  the  Universe  reached  its  present  state  has  not 
yet  been  established.  Here  we  can  only  mention 
a  few  of  the  most  central  aspects  of  the  processes 
leading  to  the  galaxies  we  observe  to-day. 

Density  and  Luminosity  Evolution  The  most 
basic  way  of  studying  the  formation  and  evolu¬ 
tion  of  galaxies  is  by  counting  their  numbers,  ei¬ 
ther  the  number  brighter  than  some  given  magni¬ 
tude  limit  (as  was  already  done  by  Hubble  in  the 
1930’s,  see  Sect.  20.1)  or  else  the  number  den¬ 
sity  as  a  function  of  redshift.  The  counts  can  be 
compared  with  the  numbers  expected  if  there  is 
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Fig.  19  .33  Galaxy  counts  in  the  U,  B,  I,  and  K  wave¬ 
length  bands.  The  counts  are  compared  to  those  in  a  cos¬ 
mological  model  without  evolution.  The  cosmological  pa¬ 
rameters  correspond  to  the  currently  preferred  “concor¬ 
dance”  model  (see  Sect.  19.5).  (H.C.  Ferguson  et  al.,  2000, 
ARAA  38,  667,  Fig.  4) 


Fig.  19.34  The  deepest  image  taken  by  the  Hubble  space 
telescope  and  published  in  2014.  The  total  exposure  time 
is  27  days.  The  original  picture  contains  about  15,000  faint 
galaxies.  (NASA,  ESA,  H.  Teplitz,  M.  Rafelski,  A.  Koeke- 
moer,  R.  Windhorst  and  Z.  Levay) 

no  evolution,  which  depend  on  the  cosmological 
model.  They  can  therefore  be  used  either  as  a  cos- 
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Fig.  1 9.35  Different  early 
forms  of  galaxies  in  the 
redshift  range  z  =  2-4.  Top 
line :  chain  galaxies;  second 
line :  clump-clusters;  third 
line :  double  galaxies; 
fourth  line :  tadpoles;  fifth 
line :  spiral  galaxies  being 
born;  bottom  line :  elliptic 
galaxies.  (Elmegreen, 
Elmegreen,  Rubin, 

Schaffer,  2005,  ApJ  631, 
p.  87) 


mological  test,  or  as  a  test  of  evolutionary  mod¬ 
els.  However,  the  present  situation  is  that  more 
reliable  cosmological  tests  are  available,  and  the 
number  counts  are  mainly  used  to  study  the  evo¬ 
lution  of  galaxies. 

There  are  two  ways  in  which  the  number 
counts  are  affected  by  galaxy  evolution.  In  den¬ 
sity  evolution  the  actual  number  of  galaxies  is 
changing,  whereas  in  luminosity  evolution  only 
the  luminosity  of  individual  galaxies  is  evolv¬ 
ing.  The  simplest  form  of  luminosity  evolution 


is  called  passive  luminosity  evolution ,  and  is  due 
to  the  changing  luminosity  of  stars  during  nor¬ 
mal  stellar  evolution.  Pure  luminosity  evolution 
is  expected  to  be  predominant  in  the  monolithic 
picture.  In  the  hierarchical  picture  density  evolu¬ 
tion  will  be  more  prominent,  since  in  this  picture 
smaller  galaxies  are  to  a  greater  extent  being  de¬ 
stroyed  to  produce  larger  more  luminous  ones. 

Figure  19.33  gives  an  example  of  the  results  of 
number  counts.  A  model  without  evolution  can¬ 
not  explain  these  observed  counts,  and  various 
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models  incorporating  evolutionary  effects  have  to 
be  introduced.  However,  a  unique  model  cannot 
be  determined  using  just  the  number  counts. 

Distant  Galaxies  A  more  direct  approach  to 
galaxy  formation  is  the  direct  search  for  the  most 
distant  objects  visible.  In  Fig.  19.34  we  show  the 
Hubble  Ultra  Deep  Field  (HUDF),  an  area  of  the 
sky  observed  with  the  Hubble  Space  Telescope. 
The  total  exposure  time  through  different  filters 
was  27  days.  The  image  contains  over  15,000 
galaxies.  Over  5000  are  seen  as  dots;  their  red- 
shift  is  aver  6.  The  other  ones,  seen  as  extended 
objects,  give  information  on  galaxy  evolution  at 
redshifts  1-5. 

The  most  important  observation  is  that  with  in¬ 
creasing  redshift  the  Hubble  classification  breaks 
down  and  finally  disappears.  Around  redshift 
z  =  3  the  spiral  galaxies  resembling  the  current 
ones  have  disappeared  almost  completely.  In¬ 
stead,  there  are  different  irregular  and  strange 
galaxies. 

Figure  19.35  shows  examples  of  distant  galax¬ 
ies  in  the  range  z  =  2-4.  Over  10  %  are  “chain 
galaxies”,  about  20  %  “clump  clusters”,  15  % 
“doubles”  and  10  %  “tadpoles”.  About  30  %  are 
evolving  spirals  and  10  %  ellipticals. 

Another  important  observation  is  that  in  the 
past  galaxies  were  smaller  than  nowadays.  Even 
if  the  mass  or  luminosity  of  a  galaxy  were  sim¬ 
ilar  to  a  modern  galaxy  its  diameter  at  z  =  3  is 
only  one  tenth.  The  most  probable  explanation  is 
that  the  outer  parts  grow  in  several  small  mergers. 
Collision  of  a  dwarf  galaxy  does  not  considerably 
increase  the  mass  of  the  great  galaxy  but  brings 
more  stars  to  its  outskirts  increasing  its  diameter. 
The  observation  clearly  indicates  how  important 
mutual  collisions  have  been  in  the  evolution  of 
galaxies  towards  their  current  forms. 

Evolution  of  AGN  The  first  clear  indication  of 
cosmic  evolution  was  in  the  numbers  of  radio 
galaxies  and  quasars.  Already  in  the  late  1960’s 
it  was  becoming  clear  that  the  density  of  quasars 
was  increasing  dramatically  towards  higher  red- 
shifts  (Fig.  19.36).  Roughly,  the  number  density 
of  quasars  increases  relative  to  the  present  den¬ 
sity  by  a  factor  100  out  to  a  broad  maximum  at 
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Fig.  1 9.36  Quasar  activity  at  different  redshifts  accord¬ 
ing  to  X-ray  and  infrared  observations  (gray  areas  and  the 
gray  line).  The  black  line  corresponds  to  the  stellar  birth 
rate  in  Fig.  19.37.  (Madau  ja  Dickinson,  2014,  Ann.  Rev. 
Astron.  Astrophys.  52) 
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redshift 

Fig.  19.37  Stellar  birth  rates  at  different  redshifts.  Dif¬ 
ferent  symbols  correspond  to  values  obtained  in  different 
studies  and  their  standard  errors.  (Madau  and  Dickinson, 
2014,  Ann.  Rev.  Astron.  Astrophys.  52) 

redshift  about  2.  The  observed  behaviour  may  be 
due  to  either  density  or  luminosity  evolution.  The 
density  of  radio  galaxies  also  has  a  maximum 
at  redshifts  about  1.5-3,  which  is  sometimes  re¬ 
ferred  to  as  the  quasar  era. 

The  Star  Formation  History  of  the  Universe 

Since  the  Universe  contained  only  neutral  gas  as 
the  first  stars  began  to  form,  the  most  general  de¬ 
scription  of  how  galaxies  came  to  be  is  in  terms 
of  the  rate  at  which  the  gas  is  being  turned  into 
stars.  The  star  formation  history  going  back  to 
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a  redshift  about  8  is  shown  in  Fig.  19.37.  The  star 
formation  rate  was  about  an  order  of  magnitude 
larger  than  its  current  value  at  redshift  1-2.  For 
even  larger  redshifts  it  seems  to  have  remained 
nearly  constant  or  decreased  slowly. 


1 9.9  Exercises 

Exercise  19.1  The  galaxy  NGC  772  is  an  Sb  spi¬ 
ral,  similar  to  M3 1 .  Its  angular  diameter  is  1'  and 
apparent  magnitude  12.0.  The  corresponding  val¬ 
ues  of  M31  are  3.0°  and  5.0.  Find  the  ratio  of  the 
distances  of  the  galaxies 


(a)  assuming  their  sizes  are  equal, 

(b)  assuming  they  are  equally  bright. 

Exercise  19.2  The  brightness  of  the  quasar 
3C279  has  shown  changes  with  the  time  scale  of 
one  week.  Estimate  the  size  of  the  region  produc¬ 
ing  the  radiation.  The  apparent  magnitude  is  18. 
If  the  distance  of  the  quasar  is  2000  Mpc,  what 
is  its  absolute  magnitude  and  luminosity?  How 
much  energy  is  produced  per  au3  ? 


Cosmology 
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Cosmology  is  a  science  studying  the  structure  and 
evolution  of  the  whole  Universe.  Research  in  cos¬ 
mology  tries  to  answer  questions  such  as:  How 
large  and  how  old  is  the  Universe?  How  was  it 
born?  How  is  matter  distributed?  How  were  the 
elements  formed?  What  will  be  the  future  of  the 
Universe? 

After  the  demise  of  the  Aristotelian  world  pic¬ 
ture,  it  took  hundreds  of  years  of  astronomical  ob¬ 
servations  and  physical  theories  to  reach  a  level  at 
which  a  satisfactory  modern  scientific  picture  of 
the  physical  universe  could  be  formed.  The  de¬ 
cisive  steps  in  the  development  were  the  clarifi¬ 
cation  of  the  nature  of  the  galaxies  in  the  1920’s 
and  the  general  theory  of  relativity,  which  gives  a 
basis  for  theoretical  research  of  the  universe,  de¬ 
veloped  by  Einstein  in  the  1910’s.  Yet,  just  a  few 
decades  ago  there  was  a  lot  of  room  for  armwaiv- 
ing  and  all  kinds  of  speculations.  Only  recently 
the  observations  made  with  satellites  have  given 
reasonably  precise  values  for  some  parameters 
and  some  alternative  models  could  be  excluded. 
The  central  tenet  of  modern  cosmology  is  the 
model  of  the  expanding  universe,  getting  all  the 
time  more  supporting  observational  evidence. 


20.1  Cosmological  Observations 

The  Olbers  Paradox  The  simplest  cosmolog¬ 
ical  observation  may  be  that  the  sky  is  dark  at 
night.  This  fact  was  first  noted  by  Johannes  Ke¬ 
pler ,  who,  in  1610,  used  it  as  evidence  for  a  finite 
universe.  As  the  idea  of  an  infinite  space  filled 


with  stars  like  the  Sun  became  widespread  in  con¬ 
sequence  of  the  Copernican  revolution,  the  ques¬ 
tion  of  the  dark  night  sky  remained  a  problem.  In 
the  18th  and  19th  centuries  Edmond  Halley ,  Loys 
de  Cheseaux  and  Heinrich  Olbers  considered  it  in 
their  writings.  It  has  become  known  as  the  Olbers 
paradox  (Fig.  20.1). 

The  paradox  is  the  following:  Let  us  suppose 
the  Universe  is  infinite  and  that  the  stars  are  uni¬ 
formly  distributed  in  space.  No  matter  in  what  di¬ 
rection  one  looks,  sooner  or  later  the  line  of  sight 
will  encounter  the  surface  of  a  star.  Since  the  sur¬ 
face  brightness  does  not  depend  on  distance,  each 
point  in  the  sky  should  appear  to  be  as  bright  as 
the  surface  of  the  Sun.  This  clearly  is  not  true. 
The  modern  explanation  of  the  paradox  is  that  the 
stars  have  only  existed  for  a  finite  time,  so  that  the 
light  from  very  distant  stars  has  not  yet  reached 
us.  Rather  than  proving  the  world  to  be  finite  in 
space,  the  Olbers  paradox  has  shown  it  to  be  of 
a  finite  age. 

Extragalactic  Space  In  1923  Edwin  Hubble 
showed  that  the  Andromeda  Galaxy  M31  was 
far  outside  the  Milky  Way,  thus  settling  a  long¬ 
standing  controversy  concerning  the  relationship 
between  the  nebulae  and  the  Milky  Way.  The 
numerous  galaxies  seen  in  photographs  form 
an  extragalactic  space  vastly  larger  than  the  di¬ 
mensions  of  the  Milky  Way.  It  is  important  for 
cosmology  that  the  distribution  and  motions  of 
the  basic  components  of  extragalactic  space,  the 
galaxies  and  clusters  of  galaxies,  should  every¬ 
where  be  the  same  as  in  our  local  part  of  the 
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Fig.  20.1  The  Olbers  paradox.  If  the  stars  were  uniformly 
distributed  in  an  unending,  unchanging  space,  the  sky 
should  be  as  bright  as  the  surface  of  the  Sun,  since  each 
line  of  sight  would  eventually  meet  the  surface  of  a  star. 

Universe.  Galaxies  generally  occur  in  various 
systems,  ranging  from  small  groups  to  clusters 
of  galaxies  and  even  larger  superclusters.  The 
largest  structures  observed  are  about  100  Mpc  in 
size  (see  Sect.  19.6).  They  are  thus  significantly 
smaller  than  the  volume  of  space  ( a  few  thousand 
Mpc  in  size)  in  which  the  distribution  of  galaxies 
has  been  investigated.  One  way  of  studying  the 
large-scale  homogeneity  of  the  galaxy  distribu¬ 
tion  is  to  count  the  number  of  galaxies  brighter 
than  some  limiting  magnitude  m.  If  the  galax¬ 
ies  are  uniformly  distributed  in  space,  this  num¬ 
ber  should  be  proportional  to  10°-6m  (see  Exam¬ 
ple  18.1).  For  example,  the  galaxy  counts  made 
by  Hubble  in  1934,  which  included  44,000  galax¬ 
ies,  were  consistent  with  a  galaxy  distribution  in¬ 
dependent  of  position  (homogeneity)  and  of  di¬ 
rection  (isotropy).  Hubble  found  no  “edge”  of  the 
Universe,  nor  have  later  galaxy  counts  found  one. 

Similar  counts  have  been  made  for  extragalac- 
tic  radio  sources.  (Instead  of  magnitudes,  flux 
densities  are  used.  If  F  is  the  flux  density,  then 
because  m  =  — 2.51g(F/Fo),  the  number  count 
will  be  proportional  to  F~3/2.)  These  counts 
mainly  involve  very  distant  radio  galaxies  and 
quasars  (Fig.  20.2).  The  results  seem  to  indicate 
that  the  radio  sources  were  either  much  brighter 
or  much  more  common  at  earlier  epochs  than  at 
present  (Sect.  19.8).  This  constitutes  evidence  in 
favour  of  an  evolving,  expanding  universe. 

In  general  the  simple  geometric  relation  be¬ 
tween  brightness  and  number  counts  will  only 


A  two-dimensional  analogy  can  be  found  in  an  optically 
thick  pine  forest  where  the  line  of  sight  meets  a  trunk 
wherever  one  looks.  (Photo  M.  Poutanen  and  H.  Kart- 
tunen) 

hold  for  objects  that  are  uniformly  distributed  in 
space.  Focal  inhomogeneities  will  cause  depar¬ 
tures  from  the  expected  relationship.  For  more 
distant  sources  the  geometry  of  the  Universe  as 
well  as  cosmic  evolution  will  change  the  basic 
10° -6m  behaviour. 

Hubble’s  Law  (See  Fig.  20.3.)  In  the  late 
1920’s,  Hubble  discovered  that  the  spectral  lines 
of  galaxies  were  shifted  towards  the  red  by  an 
amount  proportional  to  their  distances.  If  the  red- 
shift  is  due  to  the  Doppler  effect,  this  means  that 
the  galaxies  move  away  from  each  other  with  ve¬ 
locities  proportional  to  their  separations,  i.e.  that 
the  Universe  is  expanding  as  a  whole. 

In  terms  of  the  redshift  z  =  (A  —  Ao)  Ao»  Hub¬ 
ble’s  law  can  be  written  as 

z  =  (H/c)r ,  (20.1) 

where  c  is  the  speed  of  light,  H  is  the  Hub¬ 
ble  constant  and  r  the  distance  of  the  galaxy. 
For  small  velocities  (V  c)  the  Doppler  redshift 
z  =  V/c ,  and  hence 

V  =  Hr ,  (20.2) 

which  is  the  most  commonly  used  form  of  Hub¬ 
ble’s  law. 

For  a  set  of  observed  “standard  candles”,  i.e. 
galaxies  whose  absolute  magnitudes  are  close 
to  some  mean  Mq,  Hubble’s  law  corresponds  to 
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Fig.  20.2  The  quasar 
3C295  and  its  spectrum. 
The  quasars  are  among  the 
most  distant  cosmological 
objects.  (Photograph 
Palomar  Observatory) 


a  linear  relationship  between  the  apparent  mag¬ 
nitude  m  and  the  logarithm  of  the  redshift,  1  gz. 
This  is  because  a  galaxy  at  distance  r  has  an  ap¬ 
parent  magnitude  m  =  Mo  +  51g(r/10  pc),  and 
hence  Hubble’s  law  yields 

m  =  M0  +  51g(  — )  =  5  lg  z  +  C , 

\H  x  10  pc/ 

(20.3) 

where  the  constant  C  depends  on  H  and  Mo. 
Suitable  standard  candles  are  e.g.  the  brightest 
galaxies  in  clusters  and  Sc  galaxies  of  a  known 
luminosity  class.  Some  other  methods  of  dis¬ 
tance  determination  for  galaxies  were  discussed 
in  Sect.  19.2.  Most  recently,  type  la  supernovae 
(Sect.  14.4)  in  distant  galaxies  have  been  used  to 
determine  distances  out  to  the  redshift  z  =  1.7, 


where  departures  from  Hubble’s  law  are  already 
detectable. 

If  the  Universe  is  expanding,  the  galaxies  were 
once  much  nearer  to  each  other.  If  the  rate  of  ex¬ 
pansion  had  been  unchanging,  the  inverse  of  the 
Hubble  constant,  T  =  H~  ,  would  represent  the 
age  of  the  Universe.  If  the  expansion  is  gradually 
slowing  down,  the  inverse  Hubble  constant  gives 
an  upper  limit  on  the  age  of  the  Universe  In  fact, 
current  indications  (discussed  later  in  this  chap¬ 
ter)  are  that  the  rate  of  expansion  is  accelerating 
at  present.  In  that  case  the  age  of  the  Universe 
may  also  be  larger.  However,  //_1  will  still  be  an 
estimate  for  the  age  of  the  Universe. 

One  reason  for  the  difficulty  in  determining 
the  value  of  the  Hubble  constant  is  the  uncer- 
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Fig.  20.3  Hubble’s  law 
for  type  la  supemovae.  The 
solid  curve  represents  the 
“concordance”  model,  with 
basic  parameters  having 
the  values  mentioned  in 
Sect.  20.7.  Models  with  a 
zero  cosmological  constant 
cannot  explain  the 
observations.  (D.  Rubin  et 
al.,  2013,  ApJ  763,  35) 
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tainty  in  extragalactic  distances.  A  second  prob¬ 
lem  is  that  the  measured  values  of  the  velocity  V, 
corrected  to  take  into  account  the  motion  of  the 
Sun  within  the  Local  Group,  contain  a  signifi¬ 
cant  component  due  to  the  peculiar  motions  of  the 
galaxies.  These  peculiar  velocities  are  caused  by 
local  mass  concentrations  like  groups  and  clus¬ 
ters  of  galaxies.  It  is  possible  that  the  Local  Group 
has  a  significant  velocity  towards  the  centre  of  the 
Local  Supercluster  (the  Virgo  Cluster).  Because 
the  Virgo  Cluster  is  often  used  to  determine  the 
value  of  H,  neglecting  this  peculiar  velocity  leads 
to  a  large  error  in  H.  The  size  of  the  peculiar  ve¬ 
locity  is  not  yet  well  known,  but  it  is  probably 
about  250  kms-1. 

The  most  ambitious  recent  project  for  deter¬ 
mining  H  used  the  Hubble  Space  Telescope  in 
order  to  measure  cepheid  distances  to  a  set  of 
nearby  galaxies.  These  distances  were  then  used 
to  calibrate  other  distance  indicators,  such  as  the 
Tully-Fisher  relation  and  type  la  supernovae.  The 
final  result  was  H  =  (72  d=  8)  kms-1  Mpc-1. 
The  largest  remaining  source  of  error  in  this  result 
is  the  distance  to  the  Large  Magellanic  Cloud, 
used  for  calibrating  the  cepheid  luminosity. 

Nowadays  the  most  accurate  value  for  the 
Hubble  constant  is  obtained  by  studying  the 
angular  power  spectrum  of  the  background 
radiation  (Sect.  20.7).  According  to  the  mea- 


Fig.  20.4  It  is  nowadays  assumed  that  at  the  beginning 
the  expansion  rate  of  the  Universe  was  slowing  down  un¬ 
til  about  5  billion  years  ago  the  deceleration  changed  to 
acceleration.  The  age  of  the  Universe  calculated  from  the 
current  value  of  the  Hubble  constant  is  bigger  than  the  ac¬ 
tual  age 

surements  of  the  Planck  satellite  H  =  (67.7  ± 
0.5)  km  s-1  Mpc-1  and  the  age  of  the  Universe  is 
13.80  ±0.02  Ga. 

The  form  of  Hubble’s  law  might  give  the  im¬ 
pression  that  the  Milky  Way  is  the  centre  of 
the  expansion,  in  apparent  contradiction  with  the 
Copernican  principle.  Figure  20.5  shows  that,  in 
fact,  the  same  Hubble’s  law  is  valid  at  each  point 
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Fig.  20.5  A  regular  expansion  according  with  Hubble’s 
law  does  not  mean  that  the  Milky  Way  (O)  is  the  centre  of 
the  Universe.  Observers  at  any  other  galaxy  ( O' )  will  see 
the  same  Hubble  flow  (« dashed  lines ) 

in  a  regularly  expanding  universe.  There  is  no 
particular  centre  of  expansion. 

The  Thermal  Microwave  Background  Radia¬ 
tion  The  most  important  cosmological  discov¬ 
ery  since  Hubble’s  law  was  made  in  1965.  In  that 
year  Arno  Penzias  and  Robert  Wilson  discovered 
that  there  is  a  universal  microwave  radiation,  with 
a  spectrum  corresponding  to  that  of  blackbody  ra¬ 
diation  (see  Sect.  5.6)  at  a  temperature  of  about 
3  K  (Fig.  20.7).  For  their  discovery,  they  received 
the  Nobel  prize  in  physics  in  1979. 

The  existence  of  a  thermal  cosmic  radiation 
background  had  been  predicted  in  the  late  1940’s 
by  George  Gamow ,  who  was  one  of  the  first 
to  study  the  initial  phases  of  expansion  of  the 
Universe.  According  to  Gamow,  the  Universe  at 
that  time  was  filled  with  extremely  hot  radia¬ 
tion.  As  it  expanded,  the  radiation  cooled,  until  at 
present,  its  temperature  would  be  a  few  kelvins. 
After  its  discovery  by  Penzias  and  Wilson,  the 
cosmic  background  radiation  has  been  studied  at 
wavelengths  from  50  cm  to  0.5  cm.  The  first  de¬ 
tailed  measurements,  made  from  the  COBE  (Cos¬ 
mic  Background  Explorer)  satellite  showed  that 
it  corresponds  closely  to  a  Planck  spectrum  at 
(2.725  db  0.002)  K  (Fig.  20.6).  More  recently  the 
CMB  has  been  mapped  in  even  greater  detail  by 
the  WMAP  satellite. 

The  existence  of  the  thermal  cosmic  mi¬ 
crowave  background  (CMB)  gives  strong  support 
to  the  belief  that  the  Universe  was  extremely  hot 
in  its  early  stages.  The  background  is  very  nearly 
isotropic,  which  supports  the  isotropic  and  ho¬ 
mogeneous  models  of  the  Universe.  The  COBE 


Fig.  20.6  Observations  of  the  cosmic  microwave  back¬ 
ground  radiation  made  by  the  COBE  satellite  in  1990  are 
in  agreement  with  a  blackbody  law  at  2.7  K 

and  WMAP  satellites  have  also  detected  temper¬ 
ature  variations  of  a  relative  amplitude  6  x  10“ 6 
in  the  background.  These  fluctuations  are  inter¬ 
preted  as  a  gravitational  redshift  of  the  back¬ 
ground  produced  by  the  mass  concentrations  that 
would  later  give  rise  to  the  observed  structures 
in  the  Universe.  They  are  the  direct  traces  of  ini¬ 
tial  irregularities  in  the  big  bang,  and  provide  im¬ 
portant  constraints  for  theories  of  galaxy  forma¬ 
tion.  Perhaps  even  more  importantly,  the  ampli¬ 
tude  of  the  fluctuations  on  different  angular  scales 
have  provided  crucial  constraints  on  the  cosmo¬ 
logical  model.  We  shall  return  to  this  question  in 
Sect.  20.7. 

The  Isotropy  of  Matter  and  Radiation  Apart 
from  the  CMB,  several  other  phenomena  con¬ 
firm  the  isotropy  of  the  Universe.  The  distribu¬ 
tion  of  radio  sources,  the  X-ray  background,  and 
faint  distant  galaxies,  as  well  as  Hubble’s  law  are 
all  isotropic.  The  observed  isotropy  is  also  evi¬ 
dence  that  the  Universe  is  homogeneous,  since 
a  large-scale  inhomogeneity  would  be  seen  as  an 
anisotropy. 

The  Age  of  the  Universe  Estimates  of  the  ages 
of  the  Earth,  the  Sun  and  of  star  clusters  are  im¬ 
portant  cosmological  observations  that  do  not  de¬ 
pend  on  specific  cosmological  models.  From  the 
decay  of  radioactive  isotopes,  the  age  of  the  Earth 
is  estimated  to  be  4600  million  years.  The  age  of 
the  Sun  is  thought  to  be  slightly  larger  than  this. 
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Fig.  20.7  The 

cosmological  principle.  In 
the  small  circle  (A)  about 
the  observer  (0)  the 
distribution  of  galaxies 
does  not  yet  represent  the 
large-scale  distribution.  In 
the  larger  circle  ( B )  the 
distribution  is  already 
uniform  on  the  average 


The  ages  of  the  oldest  star  clusters  in  the  Milky 
Way  are  10-15  Ga. 

The  values  thus  obtained  give  a  lower  limit  to 
the  age  of  the  Universe.  In  an  expanding  universe, 
the  inverse  Hubble  constant  gives  another  esti¬ 
mate  of  that  age.  It  is  most  remarkable  that  the 
directly  determined  ages  of  cosmic  objects  are  so 
close  to  the  age  given  by  the  Hubble  constant. 
This  is  strong  evidence  that  Hubble’s  law  really 
is  due  to  the  expansion  of  the  Universe.  It  also 
shows  that  the  oldest  star  clusters  formed  very 
early  in  the  history  of  the  Universe. 

The  Relative  Helium  Abundance  A  cosmo¬ 
logical  theory  should  also  give  an  acceptable  ac¬ 
count  of  the  origin  and  abundances  of  the  ele¬ 
ments.  Even  the  abundance  of  the  elementary  par¬ 
ticles  and  the  lack  of  antimatter  are  cosmological 
problems  that  have  begun  to  be  investigated  in  the 
context  of  theories  of  the  early  Universe. 

Observations  show  that  the  oldest  objects  con¬ 
tain  about  25  %  by  mass  of  helium,  the  most 
abundant  element  after  hydrogen.  The  amount  of 
helium  produced  is  sensitive  to  the  temperature  of 
the  Universe,  which  is  related  to  that  of  the  back¬ 
ground  radiation.  The  computations  made  for  the 
standard  models  of  the  expanding  Universe  (the 
Friedmann  models)  yield  a  helium  abundance  of 
exactly  the  right  size. 


20.2  The  Cosmological  Principle 

One  hopes  that  as  ever  larger  volumes  of  the  Uni¬ 
verse  are  observed,  its  average  properties  will  be¬ 
come  simple  and  well  defined.  Figure  20.7  at¬ 
tempts  to  show  this.  It  shows  a  distribution  of 
galaxies  on  a  plane.  As  the  circle  surrounding  the 
observer  O  becomes  larger,  the  mean  density  in¬ 
side  the  circle  becomes  practically  independent 
of  its  size.  The  same  behaviour  occurs,  regardless 
of  the  position  of  the  centre  of  O  :  at  close  dis¬ 
tances,  the  density  varies  randomly  (Fig.  20.8), 
but  in  a  large  enough  volume,  the  average  den¬ 
sity  is  constant.  This  is  an  example  of  the  cosmo¬ 
logical  principle:  apart  from  local  irregularities, 
the  Universe  looks  the  same  from  all  positions  in 
space. 

The  cosmological  principle  is  a  far-reaching 
assumption,  which  has  been  invoked  in  order  to 
put  constraints  on  the  large  variety  of  possible 
cosmological  theories.  If  in  addition  to  the  cos¬ 
mological  principle  one  also  assumes  that  the 
Universe  is  isotropic,  then  the  only  possible  cos¬ 
mic  flow  is  a  global  expansion.  In  that  case,  the 
local  velocity  difference  V  between  two  nearby 
points  has  to  be  directly  proportional  to  their  sep¬ 
aration  (V  =  Hr );  i.e.  Hubble’s  law  must  apply. 

The  plane  universe  of  Fig.  20.7  is  homoge¬ 
neous  and  isotropic,  apart  from  local  irregular- 
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Fig.  20.8  The  galaxies 
seem  to  be  distributed  in  a 
“foamlike”  way.  Dense 
strings  and  shells  are 
surrounded  by  relatively 
empty  regions.  (Seldner, 
M.  et  al.  (1977):  Astron.  J. 
82,  249) 


ities.  Isotropy  at  each  point  implies  homogene¬ 
ity,  but  homogeneity  does  not  require  isotropy. 
An  example  of  an  anisotropic,  homogeneous  uni¬ 
verse  would  be  a  model  containing  a  constant 
magnetic  field:  because  the  field  has  a  fixed  di¬ 
rection,  space  cannot  be  isotropic. 

We  have  already  seen  that  astronomical  obser¬ 
vations  support  the  homogeneity  and  isotropy  of 
our  observable  neighbourhood,  the  metagalaxy. 
On  the  grounds  of  the  cosmological  principle, 
these  properties  may  be  extended  to  the  whole  of 
the  Universe. 

The  cosmological  principle  is  closely  related 
to  the  Copernican  principle  that  our  position  in 
the  Universe  is  in  no  way  special.  From  this  prin¬ 
ciple,  it  is  only  a  short  step  to  assume  that  on 
a  large  enough  scale,  the  local  properties  of  the 
metagalaxy  are  the  same  as  the  global  properties 
of  the  Universe. 

Homogeneity  and  isotropy  are  important  sim¬ 
plifying  assumptions  when  trying  to  construct 
cosmological  models  which  can  be  compared 


with  local  observations.  They  may  therefore  rea¬ 
sonably  be  adopted,  at  least  as  a  preliminary  hy¬ 
pothesis. 


20.3  Homogeneous  and  Isotropic 
Universes 


Under  general  conditions,  space  and  time  coordi¬ 
nates  in  a  universe  may  be  chosen  so  that  the  val¬ 
ues  of  the  space  coordinates  of  observers  moving 
with  the  matter  are  constant.  It  can  be  shown  that 
in  a  homogeneous  and  isotropic  universe,  the  line 
element  (Appendix  B)  then  takes  the  form 


d s2  =  —c2  d t2  +  R2(t) 


x 


dr2 

1  —  kr 2 


+  r2  [dO2  +  cos2  0  d02)  , 

(20.4) 


known  as  the  Robertson-Walker  line  element. 
(The  radial  coordinate  r  is  defined  to  be  dimen¬ 
sionless.)  R(t)  is  a  time-dependent  quantity  rep¬ 
resenting  the  scale  of  the  Universe.  If  R  increases 
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Fig.  20.9  The  two-dimensional  analogues  of  the  Fried¬ 
mann  models:  a  spherical  surface,  a  plane  and  a  saddle 
surface 


with  time,  all  distances,  including  those  between 
galaxies,  will  grow.  The  coefficient  k  may  be 
+  1,  0  or  —1,  corresponding  to  the  three  possi¬ 
ble  geometries  of  space,  the  elliptic  or  closed ,  the 
parabolic  and  the  hyperbolic  or  open  model. 

The  space  described  by  these  models  need  not 
be  Euclidean,  but  can  have  positive  or  negative 
curvature.  Depending  on  the  curvature,  the  vol¬ 
ume  of  the  universe  may  be  finite  or  infinite.  In 
neither  case  does  it  have  a  visible  edge. 

The  two-dimensional  analogy  to  elliptical 
(k  =  +1)  geometry  is  the  surface  of  a  sphere 
(Fig.  20.9):  its  surface  area  is  finite,  but  has  no 
edge.  The  scale  factor  R(t )  represents  the  size  of 
the  sphere.  When  R  changes,  the  distances  be¬ 
tween  points  on  the  surface  change  in  the  same 
way.  Similarly,  a  three-dimensional  “spherical 
surface”,  or  the  space  of  elliptical  geometry,  has 
a  finite  volume,  but  no  edge.  Starting  off  in  an 
arbitrary  direction  and  going  on  for  long  enough, 
one  always  returns  to  the  initial  point  (unless  the 
distance  is  increased  too  much  by  the  expansion). 

When  k  =  0,  space  is  flat  or  Euclidean,  and  the 
expression  for  the  line  element  (20.4)  is  almost 
the  same  as  in  the  Minkowski  space.  The  only 
difference  is  the  scale  factor  R(t).  All  distances 
in  a  Euclidean  space  change  with  time.  The  two- 
dimensional  analogue  of  this  space  is  a  plane. 

The  volume  of  space  in  the  hyperbolic  geom¬ 
etry  (k  =  —  1)  is  also  infinite.  A  two-dimensional 
idea  of  the  geometry  in  this  case  is  given  by  a  sad¬ 
dle  surface  or  a  horn  becoming  infinitely  wider. 

In  a  homogeneous  and  isotropic  universe, 
many  physical  quantities  will  depend  on  time 
through  the  scale  factor  R(t).  For  example,  from 
the  form  of  the  line  element,  it  is  evident  that  all 
distances  will  be  proportional  to  R  (Fig.  20.10). 


Fig.  20.10  When  space  expands,  all  galaxy  separations 
grow  with  the  scale  factor  R:  r'  =  [R^) / R(to)]r 


Thus,  if  the  distance  to  a  galaxy  is  r  at  time  t,  then 
at  time  to  (in  cosmology,  the  subscript  0  refers  to 
the  present  value)  it  will  be 


R(to) 

R(f) 


(20.5) 


Similarly,  all  volumes  will  be  proportional  to  R3 . 
From  this  it  follows  that  the  density  of  any  con¬ 
served  quantity  (e.g.  mass)  will  behave  as  R~3 . 

It  can  be  shown  that  the  wavelength  of  ra¬ 
diation  in  an  expanding  universe  is  proportional 
to  R ,  like  all  other  lengths.  If  the  wavelength  at 
the  time  of  emission,  corresponding  to  the  scale 
factor  R ,  is  A,  then  it  will  be  Ao  when  the  scale 
factor  has  increased  to  Rq  : 


Ao  _  Ro 

T  “  -Rm 


(20.6) 


The  redshift  is  z  =  (Aq  —  A) /A,  and  hence 


1  +  z  = 


(20.7) 


i.e.  the  redshift  of  a  galaxy  expresses  how  much 
the  scale  factor  has  changed  since  the  light  was 
emitted.  For  example,  the  light  from  a  quasar 
with  z  =  1  was  emitted  at  a  time  when  all  dis¬ 
tances  were  half  their  present  values. 

For  small  values  of  the  redshift,  (20.7)  ap¬ 
proaches  the  usual  form  of  Hubble’s  law.  This  can 
be  seen  as  follows.  When  z  is  small,  the  change 
in  R  during  the  propagation  of  a  light  signal  will 
also  be  small  and  proportional  to  the  light  travel 
time  t.  Because  t  =  r/c  approximately,  where  r 
is  the  distance  of  the  source,  the  redshift  will  be 
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proportional  to  r .  If  the  constant  of  proportional¬ 
ity  is  denoted  by  H/c ,  one  has 

z  =  Hr/c.  (20.8) 

This  is  formally  identical  to  Hubble’s  law  (20.1). 
However,  the  redshift  is  now  interpreted  in  the 
sense  of  (20.7). 

As  the  universe  expands,  the  photons  in  the 
background  radiation  will  also  be  redshifted.  The 
energy  of  each  photon  is  inversely  proportional  to 
its  wavelength,  and  will  therefore  behave  as  R~l . 
It  can  be  shown  that  the  number  of  photons  will 
be  conserved,  and  thus  their  number  density  will 
behave  as  R~3 .  Combining  these  two  results,  one 
finds  that  the  energy  density  of  the  background 
radiation  is  proportional  to  R~4.  The  energy  den¬ 
sity  of  blackbody  radiation  is  proportional  to  T4, 
where  T  is  the  temperature.  Thus  the  temperature 
of  cosmic  background  radiation  will  vary  as  R~l . 


20.4  The  Friedmann  Models 

The  results  of  the  preceding  section  are  valid  in 
any  homogeneous  and  isotropic  universe.  In  order 
to  determine  the  precise  time-dependence  of  the 
scale  factor  R(t)  a  theory  of  gravity  is  required. 

In  1917  Albert  Einstein  presented  a  model 
of  the  Universe  based  on  his  general  theory  of 
relativity.  It  described  a  geometrically  symmet¬ 
ric  (spherical)  space  with  finite  volume  but  no 
boundary.  In  accordance  with  the  cosmologi¬ 
cal  principle,  the  model  was  homogeneous  and 
isotropic.  It  was  also  static :  the  volume  of  space 
did  not  change. 

In  order  to  obtain  a  static  model,  Einstein  had 
to  introduce  a  new  repulsive  force,  the  cosmo¬ 
logical  term ,  in  his  equations.  The  size  of  this 
cosmological  term  is  given  by  the  cosmological 
constant  A.  Einstein  presented  his  model  before 
the  redshifts  of  the  galaxies  were  known,  and  tak¬ 
ing  the  Universe  to  be  static  was  then  reasonable. 
When  the  expansion  of  the  Universe  was  discov¬ 
ered,  this  argument  in  favour  of  the  cosmological 
constant  vanished.  Einstein  himself  later  called  it 
the  biggest  blunder  of  his  life.  Nevertheless,  the 
most  recent  observations  now  seem  to  indicate 
that  a  non-zero  cosmological  constant  has  to  be 
present. 


The  St.  Petersburg  physicist  Alexander  Fried¬ 
mann  and  later,  independently,  the  Belgian  Geor¬ 
ges  Lemaitre  studied  the  cosmological  solutions 
of  Einstein’s  equations.  If  A  =  0,  only  evolving, 
expanding  or  contracting  models  of  the  Universe 
are  possible.  From  the  Friedmann  models  exact 
formulas  for  the  redshift  and  Hubble’s  law  may 
be  derived. 

The  general  relativistic  derivation  of  the  law  of 
expansion  for  the  Friedmann  models  will  not  be 
given  here.  It  is  interesting  that  the  existence  of 
three  types  of  models  and  their  law  of  expansion 
can  be  derived  from  purely  Newtonian  consider¬ 
ations,  with  results  in  complete  agreement  with 
the  relativistic  treatment.  The  detailed  derivation 
is  given  on  Box  20.1,  but  the  essential  character 
of  the  motion  can  be  obtained  from  a  simple  en¬ 
ergy  argument. 

Let  us  consider  a  small  expanding  spherical  re¬ 
gion  in  the  Universe.  In  a  spherical  distribution  of 
matter,  the  gravitational  force  on  a  given  spher¬ 
ical  shell  depends  only  on  the  mass  inside  that 
shell.  We  shall  here  assume  A  =  0. 

We  can  now  consider  the  motion  of  a  galaxy 
of  mass  m  at  the  edge  of  our  spherical  region. 
According  to  Hubble’s  law,  its  velocity  will  be 
V  =  Hr  and  the  corresponding  kinetic  energy, 

T  =  mV2/ 2.  (20.9) 

The  potential  energy  at  the  edge  of  a  sphere  of 
mass  M  is  U  =  —GMm/r.  Thus  the  total  energy 
is 


E  =  T  +  U  =mV2/2-  GMm/r,  (20.10) 


which  has  to  be  constant.  If  the  mean  density  of 
the  Universe  is  p,  the  mass  is  M  —  (47rr3 /3)p. 
The  value  of  p  corresponding  to  E  =  0  is  called 
the  critical  density ,  pc.  We  have 


1  99  G  Mm 

E=  -mH2r2 - 


-mH2r2  —  Gm 
2 


471  r3pc 


=  mr 


2 


(20.11) 
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whence 


Pc  = 


3  H2 
8  nG 


(20.12) 


The  expansion  of  the  Universe  can  be  com¬ 
pared  to  the  motion  of  a  mass  launched  vertically 
from  the  surface  of  a  celestial  body.  The  form  of 
the  orbit  depends  on  the  initial  energy.  In  order 
to  compute  the  complete  orbit,  the  mass  M  of 
the  main  body  and  the  initial  velocity  have  to  be 
known.  In  cosmology,  the  corresponding  param¬ 
eters  are  the  mean  density  and  the  Hubble  con¬ 
stant. 

The  E  =  0  model  corresponds  to  the  Eu¬ 
clidean  Friedmann  model,  the  Einstein-de  Sitter 
model.  If  the  density  exceeds  the  critical  density, 
the  expansion  of  any  spherical  region  will  turn 
to  a  contraction  and  it  will  collapse  to  a  point. 
This  corresponds  to  the  closed  Friedmann  model. 
Finally,  if  p  <  pc,  the  ever  expanding  hyperbolic 
model  is  obtained.  The  behaviour  of  the  scale  fac¬ 
tor  in  these  three  cases  is  shown  in  Fig.  20.1 1. 

These  three  models  of  the  universe  are  called 
the  standard  models.  They  are  the  simplest  rel¬ 
ativistic  cosmological  models  for  A  =  0.  Mod¬ 
els  with  A  ^  0  are  mathematically  more  compli¬ 
cated,  but  show  the  same  general  behaviour. 

The  simple  Newtonian  treatment  of  the  expan¬ 
sion  problem  is  possible  because  Newtonian  me¬ 
chanics  is  approximately  valid  in  small  regions 
of  the  Universe.  However,  although  the  resulting 
equations  are  formally  similar,  the  interpretation 


Fig.  20.11  The  time  dependence  of  the  scale  factor  for 
different  values  of  k.  The  cosmological  constant  A  =  0 


of  the  quantities  involved  (e.g.  the  parameter  k) 
is  not  the  same  as  in  the  relativistic  context.  The 
global  geometry  of  the  Friedmann  models  can 
only  be  understood  within  the  general  theory  of 
relativity. 

Consider  two  points  at  a  separation  r  at  the 
time  t .  Fet  their  relative  velocity  be  V .  If  the  dis¬ 
tance  at  to  is  rp  then 


and 


V  =  r  = 


m 

R(to) 


ro, 

(20.13) 


and  thus  the  Hubble  constant  is 


v  =  m 

r  R(t) 


(20.14) 


The  deceleration  of  the  expansion  is  described 
by  the  deceleration  parameter  q ,  defined  as 


q  =  -RR/R2.  (20.15) 


The  deceleration  parameter  describes  the  change 
of  the  rate  of  the  expansion  R.  The  additional  fac¬ 
tors  have  been  included  in  order  to  make  it  dimen¬ 
sionless,  i.e.  independent  of  the  choice  of  units  of 
length  and  time. 

The  value  of  the  deceleration  parameter  can 

be  expressed  in  terms  of  the  Hubble  constant  and 

density.  Substitute  the  definition  of  H  and  the  ex- 
•  • 

pression  of  R  (from  Box  20.1)  to  the  definition 
of  q .  We  get 


47rG  poRq 
3  R3H2' 


(20.16) 


The  density  is  usually  expressed  in  terms  of  the 
density  parameter  Q  defined  as  Q  =  p/pc ,  so 
that  Q  =  1  corresponds  to  the  Einstein-de  Sitter 
model.  Thus 


871 G  poRq 
3  R3H2' 


(20.17) 


Hence  there  is  a  simple  relation  between  ,£2 
and  q: 

P2  =  2q.  (20.18) 

The  value  q  =  1  /2  of  the  deceleration  parameter 
corresponds  to  the  critical  density  Q  =  1.  Both 
quantities  are  in  common  use  in  cosmology.  It 
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should  be  noted  that  the  density  and  the  decelera¬ 
tion  can  be  observed  independently.  The  validity 
of  (20.18)  is  thus  a  test  for  the  correctness  of  gen¬ 
eral  relativity  with  A  =  0. 


20.5  Cosmological  Tests 

A  central  cosmological  problem  is  the  question  of 
which  Friedmann  model  best  represents  the  real 
Universe.  Different  models  make  different  obser¬ 
vational  predictions.  Recently  there  has  been  con¬ 
siderable  progress  in  the  determination  of  the  cos¬ 
mological  parameters,  and  for  the  first  time  there 
is  now  a  set  of  parameters  that  appear  capable 
of  accounting  for  all  observations.  In  the  follow¬ 
ing,  some  possible  tests  will  be  considered.  These 
tests  are  related  to  the  average  properties  of  the 
Universe.  Further  cosmological  constraints  can 
be  obtained  from  the  observed  structures.  These 
will  be  discussed  in  Sect.  20.7. 

The  Critical  Density  If  the  average  density  p 
is  larger  than  the  critical  density  pc,  the  Uni¬ 
verse  is  closed.  For  the  Hubble  constant  H  = 
100  kms-1Mpc-1,  the  value  of  pc  =  1.9  x 
10-26  kgm-3,  corresponding  to  roughly  ten  hy¬ 
drogen  atoms  per  cubic  metre.  Mass  determina¬ 
tions  for  individual  galaxies  lead  to  smaller  val¬ 
ues  for  the  density,  favouring  an  open  model. 
However,  the  density  determined  in  this  way 
is  a  lower  limit,  since  there  may  be  significant 
amounts  of  invisible  mass  outside  the  main  parts 
of  the  galaxies. 

If  most  of  the  mass  of  clusters  of  galaxies 
is  dark  and  invisible,  it  will  increase  the  mean 
density  nearer  the  critical  value.  Using  the  virial 
masses  of  X-ray  clusters  of  galaxies  (Sect.  19.2), 
one  finds  £2$  =  0.3.  Considerations  of  the  ob¬ 
served  velocities  of  clusters  of  galaxies  indicate 
that  the  relative  amount  of  dark  matter  does  not 
increase  further  on  even  larger  scales. 

Neutrinos  have  a  small  mass  (about  10~4  elec¬ 
tron  mass).  A  large  neutrino  background  should 
have  been  produced  in  the  big  bang.  In  spite  of 
the  small  suggested  mass  of  the  neutrinos,  they 
can  still  form  a  part  of  the  mass  of  the  Universe. 
Nowadays  it  is  known  that  the  mass  of  the  neu¬ 
trino  is  too  small  to  explain  the  dark  matter  prob¬ 
lem.  Also,  they  move  at  relativistic  speeds  and  are 


too  “hot”  to  form  the  observed  structures.  Instead 
much  recent  work  in  cosmology  has  been  based 
on  the  hypothesis  of  Cold  Dark  Matter  (CDM), 
i.e.  the  idea  that  a  significant  part  of  the  mass  of 
the  Universe  is  in  the  form  of  non-relativistic  par¬ 
ticles  of  an  unknown  kind. 

It  is  understood  that  the  mean  density  of  the 
University  cannot  be  reliably  determined  from 
the  observations  of  the  current  constituents  of  the 
Universe.  Instead,  a  very  precise  value  for  the 
mean  density  is  found  by  studying  the  details  of 
the  cosmic  background  radiation,  since  that  gives 
a  single  picture  of  the  contents  of  the  whole  Uni¬ 
verse.  The  measurements  show  that  the  density  is 
exactly  the  critical  density,  which  means  that  the 
space  is  flat.  The  most  recent  result  of  Planck  ob¬ 
tained  in  2015  gives  a  curvature  of  0.001  db  0.004, 
meaning  that  the  deviation  from  zero  is  at  most 
0.5  %. 

The  Magnitude-Redshift  Test  Although  for 
small  redshifts,  all  models  predict  the  Hubble  re¬ 
lationship  m  =  5  lg  z  +  C  for  standard  candles,  for 
larger  redshifts  there  are  differences,  depending 
on  the  deceleration  parameter  q.  This  provides 
a  way  of  measuring  q . 

The  models  predict  that  galaxies  at  a  given 
redshift  look  brighter  in  the  closed  models  than 
in  the  open  ones  (see  Fig.  20.4).  Measurements 
of  type  la  supemovae  out  to  redshifts  z  =  1.7  us¬ 
ing  the  Hubble  Space  Telescope  have  now  shown 
that  the  observed  q  is  inconsistent  with  models 
having  A  =  0.  Assuming  £2$  =  0.3  these  obser¬ 
vations  require  Q a  =0.7,  where  Q a  is  defined 
below  in  Box  20.1. 

The  Angular  Diameter-Redshift  Test  Along 
with  the  magnitude-redshift  test,  the  relation  be¬ 
tween  angular  diameter  and  redshift  has  been 
used  as  a  cosmological  test.  Let  us  first  consider 
how  the  angular  diameter  0  of  a  standard  object 
varies  with  distance  in  static  models  with  differ¬ 
ent  geometries.  In  a  Euclidean  geometry,  the  an¬ 
gular  diameter  is  inversely  proportional  to  the  dis¬ 
tance.  In  an  elliptical  geometry,  6  decreases  more 
slowly  with  distance,  and  even  begins  to  increase 
beyond  a  certain  point.  The  reason  for  this  can  be 
understood  by  thinking  of  the  surface  of  a  sphere. 
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For  an  observer  at  the  pole,  the  angular  diameter 
is  the  angle  between  two  meridians  marking  the 
edges  of  his  standard  object.  This  angle  is  small¬ 
est  when  the  object  is  at  the  equator,  and  grows 
towards  the  opposite  pole.  In  a  hyperbolic  geom¬ 
etry,  the  angle  0  decreases  more  rapidly  with  dis¬ 
tance  than  in  the  Euclidean  case. 

In  an  expanding  closed  universe  the  angular 
diameter  should  begin  to  increase  at  a  redshift 
of  about  one.  This  effect  has  been  looked  for  in 
the  diameters  of  radio  galaxies  and  quasars.  No 
turnover  has  been  observed,  but  this  may  also  be 
due  to  evolution  of  the  radio  sources  or  to  the 
selection  of  the  observational  data.  At  smaller 
redshifts,  the  use  of  the  diameters  of  clusters  of 
galaxies  has  yielded  equally  inconclusive  results. 

Evolutionary  effects  are  the  most  important 
factors  for  the  uncertainty.  We  know  that  the 
sizes  of  large  scale  structures  are  changing  while 
the  Universe  is  evolving.  The  evolution  of  the 
structures  has  been  investigated  using  cosmo¬ 
logical  simulations  (e.g.  Millennium,  Illustris 
Fig.  20.14).  The  simulations  give  estimates  for 
the  object  sizes  at  different  redshift,  and  these  val¬ 
ues  can  be  compared  with  the  actual  observations. 
The  results  are  consistent  with  the  cosmological 
ACDM  concordance  model. 

Basically  the  same  idea  can  be  applied  to  the 
angular  scale  of  the  strongest  fluctuation  in  the 
cosmic  microwave  background.  The  linear  size 
of  these  depends  only  weakly  on  the  cosmolog¬ 
ical  model  and  can  therefore  be  treated  as  a  stan¬ 
dard  measuring  rod.  Their  redshift  is  determined 
by  the  decoupling  of  matter  and  radiation  (see 
Sect.  20.6).  Observations  of  their  angular  size 
have  provided  strong  evidence  that  £?o  +  Q a  =  1  * 
i.e.  the  Universe  is  flat. 

Primordial  Nucleosynthesis  The  standard  mo¬ 
del  predicts  that  25  %  of  the  mass  of  the  Universe 
turned  into  helium  in  the  “big  bang”.  This  amount 
is  not  sensitive  to  the  density  and  thus  does  not 
provide  a  strong  cosmological  test.  However,  the 
amount  of  deuterium  left  over  from  helium  syn¬ 
thesis  does  depend  strongly  on  the  density.  Al¬ 
most  all  deuterons  formed  in  the  big  bang  unite 
into  helium  nuclei.  For  a  larger  density  the  colli¬ 
sions  destroying  deuterium  were  more  frequent. 


Thus  a  small  present  deuterium  abundance  indi¬ 
cates  a  high  cosmological  density.  Similar  argu¬ 
ments  apply  to  the  amounts  of  3  He  and  7  Li  pro¬ 
duced  in  the  big  bang.  The  interpretation  of  the 
observed  abundances  is  difficult,  since  they  have 
been  changed  by  later  nuclear  processes.  Still, 
present  results  for  the  abundances  of  these  nuclei 
are  consistent  with  each  other  and  with  a  den¬ 
sity  corresponding  to  £2o  about  0.04.  Note  that 
this  number  only  refers  to  the  mass  in  the  form 
of  baryons,  i.e.  protons  and  neutrons.  Since  the 
virial  masses  of  clusters  of  galaxies  indicate  that 
£2o  is  about  0.3,  this  has  stimulated  models  such 
as  the  CDM  model,  where  most  of  the  mass  is  not 
in  the  form  of  normal  baryons. 

Ages  The  ages  of  different  Friedmann  models 
can  be  compared  with  known  ages  of  various 
systems.  The  age  to  of  a  Friedmann  model  with 
given  £?o  and  Q  a  is  obtained  by  integrating  equa¬ 
tion  (8)  of  Box  20.1.  This  gives 

to  =  Hq  f  d cl{^2o^  i  T  ^  T  1 

Jo 
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This  age  is  required  to  be  larger  than  the  ages  of 
the  oldest  known  astronomical  objects. 

If  the  density  is  critical  and  A  =  0,  to  No  = 
2/3.  Thus  if  Ho  =  75  kms-1  Mpc-1,  the  age  is 

=  9  Ga.  Larger  values  of  give  smaller  ages, 
whereas  positive  values  of  A  lead  to  larger  ages. 
It  has  been  a  source  of  embarrassment  that  the 
best  values  of  H  have  tended  to  give  an  age  for 
the  Universe  only  marginally  consistent  with  the 
ages  of  the  oldest  astronomical  objects.  With  the 
introduction  of  a  positive  cosmological  constant 
and  a  slight  downward  revision  of  stellar  ages  this 
problem  has  disappeared.  In  2003  the  WMAP  re¬ 
sults  gave  an  age  of  13.7  Ga.  A  more  recent  value 
by  the  Planck  satellite  is  13.80  ±  0.02  Ga. 

The  best  current  parameter  values  give  13- 
14  Ga  for  the  age  of  the  Universe. 

The  “Concordance”  Model  In  summary  there 
has  been  a  remarkable  recent  convergence  be¬ 
tween  different  cosmological  tests.  The  resulting 
model  has  a  positive  cosmological  constant,  and 
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most  of  the  matter  is  cold  and  dark.  It  is  thus  re¬ 
ferred  to  as  the  ACDM  model.  The  best  parame¬ 
ter  values  are  H  =  70  kms-1  Mpc-1,  &a  —  0.7, 

=  0.3,  with  cold  dark  matter  making  up  85  % 
of  the  total  density. 

The  concordance  model  is  by  no  means  defini¬ 
tive.  In  particular  the  reason  for  the  cosmological 
constant  is  a  major  puzzle.  In  order  to  allow  for 
the  possibility  that  A  is  variable  it  has  become 
customary  to  refer  to  it  as  dark  energy ,  saving  the 
term  cosmological  constant  for  the  case  of  a  con¬ 
stant  A.  Even  if  some  alternative  mechanism  can 
produce  the  same  effect  as  a  non-zero  A,  finding 
at  least  one  set  of  acceptable  parameters,  thanks 
to  the  WMAP  and  Planck  satellites,  is  an  impor¬ 
tant  step  forward. 

An  additional  set  of  constraints  on  the  cosmo¬ 
logical  model  comes  from  the  large-scale  struc¬ 
tures  of  the  Universe.  These  constraints,  which 
contribute  further  support  for  the  concordance 
model,  will  be  considered  in  Sect.  20.7. 


20.6  History  of  the  Universe 

We  have  seen  how  the  density  of  matter  and  of  ra¬ 
diation  energy  and  temperature  can  be  computed 
as  functions  of  the  scale  factor  R.  Since  the  scale 
factor  is  known  as  a  function  of  time,  it  is  possi¬ 
ble  to  calculate  how  these  quantities  change  with 
time. 

When  we  back  in  time  R  is  decreasing.  Since 
p  oc  R~3,  T  oc  R~{,  densities  and  temperatures 
at  the  beginning  were  so  immense  that  all  theo¬ 
ries  about  the  physical  processes  taking  place  are 
highly  conjectural.  Nevertheless  attempts  have 
been  made  at  understanding  the  most  fundamen¬ 
tal  properties  of  the  Universe  on  the  basis  of 
modern  theories  of  particle  physics.  For  example, 
no  indications  of  significant  amounts  of  antimat¬ 
ter  in  the  Universe  have  been  discovered.  Thus, 
for  some  reason,  the  number  of  matter  particles 
must  have  exceeded  that  of  antimatter  particles  by 
a  factor  of  1.000000001.  Because  of  this  symme¬ 
try  breaking ,  when  99.9999999  %  of  the  hadrons 
were  annihilated,  10-7  %  was  left  later  to  form 
galaxies  and  everything  else.  It  has  been  specu¬ 
lated  that  the  broken  symmetry  originated  in  par¬ 
ticle  processes  about  10-35  s  after  the  initial  time. 
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Fig.  20.12  The  energy  densities  of  matter  and  radiation 
decrease  as  the  Universe  expands.  Nucleon-antinucleon 
pairs  annihilate  at  10~4  s;  electron-positron  pairs  at  1  s 

The  breaking  of  fundamental  symmetries  in 
the  early  Universe  may  lead  to  what  is  known  as 
inflation  of  the  Universe.  In  consequence  of  sym¬ 
metry  breaking,  the  dominant  energy  density  may 
be  the  zero-point  energy  of  a  quantum  field.  This 
energy  density  will  lead  to  inflation,  a  strongly 
accelerated  expansion,  which  will  dilute  irregu¬ 
larities  and  drive  the  density  very  close  to  the 
critical  value.  One  may  thus  understand  how  the 
present  homogeneity,  isotropy  and  flatness  of  the 
Universe  have  come  about.  In  the  inflationary 
picture  the  Universe  has  to  be  very  nearly  flat, 
k2o  +  Q a  =  1  •  The  inflationary  models  also  make 
specific  predictions  for  the  form  of  the  irregulari¬ 
ties  in  the  CMB.  These  predictions  are  in  general 
agreement  with  what  has  been  observed. 

As  the  Universe  expanded,  the  density  and 
temperature  decreased  (Fig.  20.12)  and  condi¬ 
tions  became  such  that  known  physical  princi¬ 
ples  can  be  applied.  During  the  hot  early  stages, 
photons  and  massive  particles  were  continually 
changing  into  each  other:  high-energy  photons 
collided  to  produce  particle-antiparticle  pairs, 
which  then  were  annihilated  and  produced  pho¬ 
tons.  As  the  Universe  cooled,  the  photon  ener¬ 
gies  became  too  small  to  maintain  this  equilib¬ 
rium.  There  is  a  threshold  temperature  below 
which  particles  of  a  given  type  are  no  longer 
produced.  For  example,  the  threshold  tempera¬ 
ture  for  hadrons  (protons,  neutrons  and  mesons) 
is  T  =  1012  K,  reached  at  the  time  t  =  10-4  s. 
Thus  the  present  building  blocks  of  the  atomic 
nuclei,  protons  and  neutrons,  are  relics  from  the 
time  10_8-10-4  s,  known  as  the  hadron  era. 
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Even  before  the  hadron  era,  when  the  age  of 
the  Universe  was  about  10“ 12  s,  the  dark  mat¬ 
ter  particles  forming  about  5/6  of  all  matter  were 
born.  They  were  decoupled  from  radiation  around 
10~9  s  and  started  to  form  structures  to  which  the 
ordinary  matter  started  to  condense  hundred  mil¬ 
lion  years  later. 

The  Lepton  Era  In  the  time  period  10-4-l  s, 
the  lepton  era ,  the  photon  energies  were  large 
enough  to  produce  light  particles,  such  as  elec¬ 
tron-positron  pairs.  Because  of  matter-antimatter 
symmetry  breaking,  some  of  the  electrons  were 
left  over  to  produce  present  astronomical  bodies. 
During  the  lepton  era  neutrino  decoupling  took 
place.  Previously  the  neutrinos  had  been  kept 
in  equilibrium  with  other  particles  by  fast  par¬ 
ticle  reactions.  As  the  density  and  temperature 
decreased,  so  did  the  reaction  rates,  and  finally 
they  could  no  longer  keep  the  neutrinos  in  equi¬ 
librium.  The  neutrinos  decoupled  from  other  mat¬ 
ter  and  were  left  to  propagate  through  space  with¬ 
out  any  appreciable  interactions.  It  has  been  cal¬ 
culated  that  there  are  at  present  600  such  cosmo¬ 
logical  neutrinos  per  cubic  centimetre,  but  their 
negligible  interactions  make  them  extremely  dif¬ 
ficult  to  observe.  After  the  neutrino  decoupling 
the  amount  of  free  neutrons  kept  decreasing  as 
they  decayed  to  protons  and  electrons. 

The  Radiation  Era  After  the  end  of  the  lep¬ 
ton  era,  about  1  s  after  the  initial  time,  the  most 
important  form  of  energy  was  electromagnetic  ra¬ 
diation.  This  stage  is  called  the  radiation  era.  At 
its  beginning  the  temperature  was  about  1010  K 
and  at  its  end,  about  one  million  years  later,  when 
the  radiation  energy  density  had  dropped  to  that 
of  the  particles,  it  had  fallen  to  about  40,000  de¬ 
grees.  At  the  very  beginning  of  the  radiation  era 
within  a  few  hundred  seconds  helium  was  pro¬ 
duced. 

Just  before  the  epoch  of  helium  synthesis,  the 
number  ratio  of  free  protons  and  neutrons  was 
changing  because  of  the  decay  of  the  free  neu¬ 
trons.  After  about  100  s  the  temperature  had 
dropped  to  about  109  K,  which  is  low  enough 
for  deuterons  to  be  formed.  All  remaining  neu¬ 
trons  were  then  incorporated  in  deuterons;  these, 


in  turn,  were  almost  entirely  consumed  to  pro¬ 
duce  helium  nuclei.  Thus  the  amount  of  helium 
synthesised  was  determined  by  the  number  ratio 
of  protons  and  neutrons  at  the  time  of  deuterium 
production  t  =  100  s.  Calculations  show  that  this 
ratio  was  about  14  :  2.  Thus,  out  of  16  nucleons, 
2  protons  and  2  neutrons  were  incorporated  in 
a  helium  nucleus.  Consequently  4/16  =  25  %  of 
the  mass  turned  into  helium.  This  is  remarkably 
close  to  the  measured  primordial  helium  abun¬ 
dance. 

Only  the  isotopes  2H,  3  He,  4 He  and  7 Li  were 
produced  in  appreciable  numbers  by  nuclear  pro¬ 
cesses  in  the  big  bang.  The  heavier  elements  have 
formed  later  in  stellar  interiors,  in  supernova  ex¬ 
plosions  and  perhaps  in  energetic  events  in  galac¬ 
tic  nuclei. 

Radiation  Decoupling.  The  Matter  Era  As 

we  have  seen,  the  mass  density  of  radiation  (ob¬ 
tained  from  the  formula  E  =  me2)  behaves  as 
R~4,  whereas  that  of  ordinary  matter  behaves  as 
R~3.  Thus  the  radiation  mass  density  decreases 
more  rapidly.  At  the  end  of  the  radiation  era  it 
became  smaller  than  the  ordinary  mass  density. 
The  matter  era  began,  bringing  with  it  the  for¬ 
mation  of  galaxies,  stars,  planets  and  human  life. 
At  present,  the  mass  density  of  radiation  is  much 
smaller  than  that  of  matter.  Therefore  the  dynam¬ 
ics  of  the  Universe  is  completely  determined  by 
the  density  of  massive  particles. 

Soon  after  the  end  of  the  radiation  era,  radia¬ 
tion  decoupled  from  matter.  This  happened  when 
the  temperature  had  dropped  to  a  few  thousand 
degrees,  and  the  protons  and  electrons  combined 
to  form  hydrogen  atoms.  It  was  the  beginning  of 
the  “dark  ages”  at  redshifts  z  =  1000-100,  be¬ 
fore  stars  and  galaxies  had  formed,  when  the  Uni¬ 
verse  only  contained  dark  matter,  blackbody  radi¬ 
ation,  and  slowly  cooling  neutral  gas. 

At  present,  light  can  propagate  freely  through 
space.  The  world  is  transparent  to  radiation:  the 
light  from  distant  galaxies  is  weakened  only  by 
the  r~ 2  law  and  by  the  redshift.  Since  there  is 
no  certain  detection  of  absorption  by  neutral  gas, 
there  must  have  been  a  reionisation  of  the  Uni¬ 
verse.  It  is  thought  that  this  occurred  around  z  = 
5-10. 
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As  we  go  backward  in  time  from  the  present,  the 
distances  between  galaxies  and  clusters  of  galax¬ 
ies  become  smaller.  For  example,  the  typical  sep¬ 
aration  between  galaxies  is  100  times  their  di¬ 
ameter.  At  the  redshift  z  =  101  (R  (now)  /  R(t)  = 
100)  most  galaxies  must  have  been  practically  in 
contact.  For  this  reason  galaxies  in  their  present 
form  cannot  have  existed  much  earlier  than  about 
z—  100.  Since  the  stars  were  presumably  formed 
after  the  galaxies,  all  present  astronomical  sys¬ 
tems  must  have  formed  later  than  this. 

It  is  thought  that  all  observed  structures  in  the 
Universe  have  arisen  by  the  gravitational  collapse 
of  small  overdensities.  Whereas  the  presently  ob¬ 
servable  galaxies  have  undergone  considerable 
evolution,  which  makes  it  difficult  to  deduce  their 
initial  state,  on  larger  scales  the  density  variations 
should  still  be  small  and  easier  to  study.  These 
are  the  structures  considered  in  the  present  sec¬ 
tion.  The  later  evolution  of  galaxies  is  discussed 
in  Sect.  19.8,  and  the  formation  of  the  Milky  Way 
in  Sect.  18.5. 

The  Statistical  Description  of  Large-Scale 
Structure  The  departures  from  strict  homo¬ 
geneity  in  the  Universe  are  random  in  charac¬ 
ter,  and  must  therefore  be  described  using  statis¬ 
tical  methods.  Perhaps  the  most  straightforward 
way  of  doing  this  is  to  take  regions  of  a  given 
size,  specified  in  terms  of  their  mass,  and  give  the 
probability  distribution  for  relative  density  varia¬ 
tions  on  that  scale. 

A  second  method  is  to  consider  the  spatial 
separations  between  individual  objects  such  as 
galaxies  or  clusters.  The  distribution  of  these  sep¬ 
arations  is  used  to  define  the  correlation  function, 
which  is  a  measure  of  the  clustering  of  the  objects 
in  question. 

A  third  method  to  describe  large-scale  fluctu¬ 
ations  is  by  means  of  the  power  spectrum.  Here 
the  density  variations  (in  space  or  in  projection 
on  the  sky)  are  represented  as  sum  of  waves.  The 
power  spectrum  is  the  squared  amplitude  of  these 
waves  as  a  function  of  wavelength. 

All  three  methods  are  representations  of  the 
density  variations  in  the  Universe,  and  they  are 


Fig.  20.13  The  motion  of  the  Milky  Way  in  relation  to 
the  microwave  background  can  be  seen  in  the  measure¬ 
ments  of  WMAP.  One  side  of  the  sky  is  darker  (colder), 
and  the  other  side  is  lighter  (warmer).  The  horizontal 
stripe  is  the  densest  part  of  the  Milky  Way.  (Photo  NASA) 

theoretically  closely  related.  However,  in  practice 
they  are  observed  in  different  ways,  and  there¬ 
fore,  which  representation  is  most  suitable  de¬ 
pends  on  what  kind  of  observations  are  being 
analysed.  The  density  variations  are  usually  de¬ 
scribed  by  means  of  a  spectral  index  n  and  an 
amplitude  as  to  be  introduced  below. 

The  Growth  of  Perturbations  In  order  to  de¬ 
scribe  the  growth  of  structures  in  the  Universe, 
consider  a  given  region  containing  the  mass  M. 
If  its  density  is  slightly  larger  than  the  mean  den¬ 
sity,  its  expansion  will  be  slightly  slower  than  that 
of  the  rest  of  the  Universe,  and  its  relative  over¬ 
density  will  grow.  The  rate  of  growth  as  a  func¬ 
tion  of  mass  depends  on  the  relative  importance 
of  the  material  components  of  the  Universe,  dark 
matter,  radiation,  and  ordinary  baryonic  matter. 

It  is  assumed  that  there  is  an  initial  distribu¬ 
tion  of  perturbations  where  the  fluctuations  with 
mass  M  have  an  amplitude  that  is  proportional 
to  M-(^+3)/6.  The  spectral  index  n  is  a  cosmo¬ 
logical  parameter  to  be  determined  from  observa¬ 
tions. 

The  first  step  in  structure  formation  is  when 
a  given  mass  comes  within  the  horizon ,  i.e.  when 
there  has  been  enough  time  since  the  big  bang 
for  light  signals  to  cross  the  given  region.  During 
the  radiation  era  the  horizon  mass  grows  propor¬ 
tionally  to  t3//2,  and  thus  the  time  at  which  the 
mass  M  comes  within  the  horizon  will  be  propor¬ 
tional  to  M2/3 .  Any  perturbation  will  initially  be 
larger  than  the  horizon  mass,  and  while  this  is  the 
case  it  grows  in  proportion  to  t.  Once  a  matter 
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perturbation  comes  inside  the  horizon  its  ampli¬ 
tude  will  remain  constant.  This  constant  ampli¬ 
tude  will  behave  as  tM_^+3^6,  which  is  propor¬ 
tional  to  If  n  =  1,  the  perturbations 

enter  the  horizon  with  an  amplitude  that  is  inde¬ 
pendent  of  mass.  We  shall  see  that  the  observed 
value  of  n  is  in  fact  very  close  to  1 . 

Perturbations  of  both  dark  and  baryonic  mat¬ 
ter  density  will  behave  as  described  above  during 
the  radiation  era.  At  the  end  of  the  radiation  era  at 
time  tgQ,  when  the  mass  densities  of  radiation  and 
(non-relativistic)  matter  become  equal,  the  ampli¬ 
tude  of  the  perturbations  will  be  given  by  the  hori¬ 
zon  mass  M\iq  %  10 16  M©  for  M  <  Meq,  and 
will  be  proportional  to  a/-U+3)/6  for  M  Meq. 
After  equality  the  dark  matter  perturbations  will 
be  free  to  start  growing  again  as  t 2//3,  independent 
of  mass. 

Unlike  dark  matter,  ordinary  baryonic  matter 
perturbations  cannot  grow  as  long  as  the  Universe 
remains  ionised.  Instead  there  is  a  minimum  mass 
of  a  collapsing  gas  cloud  given  by  the  Jeans  mass 
My 

p3/2 

Ml  ~  q3/2 p2  ’  (20.20) 

where  p  and  P  are  the  density  and  pressure  in  the 
cloud  (see  Sect.  6.11).  The  value  of  Mj  before 
decoupling  was 

Mj  =  1018Mg  (20.21) 

and  after  decoupling 

Mj  =  105Mq.  (20.22) 

The  reason  for  the  large  difference  is  that  before 
decoupling,  matter  feels  the  large  radiation  pres¬ 
sure  (P  =  u/3 ,  see  Box  11.1).  After  decoupling, 
this  pressure  no  longer  affects  the  gas. 

The  large  Jeans  mass  before  decoupling  means 
that  overdense  regions  of  normal  gas  cannot  start 
growing  before  z  =  1000.  Rather  than  growing 
they  oscillate  like  sound  waves.  After  decoupling 
a  large  range  of  masses  become  Jeans  unsta¬ 
ble.  By  then  density  perturbations  of  dark  matter 
have  had  time  to  grow,  and  the  gas  will  therefore 
rapidly  fall  into  their  potential  wells.  The  first 
stars  will  start  forming  in  the  collapsing  regions, 
and  will  reionise  the  Universe. 


Fig.  20.14  Cosmological 
simulations  are  used  to 
study  the  evolution  of  the 
structures  of  the  Universe. 
The  international  Illustris 
simulation  shows  how  the 
network  of  dark  matter 
evolves  from  redshift  z  =  4 
{bottom  frame)  to  z  =  0 
{topmost frame).  (Illustris 
Collaboration,  www. 
illustris-project.org) 
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Because  the  expansion  of  the  Universe  works 
against  the  collapse,  the  density  of  Jeans  unstable 
regions  grows  rather  slowly.  In  order  to  produce 
the  observed  systems,  the  density  perturbations  at 
decoupling  cannot  be  too  small.  In  models  with¬ 
out  dark  matter  the  variations  in  the  CMB  pre¬ 
dicted  on  this  basis  tended  to  be  too  large.  In  the 
CDM  model  the  predicted  variations  are  of  the 
expected  amplitude. 

In  the  CDM  model  the  amplitude  of  fluc¬ 
tuations  on  scales  about  Meq  and  smaller  de¬ 
pends  only  weakly  on  mass.  This  is  why  the 
CDM  model  leads  to  an  hierarchical  description 
of  structure  formation.  In  this  picture,  systems  of 
all  masses  above  105  M©  begin  forming  after  de¬ 
coupling.  Because  smaller  systems  will  collapse 
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more  rapidly,  they  are  the  first  to  form,  at  redshifts 
about  20.  Once  the  first  sources  of  light,  starbursts 
or  AGNs,  had  formed,  they  could  reionise  the  gas. 
This  marked  the  ending  of  the  dark  ages  at  red- 
shifts  z  =  10-5. 

The  redshift  of  reionisation  is  still  not  well 
known,  and  is  therefore  treated  as  a  parameter  to 
be  determined  in  tests  based  on  large-scale  struc¬ 
tures.  It  is  usually  expressed  by  means  of  r,  the 
optical  depth  to  electron  scattering  of  the  back¬ 
ground  radiation.  A  larger  value  of  r  corresponds 
to  a  higher  electron  density,  implying  reionisation 
at  a  higher  redshift.  Since  galaxies  can  be  seen  at 
redshifts  larger  than  6,  the  corresponding  value  of 
x  =0.03  represents  a  minimum.  The  final  results 
(2012)  of  the  observations  with  WMAP  gave  r  = 
0.081  ±0.012,  which  would  have  corresponded 
to  a  redshift  10.1  ±  1.0.  The  final  results  (2015) 
of  Planck  the  gave  8.8  d=  0.1  as  the  mean  redshift 
of  reionisation. 

One  finally  has  to  ask  where  the  initial  pertur¬ 
bations  came  from.  An  attractive  feature  of  the 
inflationary  model  is  that  it  makes  specific  pre¬ 
dictions  for  these  initial  perturbations,  deriving 
them  from  quantum  effects  at  very  early  times. 
In  this  way  the  observed  properties  of  the  largest 
astronomical  systems  contain  information  about 
the  earliest  stages  of  our  Universe. 

Fluctuations  of  the  Cosmic  Microwave  Back¬ 
ground  One  important  way  of  studying  the 
large-scale  structure  of  the  early  Universe  is  by 
means  of  the  irregularities  of  the  cosmic  mi¬ 
crowave  background.  The  overdensities  that  were 
later  to  give  rise  to  observed  structures  should 
also  give  rise  to  temperature  variations  of  the 
CMB. 

The  temperature  variations  in  the  microwave 
background  have  been  mapped,  first  by  the  COBE 
satellite,  then  by  WMAP  (Fig.  20.13)  and  Planck. 
A  map  of  the  CMB  according  to  Planck  is  shown 
in  Fig.  20.15.  The  observed  variations  are  in  qual¬ 
itative  agreement  with  the  scenario  for  structure 
formation  described  above. 

A  more  quantitative  view  of  the  observations 
is  provided  by  the  power  spectrum  of  the  obser¬ 
vations  shown  in  Fig.  20.16.  This  shows  the  am¬ 
plitude  of  the  temperature  variations  as  a  function 
of  angular  scale  on  the  sky. 


Fig.  20.15  Temperature  distribution  of  the  background 
radiation  according  to  the  measurements  of  the  Planck 
satellite.  Dark  areas  are  colder,  light  areas  warmer  and 
more  dense.  (Planck  Collaboration) 

The  physical  processes  we  have  described  give 
rise  to  the  features  in  the  power  spectrum.  Thus 
the  first  peak  is  produced  by  perturbations  that 
have  just  had  time  to  collapse  to  a  maximum  den¬ 
sity  since  equality  of  matter  and  radiation,  be¬ 
fore  bouncing  back.  The  linear  size  of  this  peak 
does  not  depend  strongly  on  the  exact  parame¬ 
ters  of  the  model,  and  its  position  (angular  scale) 
can  therefore  be  used  as  a  standard  measuring  rod 
in  the  diameter — redshift  test.  The  second  and 
third  peaks  in  the  power  spectrum,  the  “acous¬ 
tic  peaks ”,  correspond  to  perturbations  that  have 
bounced  back  and  collapsed  again,  respectively. 
The  positions  and  amplitudes  of  these  features 
in  the  power  spectrum  depend  on  the  cosmolog¬ 
ical  parameters.  Thus  the  WMAP  observations 
of  the  CMB  can  be  fitted  using  a  model  con¬ 
taining  six  free  parameters:  the  Hubble  constant, 
the  densities  of  dark  matter  and  baryons,  the  op¬ 
tical  depth  r,  and  the  amplitude  as  and  shape 
n  of  the  initial  perturbations.  The  curvature  of 
the  Universe,  1  —  £2q  —  Qa  can  be  set  equal  to 
zero,  which  determines  the  value  of  Qa  •  The  re¬ 
sults  of  WMAP  and  Planck  have  given  values  for 
these  parameters  in  full  agreement  with  the  con¬ 
cordance  model. 

Very  Large  Scale  Structure  After  decoupling 
structures  on  different  scales  are  free  to  grow.  The 
important  dividing  line  is  whether  a  given  over¬ 
density  is  still  expanding  with  the  rest  of  the  Uni¬ 
verse  or  whether  it  has  had  time  to  recollapse. 
Systems  that  have  recollapsed  will  fairly  rapidly 
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Fig.  20.1 6  Angular  power 
spectrum  of  the 
temperature  variations  of 
the  cosmic  microwave 
background  from  the 
results  of  the  Planck 
satellite.  This  gives  the 
square  of  the  amplitude  of 
the  temperature  variations 
on  different  angular  scales. 
(Planck  Collaboration) 
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virialise ,  i.e.  settle  down  into  a  stationary  state. 
Such  systems  can  be  considered  real  astronomi¬ 
cal  systems. 

The  largest  astronomical  systems  are  clusters 
of  galaxies.  For  some  time  there  was  controversy 
about  whether  even  larger  structures,  superclus¬ 
ters,  existed.  The  controversy  was  settled  in  the 
1970’s  when  it  was  realised  that  this  largely  de¬ 
pended  on  what  was  meant  by  superclusters.  Re¬ 
gions  of  higher  density  existed  on  scales  larger 
than  clusters  of  galaxies  (a  few  Mpc),  all  the  way 
up  to  100  Mpc,  but  they  did  not  really  form  indi¬ 
vidual  structures  in  approximate  equilibrium  like 
the  clusters  of  galaxies. 

Between  these  two  alternatives  are  structures 
that  are  only  now  turning  round  and  beginning  to 
recollapse.  One  way  of  specifying  the  amplitude 
of  the  initial  fluctuations  is  to  give  the  scale  of  this 
transition  point.  This  scale  corresponds  roughly 
to  a  present  linear  size  of  8  Mpc,  and  for  this  rea¬ 
son  the  fluctuation  amplitude  is  commonly  given 
by  means  of  <78,  the  amplitude  of  the  fluctuations 
at  present  scale  8  Mpc.  Because  of  the  way  it  has 
been  chosen  <7g  should  be  close  to  1 . 

Smaller  systems  will  already  have  collapsed. 
There  are  many  ways  of  statistically  describing 
their  large-scale  distribution.  One  of  the  most 
studied  descriptors  of  clustering  is  the  corre¬ 
lation  function ,  §(r).  Consider  two  infinitesi¬ 
mal  volumes  dVi  and  dV2  separated  by  the  dis¬ 
tance  r.  If  there  were  no  clustering,  the  proba¬ 
bility  of  finding  galaxies  in  these  volumes  would 


be  N2  dV\dV2,  where  N  is  the  number  density 
of  galaxies.  Because  of  clustering  this  probability 
is  actually  N2 (l  +  §(r))  dVi  dV2-  The  correlation 
function  thus  measures  the  higher  probability  of 
finding  galaxies  near  each  other. 

Although  there  are  ways  of  estimating  the  cor¬ 
relation  function  from  the  distribution  on  the  sky, 
a  more  reliable  estimate  can  be  made  by  mapping 
the  distribution  of  galaxies  in  three  dimensions, 
using  redshifts  to  determine  their  distances.  The 
distribution  from  one  such  survey  (SDSS,  Sloan 
Digital  Sky  Survey)  is  shown  in  Fig.  20.17. 

As  expected,  the  general  form  of  the  correla¬ 
tion  function  is  a  power  law  (close  pairs  are  abun¬ 
dant,  distant  pairs  rarer).  It  has  been  known  since 
the  1970’s  that  the  correlation  function  is  approx¬ 
imately  proportional  to  ry .  The  constant  of  pro¬ 
portionality  is  related  to  the  amplitude  <78  and  the 
exponent  y  is  related  to  the  index  n  of  the  initial 
perturbations.  The  observed  value  of  y  is  about 
1.8,  depending  to  some  extent  on  the  sample  of 
objects  studied. 

At  large  separations  the  curve  shows  an  un¬ 
expected  maximum,  observed  for  the  first  time 
in  2005.  This  is  the  same  highest  acoustic  peak 
seen  in  the  angular  power  spectrum  of  Planck. 
The  observation  confirms  the  ACDM  cosmology 
and  shows  that  the  acoustic  waves  born  around 
z  =  1000  have  remained  till  the  present  time  and 
are  seen  in  the  distribution  of  galaxies.  The  same 
peak  has  been  seen  later  in  the  larger  SDSS  sur¬ 
vey. 
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Fig.  20.17  Correlation  function  for  the  distribution  of 
47,000  bright  red  SDSS  galaxies.  The  inset  shows  the 
found  “bump”  in  more  detail.  The  value  of  h  is  the  Hubble 
constant  divided  by  100,  or  about  0.7.  Thus  a  difference 
100/z —  1  corresponds  to  a  distance  of  about  150  Mpc  (about 
6  degrees  on  the  sky).  The  bump  tells  about  acoustic  os¬ 
cillations  of  baryons,  i.e.  large  scale  acoustic  waves  of  the 
visible  matter.  (Eisenstein  et  al.,  2005,  ApJ  633,  560) 

In  order  to  determine  the  cosmological  param¬ 
eters  the  (three-dimensional)  power  spectrum  of 
the  galaxy  distribution  is  usually  used.  It  is  then 
compared  to  theoretical  power  spectra  that  de¬ 
pend  on  the  parameters  in  order  to  find  the  best¬ 
fitting  model.  The  optical  depth  parameter  r  does 
not  affect  these  tests.  The  results  have  been  com¬ 
pletely  consistent  with  the  concordance  model. 

The  Concordance  Model  Again  We  have  dis¬ 
cussed  the  use  of  the  cosmic  microwave  back¬ 
ground  and  the  large-scale  distribution  of  galax¬ 
ies  in  order  to  determine  the  cosmological  param¬ 
eters.  Both  methods  independently  give  consis¬ 
tent  results.  Combining  them  produces  even  more 
(formally)  accurate  values.  Furthermore,  these 
values  are  in  agreement  with  the  ones  obtained 
from  the  traditional  cosmological  tests  discussed 
in  Sect.  20.5. 

There  are  eight  basic  parameters  describing 
the  concordance  model.  According  the  final  re¬ 
sults  of  Planck  (2015)  they  are: 

Hubble  constant  Ho  6.7  db  0.5  kms-1  Mpc-1, 

baryon  density  parameter  0.049  db  0.001, 


mass  density  parameter  0.259  ±  0.006, 
redshift  of  reionisation  zr i  8.8  ±  1.1, 
spectral  index  n  0.967  zb  0.004, 
fluctuation  amplitude  0.82  ±  0.01. 

Other,  less  comprehensive  tests,  such  as  the 
gravitational  lensing  by  cosmic  structures,  the 
velocities  induced  by  mass  concentrations,  and 
the  number  of  clusters  of  galaxies,  have  added 
support  to  the  concordance  model.  This  remark¬ 
able  agreement  between  many  independent  deter¬ 
minations  of  the  cosmological  parameters  have 
earned  the  model  its  name.  Once  the  parameters 
necessary  for  a  general  description  of  the  Uni¬ 
verse  are  reasonably  well  known,  other  compo¬ 
nents,  such  as  gravitational  waves  or  neutrinos, 
can  be  included  in  the  cosmological  model.  This 
new  development  has  been  called  the  era  of  pre¬ 
cision  cosmology.  However,  it  should  still  not  be 
forgotten  that  in  the  model  one  has  to  assume  the 
presence  of  both  dark  matter  and  dark  energy,  nei¬ 
ther  of  which  is  based  on  any  evidence  apart  from 
their  role  in  cosmology. 


20.8  The  Future  of  the  Universe 

The  standard  models  allow  two  alternative  pros¬ 
pects  for  the  future  development  of  the  Uni¬ 
verse.  Expansion  may  either  go  on  forever,  or  it 
may  reverse  to  a  contraction,  where  everything 
is  eventually  squeezed  back  to  a  point.  In  the 
final  squeeze,  the  early  history  of  the  Universe 
would  be  repeated  backwards:  in  turn,  galaxies, 
stars,  atoms  and  nucleons  would  be  broken  up. 
Finally  the  state  of  the  Universe  could  no  longer 
be  treated  by  present-day  physics.  It  is  not  known 
whether  the  collapse  would  continue  to  a  single 
point  or  would  the  Universe  avoid  the  singularity 
and  start  a  new  expansion. 

However,  according  to  the  current  observa¬ 
tions,  this  is  not  the  fate  of  the  Universe.  Instead, 
the  expansion  will  continue  forever,  even  at  an 
accelerating  pace. 

In  the  open  models  the  future  is  quite  differ¬ 
ent.  The  evolution  of  the  stars  may  lead  to  one  of 
four  end  results:  a  white  dwarf,  a  neutron  star  or 
a  black  hole  may  be  formed,  or  the  star  may  be 
completely  disrupted.  After  about  1011  years,  all 
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present  stars  will  have  used  up  their  nuclear  fuel 
and  reached  one  of  these  four  final  states. 

Some  of  the  stars  will  be  ejected  from  their 
galaxies;  others  will  form  a  dense  cluster  at  the 
centre.  In  about  1027  years  the  central  star  clus¬ 
ters  will  become  so  dense  that  a  black  hole  is 
formed.  Similarly  the  galaxies  in  large  clusters 
will  collide  and  form  very  massive  black  holes. 

Not  even  black  holes  last  forever.  By  a  quan¬ 
tum  mechanical  tunnelling  process,  mass  can 
cross  the  event  horizon  and  escape  to  infinity — 
the  black  hole  is  said  to  “evaporate”.  The  rate 
of  this  phenomenon,  known  as  the  Hawking  pro¬ 
cess ,  is  inversely  proportional  to  the  mass  of  the 
hole.  For  a  galactic-mass  black  hole,  the  evapo¬ 
ration  time  is  roughly  1098  years.  After  this  time, 
almost  all  black  holes  will  have  disappeared. 

The  ever  expanding  space  now  contains  black 
dwarfs,  neutron  stars  and  planet- size  bodies  (un¬ 
less  the  predictions  of  a  finite  proton  lifetime  of 
about  1031  years  are  confirmed;  in  that  case  all 
these  systems  will  have  been  destroyed  by  pro¬ 
ton  decay).  The  temperature  of  the  cosmic  back¬ 
ground  radiation  will  have  dropped  to  10-2°  K. 

Even  further  in  the  future,  other  quantum  phe¬ 
nomena  come  into  play.  By  a  tunnelling  pro¬ 
cess,  black  dwarfs  can  change  into  neutron  stars 
and  these,  in  turn,  into  black  holes.  In  this  way, 
all  stars  are  turned  into  black  holes,  which  will 
then  evaporate.  The  time  required  has  been  esti¬ 
mated  to  be  lO10"6  years!  At  the  end,  only  radi¬ 
ation  cooling  towards  absolute  zero  will  remain 
(Fig.  20.18). 


Fig.  20.18  According  to  current  results  the  future  of  the 
Universe  looks  like  this.  After  a  very,  very  long  time  all 
matter  will  have  changed  to  radiation.  The  wavelength  of 
the  radiation  will  increase  infinitely  in  an  ever  expanding 
and  cooling  space 


It  is  of  course  highly  doubtful  whether  our 
current  cosmological  theories  really  are  secure 
enough  to  allow  such  far-reaching  predictions. 
New  theories  and  observations  may  completely 
change  our  present  cosmological  ideas. 


Box  20.1  (Newtonian  Derivation  of  a  Differ¬ 
ential  Equation  for  the  Scale  Factor  R(t))  Let 
us  consider  a  galaxy  at  the  edge  of  a  massive 
sphere  (see  figure).  It  will  be  affected  by  a  cen¬ 
tral  force  due  to  gravity  and  the  cosmological 
force 
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In  these  equations,  the  radius  r  and  the  den¬ 
sity  p  are  changing  with  time.  They  may  be  ex¬ 
pressed  in  terms  of  the  scale  factor  R : 

r  =  (R/R0)r0,  (2) 

where  R  is  defined  to  be  Rq  when  the  ra¬ 
dius  r  =  rp. 


P  =  ( Ro/R)3Po ,  (3) 


where  the  density  p  =  po  when  R  =  Rq.  Intro¬ 
ducing  (2)  and  (3)  in  (1),  one  obtains 


••  a  1 

R  =  ~R 2  +  3AR’  (4) 

where  a  =  4ttGRqPo/3.  If  (4)  is  multiplied  on 
both  sides  by  R ,  the  left-hand  side  yields 


.  ..  1  d (R2) 

RR  =  — - 

2  d  t 
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and  thus  (4)  takes  the  form 


2  a 


2 

dR  +  -ARdR. 
3 


Let  us  define  Ro  =  R(to).  Integrating  (5)  from 
to  to  t  gives 


R2  -Rq=  2 a 


1 

~R 


+  \a(R2- 


The  constants  Ro  and  a  can  be  eliminated  in 
favour  of  the  Hubble  constant  Ho  and  the  den¬ 
sity  parameter  £2$.  Because  pc  =  3H^ /StvG, 

2  a  =  8tvGRqPo/3 

=  HqRqPq/ pc  =  H2R3C20,  (7) 


where  =  Pol  Pc-  Using  expression  (6) 
and  Ro  =  HoRo  in  (7),  and  defining  Qa  — 
A/(3Hq),  one  obtains 


R2 


Ti 2  p 2 


+  1  —  P2q  —  £2  a 


as  the  basic  differential  equation  govern¬ 
ing  R(t). 

For  simplicity  we  now  set  Q a  —  0-  Then  the 
time  behaviour  of  the  scale  factor  R  depends  on 
the  value  of  the  density  parameter  L?o.  Because 

•  r\ 

R  >  0  always,  according  to  (8) 

Ro 

£2o— —  +  1  L  0- 

K 


or 

Ro  ^  ^2o  —  1 
R  P2o 

If  >  1 ,  this  means  that 


R  <  Ro 


P2o 


£?o  —  1 


=  R 


max  ■ 


When  the  scale  factor  reaches  its  maximum 
value  7?max,  then  according  to  (8),  R  =  0,  and 
the  expansion  turns  into  contraction.  If  P2o  <  1 , 


the  right-hand  side  of  (8)  is  always  positive  and 
the  expansion  continues  forever. 

The  equation  for  the  time  dependence  of  the 
scale  factor  in  the  general  theory  of  relativity 
contains  the  constant  k  which  determines  the 
geometry  of  space: 

3  R 

Equations  (10)  and  (6)  (or  (8))  can  be  made 
identical  if  one  chooses 


H2Rl{n0-\)  =  kc2. 


Thus,  complete  agreement  between  the  New¬ 
tonian  and  the  relativistic  equation  for  R  is 
obtained.  The  values  of  the  geometrical  con¬ 
stant  k  =  + 1 ,  0,  —  1  correspond  respectively  to 
>  1 ,  =  1  and  <  1 .  More  generally,  the  con¬ 
dition  for  a  flat  model,  k  =  0,  corresponds  to 
P2o  +  k2  a  =  L 

When  k  =  0,  the  time  dependence  of  the  ex¬ 
pansion  is  very  simple.  Setting  P2o  =  1  and  us¬ 
ing  (10)  and  (7),  one  obtains 


R 


3 

0 


The  solution  of  this  equation  is 


(11) 


It  is  also  easy  to  calculate  the  time  from  the 
beginning  of  the  expansion:  R  =  Rq  at  the  time 


2  1 
U)  =  - 


3  Ho 


This  is  the  age  of  the  Universe  in  the  Einstein- 
de  Sitter  model. 


Box  20.2  (Three  Redshifts)  The  redshift  of 
a  distant  galaxy  is  the  result  of  three  different 
mechanisms  acting  together.  The  first  one  is  the 
peculiar  velocity  of  the  observer  with  respect 
to  the  mean  expansion:  the  Earth  moves  about 
the  Sun,  the  Sun  about  the  centre  of  the  Milky 
Way,  and  the  Milky  Way  and  the  Local  Group 
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of  galaxies  is  falling  towards  the  Virgo  Cluster. 
The  apparatus  measuring  the  light  from  a  dis¬ 
tant  galaxy  is  not  at  rest;  the  velocity  of  the 
instrument  gives  rise  to  a  Doppler  shift  that 
has  to  be  corrected  for.  Usually  the  velocities 
are  much  smaller  than  the  speed  of  light.  The 
Doppler  shift  is  then 

ZD  =  v/c.  (1) 

For  large  velocities  the  relativistic  formula  has 
to  be  used: 


zd  = 


c  +  v 


c  —  V 


The  redshift  appearing  in  Hubble’s  law  is  the 
cosmological  redshift  zc-  It  only  depends  on  the 
values  of  the  scale  factor  at  the  times  of  emis¬ 
sion  and  detection  of  the  radiation  ( R  and  Rf) 
according  to 


zc  =  Ro/R-l.  (3) 


The  third  type  of  redshift  is  the  gravitational 
redshift  zg.  According  to  general  relativity, 
light  will  be  redshifted  by  a  gravitational  field. 
For  example,  the  redshift  of  radiation  from  the 
surface  of  a  star  of  radius  R  and  mass  M  will 
be 


1 

Zg_  y/l-Rs/H 


where  Rs  =  2 GM/c2  is  the  Schwarzschild  ra¬ 
dius  of  the  star.  The  gravitational  redshift  of  the 
radiation  from  galaxies  is  normally  insignifi¬ 
cant. 

The  combined  effect  of  the  redshifts  can  be 
calculated  as  follows.  If  the  rest  wavelength  Ao 
is  redshifted  by  the  amounts  zi  and  Z2  by  two 
different  processes,  so  that 


z  = 


A2  —  Ao 
Aq 


A2  _  ^  _  A2  A^ 
Aq  Ai  Aq 


or 


(1  +  Z)  —  (1  +  zi)(l  +  Z2)« 


Similarly,  the  three  redshifts  zd,  £c  and  Zg  will 
combine  to  give  an  observed  redshift  z,  accord¬ 
ing  to 

1  +  z  =  (1  +  zd)(1  +  zc)(l  3“  Zg).  (5) 


20.9  Examples 

Example  20.1  (a)  In  a  forest  there  are  n  trees 
per  hectare,  evenly  spaced.  The  thickness  of  each 
trunk  is  D.  What  is  the  distance  of  the  wood  not 
seen  for  the  trees?  (Find  the  probability  that  the 
line  of  sight  will  hit  a  trunk  within  a  distance  x.) 
(b)  How  is  this  related  to  the  Olbers  paradox? 

(a)  Imagine  a  circle  with  radius  v  around  the  ob¬ 
server.  A  fraction  s(x),  0  <  s(x)  <  1,  is  cov¬ 
ered  by  trees.  Then  we’ll  move  a  distance  dx 
outward,  and  draw  another  circle.  There  are 
2Ttnxdx  trees  growing  in  the  annulus  lim¬ 
ited  by  these  two  circles.  They  hide  a  dis¬ 
tance  lixxnDdx  or  a  fraction  nD  dx  of  the 
perimeter  of  the  circle.  Since  a  fraction  s(x) 
was  already  hidden,  the  contribution  is  only 
(1  —  s(x))nD  dx.  We  get 

s(x  +  dx)  =  s(x)  +  (l  —  s(x))nD  dx, 
which  gives  a  differential  equation  for  s : 

dx 

This  is  a  separable  equation,  which  can  be  in¬ 
tegrated: 

fs  ds  fx 

I  - =  /  nD  dx. 

Jo  1  —  S  Jo 

This  yields  the  solution 

s(x)  =  l-z~nDx. 

This  is  the  probability  that  in  a  random  direc¬ 
tion  we  can  see  at  most  to  a  distance  v .  This 
function  s  is  a  cumulative  probability  distri¬ 
bution.  The  corresponding  probability  den¬ 
sity  is  its  derivative  ds/dx.  The  mean  free 
path  A  is  the  expectation  of  this  distribution: 
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For  example,  if  there  are  2000  trees  per 
hectare,  and  each  trunk  is  10  cm  thick,  we 
can  see  to  a  distance  of  50  m,  on  the  average, 
(b)  The  result  can  easily  be  generalised  into  three 
dimensions.  Assume  there  are  n  stars  per  unit 
volume,  and  each  has  a  diameter  D  and  sur¬ 
face  A  =  ttD2  perpendicular  to  the  line  of 
sight.  Then  we  have 


s(x)  =  l-e~nAx, 


We  substitute  hv/kT  =  v  and  dv  =  (kT /  h)  dx\ 


N  =  8n 


x2  dx 

ex  -  1‘ 


The  integral  cannot  be  expressed  in  terms  of  ele¬ 
mentary  functions  (however,  it  can  be  expressed 
as  an  infinite  sum  2^7^0(l/n3)),  but  it  can  be 
evaluated  numerically.  Its  value  is  2.4041.  Thus 
the  photon  density  at  2.7  K  is 


where 


X  =  1/nA. 

For  example,  if  there  were  one  sun  per  cubic 
parsec,  the  mean  free  path  would  be  1.6  x 
104  parsecs.  If  the  universe  were  infinitely 
old  and  infinite  in  size,  the  line  of  sight  would 
eventually  meet  a  stellar  surface  in  any  direc¬ 
tion,  although  we  could  see  very  far  indeed. 


Example  20.2  Find  the  photon  density  of  the 
2.7  K  background  radiation. 

The  intensity  of  the  radiation  is 

2hv 3  1 

D  _ _ 

v  c 2  ehv/(kT)  1 


and  the  energy  density 


uv 


An 


B 


c 


v 


8nhv3  1 

q3  Qhv/(kT)  _  j  * 


N 


=  \6tt  ( 


1.3  805  x  10-23  JK-1  x  2.7  K 


V 6.6256  x  10-34  Js  x  2.9979  x  108  ms"1 


x  1.20206 


3.99  x  108  m-3  % 


400  cm 


-3 


20.10  Exercises 

Exercise  20.1  The  apparent  diameter  of  the  gal¬ 
axy  NGC3159  is  1.3',  apparent  magnitude  14.4, 
and  radial  velocity  with  respect  to  the  Milky  Way 
6940  km  s_1 .  Find  the  distance,  diameter  and  ab¬ 
solute  magnitude  of  the  galaxy.  What  potential 
sources  of  error  can  you  think  of? 

Exercise  20.2  The  radial  velocity  of  NGC772 
is  2562  kms-1.  Compute  the  distance  obtained 
from  this  information  and  compare  the  result  with 
Exercise  18.1. 


The  number  of  photons  per  unit  volume  is  found 
by  dividing  the  energy  density  by  the  energy  of 
a  single  photon,  and  integrating  over  all  frequen¬ 
cies: 

C°°  uv  dv  87 r  r°°  2  dv 

N=k  nn  =  nk  v  _  1  • 


Exercise  20.3  If  the  neutrinos  have  nonzero 
mass,  the  universe  can  be  closed.  What  is  the  min¬ 
imum  mass  needed  for  this?  Assume  that  A  =  0, 
the  density  of  neutrinos  is  600  cm-3 ,  and  the  den¬ 
sity  of  other  matter  is  one  tenth  of  the  critical 
density. 
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Is  there  life  elsewhere  in  the  universe?  And  if 
there  is,  are  there  any  intelligent  beings?  How 
did  life  emerge  on  the  Earth,  and,  as  a  matter 
of  fact,  how  do  we  define  life  and  intelligence? 
These  are  probably  the  most  interesting  unsolved 
questions  in  science.  During  the  last  few  decades 
a  whole  new  field  of  astrobiology  has  evolved 
around  these  problems. 


21.1  What  Is  Life? 

There  have  been  several  science  fiction  tv- series 
showing  extraterrestrial  life  forms.  Usually  the 
sentient  beings  have  appeared  quite  humanlike, 
except  of  some  appendices  or  other  rather  trivial 
features  that  try  to  make  them  unattractive.  They 
may  even  have  sex  with  humans.  However,  if  the 
foreign  life  forms  have  evolved  independently,  in¬ 
terbreeding  would  be  totally  impossible. 

The  chemistry  of  foreign  beings  could  be  to¬ 
tally  different  from  ours.  Could  we  even  under¬ 
stand  that  they  are  living  beings?  In  fact,  what  is 
life?  It  seems  that  life  is  an  elusive  concept,  dif¬ 
ficult  to  define  in  terms  of  just  a  few  properties. 
We  have  only  one  example  of  life,  and  therefore 
it  is  difficult  to  make  general  conclusions  of  its 
properties.  However,  we  can  assume  that  certain 
properties  of  the  known  life  forms  may  be  gener¬ 
alised  also  to  foreign  life. 

Common  features  of  all  terrestrial  life  forms 
are  reproduction  and  evolution.  If  living  be¬ 
ings  produced  exact  replicas  of  themselves,  there 
would  be  no  evolution  and  no  adaptation  to 
changing  environments.  Thus  the  reproduction 


process  must  be  slightly  imperfect  leading  to 
a  variety  of  descendants.  This  will  give  material 
for  the  natural  selection,  ‘survival  of  the  fittest’. 

Natural  selection  is  a  fairly  general  principle, 
working  in  some  sense  also  outside  of  biology. 
Are  there  any  other  general  principles  related  to 
the  evolution  of  life?  If  we  could  find  even  one  ex¬ 
ample  of  life  that  evolved  independently  of  ours, 
this  would  vastly  improve  our  knowledge. 

Energy  consumption  is  also  characteristic  of 
life.  Life  requires  increasing  order,  i.e.  decreasing 
entropy.  Local  decrease  of  entropy  is  not  against 
thermodynamics:  it  only  means  that  a  living  be¬ 
ing  must  be  able  to  take  energy  in  some  form 
and  utilise  it  for  reproduction,  growth,  motion  or 
other  purposes. 

To  produce  similar  offsprings  a  living  being 
must  have  the  ability  to  store  information  and 
pass  it  to  its  descendants.  All  terrestrial  life  forms 
use  DNA  or  RNA  molecules  composed  of  nu¬ 
cleotides  for  storing  information  (see  next  sec¬ 
tion). 

Carbon  can  combine  to  form  very  complex 
molecules.  Silicon  can  also  form  large  molecules 
but  they  are  not  as  stable  as  carbon  compounds, 
and  silicon  cannot  form  rings  like  carbon.  Maybe 
some  simple  life  forms  could  be  based  on  silicon, 
or  on  something  quite  different  that  we  have  not 
even  thought  of. 

Also  a  liquid  solvent  is  needed.  Our  own  life 
would  not  be  possible  without  water.  It  remains 
liquid  in  a  much  wider  temperature  range  than 
most  other  substances,  which  makes  it  a  good  sol¬ 
vent.  Yet  in  the  astronomical  sense  the  tempera- 
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ture  range  is  rather  limited.  In  a  colder  environ¬ 
ment,  methane  or  ammonium  might  act  as  the  sol¬ 
vent. 

The  basic  building  block  of  all  terrestrial  life 
forms  is  the  cell.  It  has  a  membrane  surrounding 
liquid  cytoplasm.  The  cell  membrane  is  semiper- 
meable  and  functions  as  a  two-way  filter  that  lets 
certain  molecules  go  in  and  others  come  out;  this 
selective  transport  is  mediated  via  specific  pro¬ 
teinaceous  channels.  There  are  two  kinds  of  cells, 
simpler  prokaryotic  cells  and  more  complex  eu¬ 
karyotic  cells.  In  eukaryotic  cells  the  genetic  ma¬ 
terial,  in  the  form  of  DNA  molecules,  is  inside 
a  nucleus,  surrounded  by  a  nuclear  membrane.  In 
the  prokaryotic  cells  there  is  no  separate  nucleus, 
and  the  DNA  floats  coiled  in  the  cytoplasm. 

Terrestrial  life  is  divided  into  three  domains, 
Bacteria ,  Archaea  and  Eukarya.  Both  Bacteria 
and  Archaea  contain  usually  a  single  prokaryotic 
cell.  Eukarya  contains  all  more  complex  beings, 
like  animals  and  plants. 

According  to  this  scheme  viruses  are  not 
alive  although  they  have  certain  properties  com¬ 
mon  to  living  beings.  There  are  also  some  other 
molecules,  such  as  viroids  and  prions,  that  are  not 
classified  as  living;  yet  they  are  not  quite  inani¬ 
mate. 

If  even  the  definition  of  terrestrial  life  leads  to 
such  borderline  cases,  a  more  general  definition 
of  all  possible  kinds  of  life  is  truly  challenging. 
Facing  this  problem  we  have  to  restrict  our  dis¬ 
cussion  to  life  that,  at  least  to  some  extent,  re¬ 
sembles  our  own. 


21 .2  Chemistry  of  Life 

The  set  of  really  important  elements  is  relatively 
small;  it  includes  hydrogen  (H),  oxygen  (O),  ni¬ 
trogen  (N),  carbon  (C),  sulphur  (S)  and  phosphor 
(P).  The  heavier  elements  can  be  remembered  by 
the  mnemonic  SPONC. 

The  importance  of  carbon  is  in  its  ability  to 
make  lots  of  different  very  complex  molecules, 
which  are  essential  for  life.  There  are  three  ba¬ 
sic  types  of  molecules  that  function  as  common 
building  blocks  to  all  life:  lipids  for  membranes, 
nucleotides  and  amino  acids. 


Amino  acids  consist  of  three  different  compo¬ 
nents,  a  carboxyl  (COOH),  an  amine  part  (NH2) 
and  a  side  chain,  which  can  be  just  a  single  hydro¬ 
gen  or  a  more  complex  structure.  Altogether  there 
are  dozens  of  different  amino  acids,  but  only  20 
of  them  are  used  in  genetically  coded  proteins. 

Amino  acids  can  join  to  form  more  com¬ 
plex  molecules,  proteins.  Typically,  up  to  sev¬ 
eral  hundred  amino  acids  are  needed  for  a  pro¬ 
tein  molecule.  Proteins  have  numerous  functions: 
they  support  structures,  they  act  as  catalysts  in 
nearly  all  biological  reactions,  in  which  case  they 
are  called  enzymes ,  they  carry  messages  as  hor¬ 
mones,  and  so  on. 

Nucleotides  are  basic  building  blocks  of  the 
genetic  material,  DNA  and  RNA.  A  nucleotide 
has  three  components,  a  sugar,  a  phosphate  group 
and  a  base.  The  phosphate  parts  are  always  the 
same.  All  DNA  molecules  have  the  same  sugar 
part;  also  all  RNA  molecules  have  the  same  sugar 
part,  which,  however,  differs  from  the  sugar  of  the 
DNA  molecules  by  having  one  additional  oxygen 
atom.  The  base  can  be  one  of  five  different  types, 
adenine  (A),  guanine  (G),  cytosine  (C),  thymine 
(T),  or  uracil  (U).  DNA  molecules  contain  bases 
A,  G,  C  and  T,  and  RNA  molecules  A,  G,  C 
and  U. 

Nucleotides  join  to  form  long  chains.  The  de¬ 
oxyribonucleic  acid  or  DNA  (Fig.  21.1)  consists 
of  two  such  chains  that  are  bound  together  as  two 
intertwined  helices.  The  corresponding  bases  join 
together  by  hydrogen  bonds.  The  bases  come  al¬ 
ways  in  matching  pairs,  AT,  TA,  CG,  or  GC. 

A  helix  is  like  a  screw  that  can  be  either  left-  or 
righthanded.  All  terrestrial  DNA  molecules  have 
the  same  handedness  or  chirality :  they  are  all  of 
L-type,  or  lefthanded.  The  reason  for  the  asym¬ 
metry  is  not  quite  understood. 

The  DNA  molecule  contains  information  on 
how  to  make  proteins.  Three  consecutive  base 
pairs  form  a  code,  called  a  codon ,  that  speci¬ 
fies  one  amino  acid.  Usually  thousands  of  such 
triplets  together  contain  instructions  for  building 
a  protein. 

The  basic  units  of  heredity  are  called  genes. 
They  are  regions  of  DNA  whose  final  products 
are  either  proteins  or  RNA  molecules. 

Human  cells  contain  some  25,000  genes,  and 
the  DNA  consists  of  3  x  109  base  pairs.  In  plants 
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Fig.  21.1  (a)  The  DNA  molecule  consists  of  two  strands 
of  alternating  phosphate  groups  and  sugars  connected 
by  matching  base  pairs.  When  the  information  stored  in 
a  fragment  of  the  DNA  is  needed,  the  strands  separate  and 
the  bases  are  used  to  produce  an  RNA  molecule  that  will 
contain  the  same  information,  (b)  The  strands  coil  to  form 
a  double  helix.  (Webb:  Where  is  everybody?) 


the  numbers  can  be  much  higher,  but  the  simplest 
known  bacterial  genomes  have  only  a  few  hun¬ 
dred  genes.  It  has  been  estimated  that  the  mini¬ 
mum  number  of  genes  needed  for  a  living  being 
is  about  200-300. 

Actually,  only  a  part  of  the  DNA  contains  ge¬ 
netic  information.  The  rest  is  called  junk-DNA 
since  it  has  no  known  function.  The  fraction  of 


Fig.  21.1  (Continued) 


the  junk-DNA  is  highly  variable;  in  some  bacte¬ 
ria  the  amount  is  very  small. 

The  DNA  is  the  storage  of  the  genetic  code, 
but  the  code  is  not  functional  directly  from  the 
DNA.  Instead  a  complex  molecular  translation 
machinery  is  needed  to  execute  the  instructions. 

First,  the  instructions  encoded  in  the  genetic 
sequence  are  copied  (or  transcribed)  into  another 
type  of  nucleic  acid.  This  ribonucleic  acid  or 
RNA  resembles  the  DNA  but  has  one  more  oxy¬ 
gen  atom  in  its  sugar.  Different  RNA  molecules 
serve  several  roles  in  the  translation  process. 
Messenger  RNA  (mRNA)  carries  the  information 
in  the  DNA  to  an  organ  called  ribosome,  com¬ 
posed  of  several  RNA’s  and  multiple  proteins. 
There  another  RNA,  the  transfer  RNA  (tRNA), 
brings  amino  acids  into  the  reaction  site,  and 
still  another  RNA  molecule,  or  one  of  the  ribo- 
somal  RNA’s  (rRNA)  forms  the  linkages  (peptide 
bonds)  between  the  adjacent  amino  acids. 

RNA  molecules  have  to  carry  instructions  for 
making  just  one  protein,  and  thus  they  are  much 
shorter  than  DNA  molecules.  Still  they  contain 
similar  information,  and  in  some  simple  life 
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forms  like  viruses  they  can  act  as  the  storage  of 
the  genetic  code. 


21.3  Prerequisites  of  Life 

Just  after  the  big  bang  there  were  hardly  any  other 
elements  than  hydrogen  and  helium.  Heavier  ele¬ 
ments  are  needed,  both  for  life  itself  and  for  solid 
planets  on  which  life  can  evolve.  Thus  at  least 
some  of  the  earliest  stars  had  to  explode  as  su¬ 
pernovas  and  eject  their  fusion  products  to  the  in¬ 
terstellar  space. 

Most  stars  in  elliptic  galaxies  and  globular 
clusters  are  very  old  and  have  a  low  metallicity. 
Therefore  they  are  not  very  probable  locations  for 
life.  The  most  suitable  places  for  life  seem  to  be 
the  disk  populations  of  spiral  galaxies,  contain¬ 
ing  young  stars  with  high  abundances  of  heavier 
elements. 

Not  all  places  of  the  galactic  disk  are  equally 
profitable.  Far  out  from  the  galactic  centre  the 
star  birth  rate  and  consequently  the  metallicity  is 
low.  Close  to  the  centre,  metallicity  is  high  but 
the  environment  is  rather  hostile.  Star  density  is 
very  high,  and  therefore  radiation  is  intense,  and 
nearby  stars  disturb  planetary  orbits.  A  rough  es¬ 
timate  is  that  only  about  20  %  of  the  stars  of 
the  Milky  Way  or  a  similar  galaxy  lie  within  the 
galactic  habitable  zone. 

Assume  that  a  star  is  in  the  galactic  habitable 
zone.  The  star  has  its  own  habitable  zone  where 
habitable  planets  can  exist.  This  zone  is  usually 
defined  as  the  region  where  the  temperature  is  be¬ 
tween  the  freezing  and  boiling  points  of  water. 
For  a  fast-rotating  perfect  blackbody  (A  =  0  in 
Eq.  (7.51))  orbiting  the  Sun  this  would  mean  that 
its  distance  should  be  in  the  range  0.56-1.04  au. 
Planets,  however,  are  not  blackbodies,  and  the 
real  situation  is  more  complicated.  If  a  planet  has 
a  high  albedo,  it  reflects  away  most  of  the  inci¬ 
dent  radiation  and  its  temperature  will  be  much 
lower.  But  depending  on  the  chemical  composi¬ 
tion  of  the  atmosphere  and  possible  clouds  the 
greenhouse  effect  may  increase  the  temperature 
considerably.  Many  gases  are  transparent  to  visi¬ 
ble  light,  allowing  it  to  heat  the  surface.  Most  of 
the  energy  is  emitted  back  in  the  infrared  region, 


which  is  effectively  absorbed  by  these  green¬ 
house  gases,  such  as  water  vapour,  carbon  diox¬ 
ide  and  methane,  and  thus  remains  trapped  in  the 
atmosphere. 

If  the  star  is  cool,  the  habitable  zone  is  very 
narrow.  Hot  stars  have  wider  habitable  zones,  but 
their  main  sequence  phase  is  short,  giving  little 
time  for  life  to  evolve.  Thus  main  sequence  stars 
not  too  different  from  the  Sun  are  usually  con¬ 
sidered  the  best  candidates  for  having  habitable 
planets. 

During  the  main  sequence  phase  a  star  will  be¬ 
come  a  little  brighter,  which  will  push  the  hab¬ 
itable  zone  slightly  outwards.  Thus  the  region 
where  the  temperature  remains  suitable  for  a  long 
period  of  time  is  narrower  than  the  habitable  zone 
at  any  given  moment.  The  continuously  habitable 
zone  can  be  defined  as  the  region  that  remains 
habitable  for  a  time  that  is  comparable  to  the  main 
sequence  phase  of  the  star.  For  the  Sun  the  es¬ 
timates  of  the  width  of  this  region  vary  at  least 
from  0.06  au  up  to  0.2  au.  The  problem  is  how  to 
model  albedo  and  greenhouse  effects  over  a  very 
long  period  of  time. 

Binary  stars  are  very  common,  but  at  least  ear¬ 
lier  it  was  thought  that  they  could  not  have  habit¬ 
able  planets,  since  planetary  orbits  would  be  com¬ 
plicated  or  unstable.  However,  there  are  two  kinds 
of  possible  orbits  that  might  be  suitable.  If  the 
components  of  a  binary  are  far  away  from  each 
other,  each  component  could  have  a  planetary 
system  of  its  own.  Or,  if  it’s  a  close  binary,  there 
can  be  distant  planets  orbiting  the  whole  binary 
system. 


21.4  Hazards 

Even  if  life  could  emerge,  there  are  many  hazards 
that  may  wipe  it  out.  By  looking  at  the  Moon  we 
can  see  that  meteor  bombardment  was  very  in¬ 
tense  in  the  young  solar  system.  Collision  of  a  big 
asteroid  or  comet  could  be  fatal,  the  immediate 
devastation  caused  by  the  explosion  being  only 
one  of  the  consequences.  The  collision  would 
eject  a  lot  of  dust  to  the  atmosphere  cooling  the 
climate  for  several  years.  The  mass  extinction  65 
million  years  ago  seems  to  have  been  caused  by 
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Fig.  21.2  In  the  young  solar  system  small  objects  col¬ 
lided  continuously  to  larger  planets,  threatening  seriously 
the  evolution  of  life.  Later  collisions  and  perturbations  by 
planets  have  cleaned  away  most  of  the  potentially  danger¬ 
ous  objects.  Still  collisions  take  place  even  nowadays.  In 
1994  comet  Shoemaker-Levy  collided  to  Jupiter  after  dis¬ 
integrating  into  several  parts.  Traces  of  the  collisions  of 
the  fragments  were  seen  as  dark  spots  in  the  atmosphere  of 
Jupiter.  In  June  20,  1908,  an  explosion  happened  in  Tun- 
guska,  Siberia,  hewing  down  trees  in  a  couple  of  thousand 
square  kilometres.  The  explosion  was  possibly  caused  by 
a  comet  with  a  diameter  of  about  hundred  metres 

such  an  event.  As  the  comet  Shoemaker-Levy  hit¬ 
ting  Jupiter  (Fig.  21.2)  in  1994  showed,  such  col¬ 
lisions  are  still  possible.  Fortunately  they  are  not 
very  frequent  any  more. 

Almost  all  of  the  currently  known  exoplanets 
are  Jupiter-like  giants.  They  seem  to  be  neces¬ 


sary  for  habitable  planets,  because  their  pertur¬ 
bations  clean  the  young  solar  system  from  debris 
by  ejecting  it  outside  the  planetary  system.  How¬ 
ever,  many  of  the  known  giant  planets  move  on 
highly  eccentric  orbits,  and  they  may  disturb  also 
the  orbits  of  earthlike  planets.  Thus  it  is  further 
required  that  the  giant  planets  should  be  on  nearly 
circular  orbits  and  not  too  close  to  the  star. 

Also  smaller  planets  have  participated  in  clear¬ 
ing  the  regions  around  their  orbits,  which  is  re¬ 
flected  in  the  new  definition  of  a  planet  (Sect.  7.1). 

Seasons  depend  on  the  obliquity  of  the  rota¬ 
tion  axis  and  the  eccentricity  of  the  orbit.  High 
values  will  lead  to  strong  seasonal  temperature 
variations.  In  the  case  of  the  Earth  the  Moon 
seems  to  have  a  stabilising  effect;  without  the 
Moon  the  tilt  of  the  axis  would  have  varied  much 
more,  possibly  causing  more  severe  ice  ages  fatal 
to  life.  Hence  also  a  relatively  big  moon  seems  to 
be  in  the  shopping  list  of  a  habitable  planet.  How¬ 
ever,  recently  there  have  been  some  objections  to 
this  requirement. 

We  have  only  recently  started  to  understand 
the  rather  delicate  balance  and  complex  feedback 
effects  working  in  the  atmosphere.  Currently  the 
climate  is  warming  due  to  the  increasing  green¬ 
house  effect,  but  the  Earth  has  experienced  also 
quite  opposite  phases.  If  the  albedo  increases,  the 
amount  of  energy  reaching  the  surface  decreases, 
glaciers  and  snow  cover  expand,  and  the  amount 
of  clouds  increases  till  most  of  the  atmospheric 
humidity  is  solidified  as  snow  and  ice.  All  this 
will  increase  the  albedo  further,  speeding  up  this 
icehouse  effect.  There  is  geological  evidence  of 
global  glaciation  periods  750-580  million  years 
ago  and  possibly  also  2.3  billion  years  ago.  The 
Snowball  Earth  hypothesis  assumes  that  the  cli¬ 
mate  cooled  down  for  millions  of  years  and  the 
whole  surface  was  covered  by  a  layer  of  ice  at 
least  one  kilometer  thick.  During  the  long  cold 
period  most  of  the  living  organisms,  all  of  which 
at  that  time  lived  in  water,  became  extinct.  Vol¬ 
canic  activity  was  still  going  on,  adding  more  car¬ 
bon  dioxide  to  the  atmosphere.  Finally  the  result¬ 
ing  greenhouse  effect  started  to  warm  the  climate. 

There  are  many  factors  that  seem  to  be  crucial 
for  life.  Some  of  them  may  not  look  very  impor¬ 
tant,  but  might  still  have  made  it  impossible  for 
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life  to  emerge.  However,  in  many  cases  we  don’t 
know  how  important  they  really  are  or  if  they  are 
equally  crucial  for  foreign  life  forms. 


21.5  Origin  of  Life 

One  way  to  try  to  understand  the  origin  of  the  ter¬ 
restrial  life  is  to  start  with  the  available  atoms  and 
molecules  and  see  if  they  could  produce  life.  Dur¬ 
ing  the  last  decades  there  has  been  considerable 
progress,  but  the  process  is  very  complicated  and 
not  yet  well  understood.  Here  we  can  only  outline 
briefly  how  it  might  have  happened. 

In  a  famous  experiment  in  1953  Harold  Urey 
and  Stanley  Miller  sent  energy  in  the  form  of 
electric  sparks  through  a  gas  mixture  supposed 
to  be  similar  to  the  early  atmosphere  of  the 
Earth,  containing  methane,  ammonia,  hydrogen 
and  water  vapour.  After  a  few  days  the  solution 
contained  several  organic  compounds,  including 
some  amino  acids.  At  that  time  it  was  assumed 
that  the  early  atmosphere  was  reducing.  More  re¬ 
cent  studies  suggest  that  this  is  not  quite  true,  and 
the  earliest  atmosphere  was  rather  neutral,  con¬ 
taining  mostly  CO2,  CO,  N2,  H2O  and  maybe 
some  H2.  Such  an  atmosphere  would  have  pro¬ 
duced  organic  compounds  much  slower,  if  at  all. 

Some  amino  acids  have  been  found  in  me¬ 
teorites.  Thus  they  seem  to  have  been  already 
present  in  the  nebula  from  which  the  planetary 
system  condensed.  Complex  organic  molecules 
have  been  found  also  in  interstellar  molecule 
clouds  (Sect.  15.3).  There  have  even  been  claims 
of  detecting  the  simplest  amino  acid,  glycine,  but 
the  results  are  controversial. 

The  next  step,  putting  the  basic  blocks  to¬ 
gether  to  form  DNA  or  RNA  molecules,  is  much 
more  difficult.  This  looks  like  the  chicken  and 
egg  paradox:  the  information  contained  in  the 
DNA  is  needed  to  make  proteins,  and  proteins 
are  needed  to  catalyse  the  production  of  the  nu¬ 
cleotides,  which  are  the  building  blocks  of  the  nu¬ 
cleic  acids.  So  which  came  first? 

In  the  1980’s  Sidney  Altman  and  Thomas 
Cech  found  that  some  RNA  molecules  can  act  as 
catalysts.  Since  RNA  resembles  DNA,  it  can  store 
genetic  material  to  some  extent.  Thus  there  is  no 


need  for  the  DNA  and  proteins.  Even  RNA  frag¬ 
ments  cannot  be  synthesised  easily,  but  as  they 
act  as  enzymes  and  can  replicate,  it  is  assumed 
that  the  initial  chemical  evolution  first  led  to  short 
and  relatively  simple  RNA  molecules.  Eventually 
some  of  then  combined  to  more  complex  ones, 
some  of  which  were  better  adapted  to  the  envi¬ 
ronment  either  by  replicating  faster  or  by  being 
more  durable.  Thus  the  natural  selection  started 
to  produce  more  complex  molecules;  this  chem¬ 
ical  evolution  was  working  already  before  actual 
life  emerged. 

The  first  cell-like  structures  could  evolve 
from  asymmetric  molecules  or  lipids,  one  end 
of  which  attracts  water  and  the  other  end  re¬ 
pels  water.  In  water  such  molecules  tend  to  form 
bi-layered  membranes  where  the  hydrophilic  or 
water-attracting  end  points  outwards  and  hy¬ 
drophobic  or  water-repelling  end  inwards.  Fur¬ 
ther  on,  such  membranes  form  spontaneously 
spherical  vesicles.  If  RNA  happened  to  get  in¬ 
side  such  a  membrane,  it  may  have  been  protected 
from  the  environment,  and  could  have  been  con¬ 
tained  within  its  own  chemical  environment.  In 
some  cases  this  could  have  improved  its  replica¬ 
tion,  and  thus  led  to  further  increase  its  concen¬ 
tration  within  the  vesicle. 

A  rather  common  assumption  is  that  the  first 
primitive  life  forms  were  RNA  life.  However,  re¬ 
cently  this  theory  has  been  challenged.  RNA  has 
some  drawbacks.  It  is  not  as  stable  as  DNA,  and 
its  replication  is  not  as  accurate  as  the  protein  me¬ 
diated  replication  of  DNA.  If  the  life  was  initially 
based  on  RNA,  evolution  led  finally  to  the  appear¬ 
ance  of  DNA  molecules.  Since  DNA  is  superior 
to  RNA  due  to  its  stability,  it  soon  took  over  the 
role  of  information  carrier. 

Currently  the  energy  of  sunlight  is  utilised 
by  plants  and  some  bacteria  in  photosynthe¬ 
sis,  which  produces  carbohydrates  from  water 
and  carbon  dioxide.  There  are  also  organisms 
that  do  not  need  sunlight  but  can  use  chemi¬ 
cal  energy  to  produce  organic  matter  in  a  pro¬ 
cess  called  chemosynthesis.  Such  organisms  have 
been  found  e.g.  near  hydrothermal  vents  on  mid¬ 
ocean  ridges  (Fig.  21.3).  These  vents  eject  hot 
mineral-rich  water  to  the  ocean.  Even  though 
the  temperature  can  be  as  high  as  400  °C,  the 
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Fig.  21.3  Black  smokers  in  the  Mid- Atlantic  Ridge  are 
hydrothermal  vents  that  sprout  hot  mineral-rich  water. 
(Photo:  P.  Rona,  Credit:  OAR/National  Undersea  Re¬ 
search  Program  (NURP);  NOAA) 

high  pressure  prevents  the  water  from  boiling. 
Although  this  is  too  hot  for  life,  there  are  re¬ 
gions  around  the  vents  where  the  temperature  is 
suitable  for  such  thermophiles.  They  could  have 
been  the  first  life  forms,  in  which  case  fife  did  not 
emerge  in  a  Darwinian  warm  pond  but  in  a  hot 
pressure  kettle.  However,  this  is  also  a  matter  of 
debate. 

This  kind  of  bottom-up  approach  tries  to  build 
life  from  the  simple  constituents  already  avail¬ 
able  in  the  interstellar  space.  Another  approach, 
the  top-down  method,  tries  to  trace  life  back  in 
time  as  far  as  possible. 

The  oldest  sediment  rocks  on  the  Earth,  found 
in  Isua  in  western  Greenland,  are  3.8  Ga  old. 
Since  they  contain  sediments,  deposited  by  water, 
and  pillow  lavas,  formed  in  water,  the  tempera¬ 
ture  at  that  time  could  not  have  a  value  very  dif¬ 
ferent  from  the  current  one.  The  solar  luminosity 
was  then  lower  than  nowadays,  but  the  difference 


was  compensated  by  a  higher  amount  of  decaying 
radioactive  materials  and  remanent  heat  of  the  re¬ 
cently  bom  Earth. 

Oldest  signs  of  life  are  almost  as  old.  These 
signs  are,  however,  just  isotope  ratios  that  can  be 
interpreted  as  results  of  bacterial  life.  The  carbon 
isotope  12  C  is  about  100  times  as  abundant  as  the 
heavier  isotope  13 C.  The  lighter  isotope  is  some¬ 
what  more  reactive  and  tends  to  be  enriched  in 
living  organisms.  In  the  Isua  rocks  there  are  sed¬ 
iments  with  a  small  excess  of  12 C,  which  might 
indicate  some  kind  of  life. 

In  the  Warrawoona  Group  in  Australia  there 
are  3.5  Ga  old  formations  that  look  like  stro¬ 
matolites,  mounds  consisting  of  layers  of  mi¬ 
crobial  cells  and  calcium  carbonate.  If  they  are 
real  stromatolites,  they  may  have  been  formed  by 
cyanobacteria,  but  this  is  still  a  matter  of  debate. 

In  the  early  times,  at  least  for  a  billion  years, 
photosynthesis  was  non-oxygenic.  Cyanobacte¬ 
ria  were  possibly  the  first  organisms  capable  of 
oxygenic  photosynthesis.  They  started  to  produce 
oxygen,  but  initially  it  was  dissolved  in  water  and 
consumed  in  different  oxidation  reactions.  Even¬ 
tually  also  the  amount  of  atmospheric  oxygen 
started  to  rise,  and  2.2  Ga  ago  it  reached  10  % 
of  the  current  value,  i.e.  about  2  %  of  the  total 
abundance  in  the  atmosphere. 

First  eukaryotes  appeared  in  the  fossil  record 
2.1  Ga  ago  and  multicellular  organisms  1.5  Ga 
ago.  The  fossil  evidence  becomes  much  clearer 
towards  the  end  of  the  Proterozoic  era.  The  Edi- 
acara  fauna,  which  is  about  600  million  years  old, 
contains  the  oldest  fossils  of  big  and  complex  an¬ 
imals.  These  were  softbodied  animals.  At  the  end 
of  the  Cambrian  period  543  million  years  ago 
traces  of  the  Ediacara  fauna  disappear  and  are  re¬ 
placed  by  a  huge  variety  of  new  animals,  many 
with  protecting  shields.  This  increase  in  the  vari¬ 
ety  of  life  forms  is  called  the  Cambrian  explosion. 

All  life  forms  use  similar  genetic  codes,  which 
indicates  that  they  have  the  same  origin.  This 
forefather  of  all  life  is  called  LUCA,  the  Last  Uni¬ 
versal  Common  Ancestor. 

Relationships  of  living  beings  can  be  studied 
by  comparing  their  DNA  or  RNA.  The  more  the 
molecules  of  two  species  differ,  the  more  distant 
the  species  are  in  the  evolutionary  sense.  These 
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Fig.  21.4  A  simplified 
phylogenetic  tree. 

A  branch  is  the  older  the 
closer  it  is  to  the  last 
common  ancestor,  LUCA. 
(Adapted  from  Webb: 
Where  is  everybody?) 


distances  can  be  plotted  as  a  map,  called  the  phy¬ 
logenetic  tree  (Fig.  21.4). 

The  phylogenetic  tree,  as  we  now  know  it,  has 
three  branches,  the  domains  of  Archaea ,  Bacte¬ 
ria  and  Eukarya.  The  organisms  closest  to  the 
root  are  thermophiles  that  live  close  to  hydrother¬ 
mal  vents  or  in  hot  water.  Obviously,  the  LUCA 
lived  in  such  a  hot  environment.  However,  RNA 
molecules  do  not  remain  intact  in  such  hot  en¬ 
vironments.  If  the  earliest  life  was  RNA  life,  it 
would  have  evolved  in  a  cooler  environment.  Cur¬ 
rently  we  do  not  know  the  real  birthplace  of  life. 

Although  the  phylogenetic  tree  points  to  a 
common  origin,  there  may  have  been  other  starts, 
too,  but  natural  selection  has  eliminated  the  other 
ones  that  were  less  competitive. 


21 .6  Are  We  Martians? 

Mars  and  the  Earth  have  very  similar  rotation  pe¬ 
riods  and  axial  tilts.  Since  the  notorious  ‘discov¬ 
ery’  of  canals,  Mars  and  its  inhabitants  have  been 


a  popular  subject  in  science  fiction  stories.  Later 
observations  revealed  a  very  thin  atmosphere  and 
low  temperature,  which  make  Mars  a  rather  hos¬ 
tile  place.  Finally,  the  Viking  landers  showed 
a  marred  planet.  Yet  the  possibility  of  some  sim¬ 
ple  life  forms  cannot  be  excluded. 

In  1984  a  meteorite  was  found  in  the  Allan 
Hills  region  in  Antarctica  and  labelled  as  ALH 
84001  (Fig.  21.5).  The  piece  of  rock  was  esti¬ 
mated  to  be  3.9  Ga  old.  Chemistry  of  the  mete¬ 
orite  shows  that  it  had  originated  on  Mars;  an  im¬ 
pact  had  thrown  it  to  an  orbit  that  brought  it  to  the 
Earth. 

In  1996  a  group  of  NASA  scientists  an¬ 
nounced  that  the  meteorite  contained  structures 
resembling  fossilised  microbes  and  compounds 
that  could  be  products  of  living  organisms,  such 
as  polycyclic  aromatic  hydrocarbons  (or  PAH) 
and  magnetite.  However,  they  can  be  produced  by 
other  processes,  too.  Not  surprisingly,  the  results 
and  the  implications  of  Martian  life  caused  a  lot 
of  scepticism.  Only  further  Mars  expeditions  and 
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Fig.  21.5  The  meteorite 
ALH  84001  found  in 
Antarctica  has  a  chemical 
composition  that  indicates 
its  Martian  origin.  The 
meteorite  contains 
substances  that  can  be 
produced  by  living 
organisms.  The  wormlike 
structures  resemble 
bacteria.  It  is,  however, 
reasonable  to  ask  whether 
one  single  sample  can 
prove  that  once  there  was 
life  on  Mars.  (NASA) 


possible  in  situ  experiments  can  decide  whether 
there  has  been  life  on  Mars. 

In  case  there  has  really  been  life  on  Mars,  there 
are  several  possibilities: 

•  Life  originated  independently  on  the  Earth  and 
Mars. 

•  Life  originated  only  on  the  Earth  and  was  then 
transported  to  Mars. 

•  Life  originated  only  on  Mars  and  was  trans¬ 
ported  to  the  Earth. 

It  seems  that  life  on  the  Earth  emerged  almost 
as  soon  as  the  conditions  became  favourable.  It 
has  been  argued  that  the  life  appeared  even  too 


quickly.  This  problem  would  be  solved  if  life 
originated  on  Mars.  The  surface  of  the  more  dis¬ 
tant  and  smaller  Mars  had  cooled  down  faster 
to  become  habitable  before  the  Earth.  Thus  life 
would  have  had  more  time  to  evolve  on  Mars, 
and  was  transferred  to  the  Earth  when  conditions 
here  became  suitable.  Thus  our  earliest  ancestors 
could  be  Martian  bacteria.  Presently  such  consid¬ 
erations  are,  however,  just  speculations. 

The  idea  of  life  spreading  from  one  celestial 
body  to  another  is  known  as  panspermia.  The 
idea  dates  back  to  the  antiquity,  but  its  first  seri¬ 
ous  advocate  was  the  Swedish  chemist  Svante  Ar- 
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rhenius,  who  published  a  book  on  the  subject  in 
1908.  Of  the  later  proponents,  Sir  Fred  Hoyle  was 
the  most  famous.  Panspermia  fit  well  to  his  cos¬ 
mology:  the  universe  had  no  beginning,  neither 
did  life,  but  had  always  existed.  Thus  the  tough 
problem  of  the  origin  of  life  was  neatly  avoided. 

Now  panspermia,  in  a  certain  more  limited 
sense,  begins  to  seem  a  little  more  possible  the¬ 
ory.  Primitive  life  forms  can  survive  inside  me¬ 
teorites  in  the  coldness  and  lethal  radiation  of 
the  interplanetary  space  long  enough  to  travel 
from  one  planet  to  another.  Interstellar  distances, 
though,  are  too  long,  and  the  probability  of  a  me¬ 
teoroid  from  one  planetary  system  hitting  another 
system  is  too  low.  It  seems  obvious  that  our  life 
has  originated  here  in  our  own  solar  system. 


21 .7  Life  in  the  Solar  System 

Once  there  may  have  been  life  on  Mars.  Although 
probes  have  not  detected  signs  of  life,  it  is  not 
impossible  that  there  might  still  be  some  mi¬ 
croscopic  life,  but  we  cannot  expect  to  find  any 
macroscopic  life  forms.  The  same  is  true  for  other 
places  in  our  solar  system. 

Mercury  has  no  atmosphere,  Venus  is  too  hot, 
and  the  giant  planets  have  no  solid  surface.  Be¬ 
sides  the  Earth  and  Mars  this  leaves  only  some 
satellites  as  possible  habitats.  It  has  also  been 
speculated  that  there  might  be  living  things  float¬ 
ing  in  the  atmospheres  of  the  giant  planets,  but 
the  emergence  and  evolution  of  such  things  seems 
rather  improbable. 

Europa  as  well  as  some  other  icy  satellites 
are  nowadays  considered  potential  places  for  life. 
Their  surfaces  are  too  cold,  but  tidal  heating 
keeps  the  interiors  warm  enough.  Big  satellites 
rotate  synchronously,  but  if  the  orbit  is  not  per¬ 
fectly  circular,  the  orbital  velocity  varies  accord¬ 
ing  to  Kepler’s  second  law.  Thus  the  satellite  li- 
brates  just  like  our  Moon,  which  means  that  the 
direction  of  the  tidal  distortion  keeps  changing. 
Also  the  distance  to  the  planet  varies,  and  there¬ 
fore  the  magnitude  of  the  tidal  force  varies,  too. 
These  effects  deform  the  satellite  continuously 
giving  rise  to  tidal  heating. 

Europa’s  surface  is  covered  by  ice.  In  some 
places  the  ice  cover  is  broken  into  plates  that  ob¬ 


viously  have  moved  relative  to  each  other.  The  ro¬ 
tation  period  of  the  surface  differs  from  the  rota¬ 
tion  period  of  the  magnetic  field,  which  is  frozen 
to  the  interior.  Observations  seem  to  indicate  that 
the  ice  cover  floats  on  an  ocean. 

The  illumination  under  the  ice  is  too  dim  for 
photosynthesis,  but  there  might  be  thermal  vents 
as  in  the  oceans  of  the  Earth.  Hence  the  ocean 
could  be  a  habitat  for  microbes  that  can  utilise 
the  thermal  energy. 

Titan  is  the  only  satellite  with  a  thick  atmo¬ 
sphere.  The  atmosphere  is  also  rich  in  organic 
compounds,  like  methane.  Methane  dissociates 
rapidly,  and  thus  the  high  methane  content  means 
that  there  must  be  a  source  of  new  methane.  Liv¬ 
ing  organisms  are  one  such  source,  but  because 
of  its  coldness  Titan  does  not  look  like  a  promis¬ 
ing  place  for  life.  A  more  plausible  theory  was 
that  there  was  a  methane  ocean  on  Titan  but  the 
Huygens  probe  revealed  a  rather  dry  landscape, 
which,  however,  has  signs  of  liquid  flows.  Radar 
images  sent  by  the  Cassini  probe  show  some  dark 
areas  that  might  be  methane  lakes  (Fig.  8.25). 


21.8  Detecting  Life 

If  we  find  a  potentially  habitable  planet,  is  there 
any  hope  that  we  can  see  if  life  has  emerged  on 
it?  The  question  can  be  answered  by  studying 
whether  we  can  find  life  on  the  Earth  from  satel¬ 
lite  observations.  In  1990  the  Galileo  probe  made 
just  such  experiments,  and  it  seems  that  it  is  in¬ 
deed  possible  to  detect  life,  at  least  the  kind  of 
life  we  have  on  the  Earth.  Similar  observations  of 
the  Moon  showed  no  traces  of  life. 

The  detection  is  based  on  spectroscopic  obser¬ 
vations  that  can  reveal  some  signatures  of  life. 
These  signatures  are  in  the  infrared  part  of  the 
spectrum,  thus  requiring  observations  made  out¬ 
side  the  atmosphere. 

There  are  two  emission  features  that  are  strong 
indicators  of  life,  ozone  and  methane.  Photosyn¬ 
thesis  is  the  most  probable  source  of  molecular 
oxygen,  which  is  then  broken  into  two  oxygen 
atoms  by  ultraviolet  radiation.  The  free  oxygen 
atoms  join  to  molecules  to  form  ozone.  Methane 
is  also  produced  by  living  beings.  It  is  quickly  ox¬ 
idised,  and  has  to  be  continuously  replenished  to 
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keep  the  level  noticeable.  However,  there  can  be 
large  reservoirs  of  methane,  particularly  in  cold 
environments.  Thus  methane  itself  is  not  a  sign 
of  life,  but  if  it  is  found  together  with  ozone,  the 
evidence  becomes  more  convincing. 

Another  feature  is  the  infrared  reflectance 
spectrum  of  the  green  plants.  Chlorophyll  absorbs 
visible  light,  particularly  blue  and  red,  but  there 
is  a  distinct  cutoff  called  the  red-edge,  seen  as 
a  steep  gradient  of  the  spectrum  between  690  and 
740  nm.  Longer  wavelengths  are  very  effectively 
reflected  to  avoid  excessive  heating. 


21 .9  SETI — Detecting  Intelligent  Life 

Mankind  has  been  sending  radio  transmissions 
for  almost  a  century.  Our  radio  signals  are  now 
filling  a  sphere  with  a  radius  of  almost  one 
hundred  lightyears.  Another  civilisation  orbiting 
a  nearby  star  might  be  able  to  pick  up  this  trans¬ 
mission  with  a  big  radio  telescope.  Such  leakage 
radiation  is,  however,  very  weak.  Sensitivity  of 
our  own  radio  receivers  has  increased  enormously 
since  they  were  invented.  Thus  it  has  been  possi¬ 
ble  to  reduce  the  power  of  the  transmitters,  and 
the  signals  leaking  to  space  have  become  weaker. 
Also,  more  and  more  signals  are  sent  in  cables 
and  optical  fibres.  If  another  civilisation  has  un¬ 
dergone  similar  development,  detecting  leakage 
signals  is  extremely  difficult.  Chances  are  much 
better  if  the  signal  has  been  sent  intentionally  to¬ 
wards  potential  receivers  in  the  hope  that  some¬ 
body  will  detect  it. 

One  might  think  that  galaxies  and  star  clusters 
are  worth  listening,  since  there  are  many  stars  in 
the  narrow  beam  of  the  telescope.  Unfortunately, 
it  is  not  quite  so.  Other  galaxies  are  so  far  away 
that  the  signal  could  be  too  weak  to  be  detected. 
Globular  clusters  consist  of  very  old  stars  with 
low  metallicity.  Thus  the  probability  of  finding 
a  habitable  planet  is  tiny  indeed.  Open  clusters 
are  relatively  young,  and  life  may  not  have  had 
time  to  evolve  to  a  communicating  civilisation. 
Thus  galaxies  and  star  clusters  are  not  the  best 
places  to  search  for  signs  of  life. 

What  frequency  should  we  use?  If  the  sending 
and  receiving  party  have  developed  radio  astron¬ 
omy,  they  must  be  aware  of  certain  common  fre¬ 
quencies,  like  the  hydrogen  21  cm  radiation.  Such 


a  wavelength  itself  may  not  be  a  good  choice  be¬ 
cause  of  the  background  noise,  but  some  of  its 
multiples  or  a  sum  of  two  common  frequencies 
might  fall  in  the  quiet  part  of  the  radio  spectrum. 
A  good  frequency  could  be  the  H2O  maser  emis¬ 
sion  at  22  GHz.  Around  this  frequency  the  sky 
is  pretty  quiet  except  for  a  few  sources.  But  it 
is  not  enough  to  listen  to  those  frequencies  only, 
since  they  are  Doppler  shifted  due  to  the  relative 
motion  of  the  transmitter  and  receiver.  And  if  the 
transmitter  and/or  receiver  are  on  planets  orbiting 
a  star,  the  Doppler  shift  will  change  periodically. 
Fortunately,  current  receivers  are  capable  of  fol¬ 
lowing  millions  of  frequencies  simultaneously. 

Radio  emission  from  natural  sources  can  be 
steady  noise  or  vary  in  a  periodic,  quasiperiodic 
or  chaotic  manner.  If  we  want  to  send  a  signal  to 
be  recognised  as  artificial,  it  should  contain  a  pat¬ 
tern  that  cannot  arise  naturally.  It  could  e.g.  con¬ 
tain  an  increasing  number  of  pulses  representing 
the  first  few  prime  numbers. 

Although  most  SETI  research  concentrates  on 
radio  frequencies,  also  optical  wavelengths  have 
recently  been  considered  seriously.  Pulsed  lasers 
pack  a  lot  of  energy  to  a  short  pulse  (lasting  typi¬ 
cally  one  nanosecond)  confined  to  a  very  narrow 
wavelength  band  and  a  narrow  beam.  The  flash 
can  be  even  brighter  than  the  central  star.  If  such 
a  signal  were  pointed  towards  us,  it  should  be  rel¬ 
atively  easy  to  detect.  Such  optical  SETI  research, 
or  OSETI,  has  already  been  started,  but  it  is  still 
behind  the  radioastronomical  SETI. 

In  1974  Frank  Drake  used  the  Arecibo  radio 
telescope  to  send  a  message  towards  the  globular 
cluster  Ml 3  (which  is  not  a  good  place  for  life). 
The  message  contained  1679  pulses  (Fig.  21.6). 
This  number  has  exactly  two  factors,  23  and  73. 
Thus  the  receiver,  who  obviously  must  under¬ 
stand  some  mathematics,  could  guess  that  the 
message  contains  a  two-dimensional  picture.  If 
we  ever  detect  such  a  message,  we  can  be  pretty 
certain  of  its  artificial  origin,  even  if  we  were  not 
able  to  interpret  the  message. 

The  first  serious  SETI  project  (Search  for  Ex- 
traTerrestrial  Civilisations)  was  carried  out  in 
1960,  also  by  Frank  Drake.  This  project  Ozma 
observed  two  nearby  stars,  r  Cet  and  e  Eri,  at  the 
21  cm  wavelength.  Since  then  radio  technology 
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Fig.  21.6  The  signal  sent  with  the  Arecibo  radio  tele¬ 
scope  consists  of  1679  pulses.  The  number  1679  has  only 
two  factors,  23  and  73,  so  obviously  the  signal  represents 
a  twodimensional  image  of  23  x  73  pixels.  A  lot  of  in¬ 
formation  about  the  sender  is  coded  in  the  image,  but  it 
is  difficult  to  say  whether  the  receiving  civilisation  could 
interpret  it.  In  any  case,  the  artificial  origin  of  the  signal  is 
indisputable 

has  improved  enormously,  and  currently  we  are 
able  to  listen  to  a  huge  number  of  channels  at  the 
same  time. 

There  are  two  basic  search  strategies.  In  a  tar¬ 
geted  search  we  listen  to  a  few  objects  that  are  po¬ 
tential  candidates  for  harbouring  life.  In  a  wide- 
sky  survey  large  areas  of  the  sky  are  scanned. 
Most  current  projects  belong  to  the  latter  cate¬ 
gory. 


Observing  time  with  large  telescopes  is  ex¬ 
pensive,  and  priority  is  given  to  projects  that  can 
be  expected  to  produce  positive  results.  In  some 
projects,  like  SERENDIP,  this  problem  is  avoided 
by  having  the  receiver  sit  piggyback  on  some 
other  instrument  and  listen  to  whatever  the  tele¬ 
scope  happens  to  be  observing.  Thus  the  SETI 
project  does  not  need  any  dedicated  time  of  its 
own.  The  drawback  is,  of  course,  that  many  of 
the  target  areas  may  not  be  interesting  in  the  SETI 
sense. 

Detecting  a  potentially  artificial  signal  among 
the  huge  amount  of  data  requires  a  lot  of  comput¬ 
ing  power.  The  seti@home  project  has  connected 
millions  of  computers  to  an  enormous  virtual  ma¬ 
chine  to  analyse  the  data.  Anyone  with  a  com¬ 
puter  connected  to  the  Internet  can  load  a  screen¬ 
saver  program  that  will  automatically  fetch  pack¬ 
ets  of  data  and  send  back  the  results. 

Thus  far  not  a  single  message  sent  by  another 
civilisation  has  been  confirmed.  There  have  been 
some  interesting  cases,  but  they  have  been  single 
bursts  of  unknown  origin.  They  have  not  been  de¬ 
tected  later,  not  even  with  more  sensitive  instru¬ 
ments. 


21.10  Number  of  Civilisations 

Although  no  extraterrestrial  civilisation  has  been 
found,  we  can  try  to  estimate  their  number.  The 
SETI  pioneer  Frank  Drake  suggested  a  formula 
for  calculating  the  number  of  civilisations  in  the 
Milky  Way  capable  of  communication  at  a  given 
instant: 

N  =  R  x  fp  x  fh  x  fi  x  fi  x  fc  x  L,  (21.1) 

where  N  is  the  total  number  of  communicating 
civilisations  in  the  Milky  Way,  R  is  the  annual 
birth  rate  of  stars,  /p  is  the  fraction  of  stars  pos¬ 
sessing  planets,  /h  is  the  fraction  of  planets  be¬ 
ing  habitable,  f\  is  the  fraction  of  habitable  plan¬ 
ets  having  some  kind  of  life,  /  is  the  fraction  of 
these  planets  having  intelligent  life,  fc  is  the  frac¬ 
tion  of  intelligent  civilisations  that  have  devel¬ 
oped  means  for  interstellar  communication,  and 
finally  L  is  the  time  in  years  that  such  a  civilisa¬ 
tion  has  been  communicating.  All  the  /-factors 
are  probabilities  that  are  in  the  range  [0,  1]. 
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The  astronomical  factors  ( R ,  /p  and  /h)  are 
the  only  ones  that  are  known  with  any  accuracy. 
The  biological  factors,  f\  and  f\ ,  involve  a  lot  of 
guesswork.  The  last  two  factors,  fc  and  L,  are 
even  harder,  since  they  are  related  to  the  socio¬ 
logical  behaviour  of  the  exocivilisation. 

Actually,  the  formula  was  intended  as  the  ba¬ 
sis  of  the  agenda  of  an  influential  SETI  meeting 
held  in  Green  Bank  in  1961.  It  splits  the  prob¬ 
lem  nicely  into  smaller  subproblems  that  can  be 
discussed  separately.  But  using  the  formula  to 
find  the  actual  number  of  civilisations  is  not  very 
meaningful,  since  so  many  of  the  factors  are  to¬ 
tally  unknown.  In  the  most  “optimistic”  case  we 
could  find  out  that  the  distance  between  neigh¬ 
bouring  civilisations  is  just  a  few  parsecs,  but  giv¬ 
ing  the  probabilities  small  (possibly  more  realis¬ 
tic)  values,  their  product  might  be  so  minute  that 
we  ought  to  be  alone  in  the  Milky  Way.  At  least 
the  formula  shows  how  little  we  know. 

Even  if  favourable  conditions  and  evolution  of 
communicating  civilisations  were  relatively  com¬ 
mon,  the  last  factor  may  turn  out  to  be  the  limiting 
one.  If  the  lifetime  of  a  civilisation  is  short  com¬ 
pared  to  the  age  of  the  universe,  the  chances  of 
hearing  a  message  from  another  star  are  poor. 

Earlier  many  astronomers  seemed  to  think  that 
exocivilisations  would  not  be  that  rare,  while  bi¬ 
ologists  showed  that  the  evolution  of  life  had  so 
many  obstacles  that  we  should  not  expect  to  find 
other  civilisations  in  our  neighbourhood.  Now  we 
understand  better  both  the  biochemistry  of  early 
life  and  the  many  problems  in  having  a  habitable 


planet.  Although  opinions  vary  considerably,  we 
might  guess  that  very  simple  microbe-like  life  is 
relatively  common,  but  intelligent,  communicat¬ 
ing  beings  might  be  extremely  rare. 


21.11  Exercises 

Exercise  21.1  Calculate  the  limits  of  the  habit¬ 
able  zone  of  the  Sun  assuming  the  planet  is  a  fast- 
rotating  blackbody  with  a  Bond  albedo  of  0.3. 
What  is  the  continuously  habitable  zone,  if  the  lu¬ 
minosity  of  the  Sun  was  originally  0.7  times  the 
current  value? 

Exercise  21.2  Assume  there  are  n  stars  in  a  cu¬ 
bic  parsec  and  a  fraction  p  of  them  have  com¬ 
municating  civilisations.  What  is  the  average  dis¬ 
tance  between  two  neighbouring  civilisations? 
Apply  the  result  to  the  solar  vicinity.  The  stellar 
density  can  be  estimated  from  Table  C.17.  What 
is  the  average  distance  between  nearest  civilisa¬ 
tions,  if  the  probability  of  a  star  having  a  planet 
with  a  civilisation  is  (a)  0.01,  (b)  0.00001? 

Exercise  21.3  An  asteroid  with  a  diameter  of  one 
hundred  metres  is  approaching  the  Earth.  Esti¬ 
mate  the  minimum  value  of  the  kinetic  energy  re¬ 
leased  in  the  collision.  Compare  the  result  with 
the  Hiroshima  atomic  bomb.  The  energy  of  the 
bomb  was  equivalent  to  15  kilotons  of  TNT.  One 
ton  of  TNT  corresponds  to  the  energy  of  4. 184  x 
109  joules. 


Exoplanets 


22 


The  Copernican  principle  means  that  the  Earth 
has  gradually  lost  its  special  status:  it  is  just  a 
planet  orbiting  a  relatively  common  star  in  an  or¬ 
dinary  galaxy.  But  how  common  planetary  sys¬ 
tems  are?  Nowadays  we  think  that  they  belong  to 
the  normal  life  of  a  star.  However,  it  is  difficult 
to  make  ground  based  observations  of  exoplan¬ 
ets,  i.e.  planets  orbiting  some  other  star  than  our 
Sun.  The  light  reflected  by  the  planet  is  so  dim 
that  it  is  swamped  with  the  light  of  the  star.  The 
situation  is,  however,  improving  thanks  to  evolv¬ 
ing  observation  methods. 


22.1  Other  Planetary  Systems 

At  the  beginning  of  2016  over  1300  planetary 
systems  and  a  total  of  over  2000  planets  had  been 
found.  The  numbers  have  increased  rapidly  dur¬ 
ing  the  last  few  years.  It  seems  that  the  birth  of 
planetary  systems  is  more  and  more  closely  re¬ 
lated  to  the  birth  process  of  stars. 

Due  to  the  observing  methods  most  of  the 
known  exoplanets  are  rather  massive.  Recently 
also  planets  about  the  size  of  the  Earth  have  been 
detected.  The  next  big  question  is  if  any  of  them 
is  suitable  for  harbouring  life. 

Origin  of  planetary  systems  have  been  studied 
using  theoretical  calculations.  Most  calculations 
have  dealt  with  single  stars  around  which  stable 
planetary  orbits  are  easier  to  find.  However,  stable 
orbits  are  also  possible  around  wide  binary  stars 
either  close  to  one  of  the  components,  or  so  far 
that  the  planet  orbits  both  components. 


The  Milky  Way  contains  about  1011  stars. 
Maybe  109— 1010  of  them  have  suitable  condi¬ 
tions  for  a  planetary  system. 

The  first  exoplanet  was  found  in  1992.  It  was 
quite  a  strange  object,  orbiting  a  pulsar.  Pulsars 
are  final  stages  of  stellar  evolution,  and  planets 
orbiting  the  star  should  be  destroyed. 

Some  earlier  observations  had  given  hints 
about  exoplanets  but  they  were  not  conclusive. 
The  first  exoplanet  orbiting  an  ordinary  star  was 
found  in  1995  around  ft  Pictoris. 

In  addition  to  planets  orbiting  a  star  there  are 
some  indications  of  rogue  planets  that  have  es¬ 
caped  from  planetary  systems.  Star  density  in  star 
birth  regions  can  be  so  high  that  the  perturba¬ 
tions  of  other  stars  throw  planets  away  from  their 
original  orbits.  Such  runaway  planets  are  even 
more  difficult  to  detect  than  planets  orbiting  other 
stars. 


22.2  Observational  Methods 

Although  only  a  few  planets  can  be  observed 
directly  with  current  methods,  the  existence  of 
planets  can  be  deduced  using  several  indirect 
methods. 

Astrometric  methods  are  based  on  perturba¬ 
tions  of  the  proper  motion  or  radial  velocity  of 
the  star.  If  the  mass  of  the  star  is  big  enough  the 
centre  of  mass  will  be  clearly  displaced  from  the 
star.  For  example,  the  centre  of  mass  of  our  solar 
system  is  outside  the  surface  of  the  Sun,  and  the 
Sun  as  well  the  planets  orbit  around  this  point. 

459 


©  Springer- Verlag  Berlin  Heidelberg  2017 

H.  Karttunen  et  al.  (eds.),  Fundamental  Astronomy,  DOI  10.1007/978-3-662-53045-0_22 


460 


22  Exoplanets 


Therefore  the  motion  of  the  Sun  through  space  is 
not  linear  but  shows  particularly  the  12  year  pe¬ 
riod  of  Jupiter’s  orbital  motion. 

Earliest  observations  were  based  on  similar  as¬ 
trometric  measurements  as  with  binary  stars.  The 
wobble  of  the  star  caused  by  a  planet  is  so  minute 
that  the  results  were  not  conclusive.  The  advan¬ 
tage  of  the  method  is  that  it  does  not  depend  on 
the  angle  between  the  orbital  plane  and  the  line  of 
sight. 

Much  more  precise  results  have  been  obtained 
by  studying  the  periodic  changes  of  the  radial  ve¬ 
locities  of  stars  using  the  Doppler  effect.  The  ra¬ 
dial  velocities  of  many  nearby  stars  show  indeed 
periodic  variations  indicating  that  they  have  mas¬ 
sive  invisible  companions.  The  method  makes 
it  possible  to  detect  Jupiterlike  planets  around 
nearby  planets,  but  smaller  ones  are  too  light  to 
cause  detectable  variations  in  the  radial  veloc¬ 
ity  of  the  star.  This  is  why  most  of  the  currently 
known  exoplanets  are  more  massive  than  Jupiter. 

This  method  works  the  better  the  closer  the  ob¬ 
server  is  to  the  orbital  plane  of  the  orbit.  If  the  or¬ 
bital  plane  is  almost  perpendicular  to  the  line  of 
sight  no  Doppler  shift  cannot  be  seen. 

A  third  method  is  based  on  transits  of  plan¬ 
ets:  when  a  planet  is  between  the  star  and  the  ob¬ 
server  it  occults  a  part  of  the  stellar  disk.  Such 
events  have  already  been  observed  with  ground 
based  telescopes,  but  especially  the  Kepler  space¬ 
craft  sent  by  NASA  in  2009  has  investigated  them 
by  following  about  one  hundred  thousand  stars  in 
the  galactic  habitable  zone.  The  method  works, 
of  course,  only  if  the  observer  is  very  close  to  the 
orbital  plane  of  the  planet.  Yet  over  a  thousand 
planets  have  been  found,  many  of  which  are  about 
the  size  of  the  Earth.  Planetary  transits  are  seen 
as  very  regular  dimming  of  the  star.  At  the  same 
time  the  satellite  has  observed  also  other  kinds  of 
brightness  variations,  thus  advancing  studies  of 
variable  stars. 

A  fourth  method  is  gravitational  lensing.  If  a 
star  passes  in  front  of  a  background  star  its  grav¬ 
ity  will  bend  the  light,  and  the  background  star 
will  look  brighter.  If  the  front  star  has  planets  the 
brightening  is  slightly  stronger  than  without  plan¬ 
ets. 

Also  rogue  planets  can  be  detected  with  the 
transit  or  gravitational  lens  methods.  Then  the 


brightness  will  change  only  once  and  for  a  very 
short  period.  Interpretation  of  such  events  is  prob¬ 
lematic:  is  it  really  a  planet  or  a  phenomenon  of 
the  star  itself. 

A  fifth  method  is  timing,  which  is  well  applied 
to  regularly  variable  stars.  A  planet  causes  the 
distance  of  the  star  vary  slightly,  which  changes 
the  periodicity  of  the  brightness  variation  of  the 
star  in  a  regular  manner. 

This  was  the  method  that  gave  the  first  con¬ 
vincing  evidence  of  exoplanets  in  1992  when  the 
pulsation  of  the  pulsar  PSR  B 1257+ 12  showed 
regular  variation.  The  discovery  was  strange 
since  a  pulsar  should  not  have  planets.  The  ob¬ 
jects  may  be  second  generation  planets  that  con¬ 
densed  from  the  material  ejected  by  a  supernova 
explosion.  Later  the  same  method  has  also  re¬ 
vealed  some  planets  around  variable  stars. 

Direct  imaging  of  exoplanets  is  not  yet  an  ef¬ 
fective  search  method.  The  first  images  show  ob¬ 
jects  whose  existence  has  already  been  confirmed 
with  other  methods  (Fig.  22.1). 


Fig.  22.1  A  detailed  image  of  the  circumstellar  disk 
around  yd  Pictoris  was  obtained  with  the  ESO  ADONIS 
adaptive  optics  system  at  the  3.6  m  telescope  at  La  Silla, 
Chile,  and  the  Observatoire  de  Grenoble  coronagraph  at 
the  wavelength  of  1.25  microns  in  1996.  The  full  extent 
of  the  disk  is  about  1500  AU.  The  area  around  the  star 
masked  by  the  coronagraph  is  only  24  AU  (correspond¬ 
ing  to  a  distance  from  the  Sun  to  halfway  between  Uranus 
and  Neptune).  No  planets  are  directly  seen  but  their  grav¬ 
itational  effects  can  be  detected  as  a  bending  of  the  main 
plane  in  the  inner  part  of  the  disk.  (ESO) 
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Also  young  recently  born  stars  can  indicate 
planetary  systems.  Many  stars  are  surrounded  by 
a  dust  disk  (Fig.  22.2).  The  easiest  way  to  explain 
the  infrared  radiation  of  some  young  stars  is  to 
assume  a  massive  disk  surrounding  the  star  and 
interacting  with  the  star.  Later  the  disk  can  evolve 
to  a  planetary  system.  Such  disks  have  been  de¬ 
tected  around  e.g.  Vega  and  /3  Pictoris. 


2MASSWJ 1 207334-393254 


^  77B  mas 

55  AU  at  70  pc 


N 


E  — 


The  Brown  Dwart  2M1207  and  its  Planetary  Companion 

(VLT/NACO) 


bSO  PH  Phjip  MoiW  (30  Ap.il  £005) 


Fig.  22.2  The  first  image  of  an  exoplanet  was  obtained 
in  2004  with  the  VLT  telescope.  The  mass  of  the  planet  is 
about  five  times  the  mass  of  Jupiter  and  it  orbits  a  brown 
dwarf  at  a  distance  of  55  AU.  (ESO) 


22.3  Properties  of  Exoplanets 

For  practical  reasons  it  is  easiest  to  find  big  plan¬ 
ets  orbiting  close  to  a  star.  They  affect  the  posi¬ 
tion  of  the  star  most  strongly,  and  due  to  the  short 
orbital  period  the  periodicity  of  the  variations  is 
most  easy  to  detect.  Therefore  many  of  the  exo¬ 
planets  are  such  “hot  jupiters”.  This  is  seen  in  the 
distribution  of  Fig.  22.3.  A  great  majority  of  the 
exoplanets  found  this  far  are  clearly  more  mas¬ 
sive  than  the  Earth.  This  is  another  example  of  a 
statistical  bias  caused  by  a  selection  effect  typ¬ 
ical  in  astronomy.  There  may  be  a  lot  of  Earth¬ 
sized  planets  but  they  are  extremely  difficult  to 
observe. 

The  planetary  orbits  of  our  own  solar  system 
are  nearly  circular.  If  Mercury  is  excluded  the 
eccentricity  of  every  orbit  is  less  than  0.1.  Or- 


eccentricity 


Fig.  22.4  Distribution  of  the  eccentricities  of  the  known 
exoplanets 


Fig.  22.3  Orbits  and 
masses  of  some  extrasolar 
planets.  Open  circles  are 
planets  orbiting  a  pulsar. 
Planetary  masses  are  given 
in  Jupiter’s  masses.  The 
figure  contains  some  1200 
planets  that  were  known 
reasonably  well  at  the 
beginning  of  2016. 
Actually,  a  lot  more  planets 
have  been  detected,  but 
their  data  are  yet  quite 
uncertain 
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bits  of  the  exoplanets,  on  the  other  hand,  can  be 
very  eccentric  (Fig.  22.4).  Theories  concerning 
the  evolution  of  planetary  systems  must  be  able  to 
explain  how  the  eccentricities  decrease  by,  for  ex¬ 
ample,  collisions  of  minor  bodies  or  mutual  per¬ 
turbations  of  planets. 

When  also  solid  earthlike  planets  can  be  ob¬ 
served  the  next  question  is,  if  any  of  them  has  an 
environment  suitable  for  life.  After  that  we  can 
investigate  if  their  spectra  show  features  typical 
for  life  mentioned  in  the  previous  chapter. 


22.4  Exercises 

Exercise  22.1  How  much  the  apparent  place  of 
the  Sun  oscillates  due  to  Jupiter’s  orbital  motion 
as  seen  from  the  distance  of  10  parsecs. 

Exercise  22.2  A  distant  observer  detects  the 
brightness  of  the  Sun  changing  when  Jupiter  tran¬ 
sits  the  Sun.  How  big  is  this  change  in  magni¬ 
tudes?  And  what  is  the  change  when  the  Earth 
transits  the  Sun? 


Mathematics 


A 


A.1  Geometry 

Units  of  Angle  and  Solid  Angle  Radian  is  the 
angular  unit  most  suitable  for  theoretical  studies. 
One  radian  is  the  angle  subtended  by  a  circular 
arc  whose  length  equals  the  radius.  If  r  is  the  ra¬ 
dius  of  a  circle  and  s  the  length  of  an  arc,  the  arc 
subtends  an  angle 

a  =  s/r. 


Since  the  circumference  of  the  circle  is  Inr , 
we  have 


surface  of  a  sphere  with  radius  r  subtends  a  solid 
angle 

co  =  A/r2. 

Since  the  area  of  the  sphere  is  4nr2,  a  full 
solid  angle  equals  4n  steradians. 

Circle 

Area  A  =  nr2. 

1  9 

Area  of  a  sector  As  =  -ar  . 


In  rad  =  360°  or  1  rad  =  180°/7T.  Sphere 


In  an  analogous  way  we  can  define  a  stera- 
dian ,  a  unit  of  solid  angle,  as  the  solid  angle  sub¬ 
tended  by  a  unit  area  on  the  surface  of  a  unit 
sphere  as  seen  from  the  centre.  An  area  A  on  the 


Area  A  =  4nr2. 

4  o 

Volume  V  =  -nr  . 

3 

2  2 

Volume  of  a  sector  Vs  =  -nr  h 

3 

2  , 

=  -nr  (1  -  cosa)  =  Vspherehava. 

Area  of  a  segment  As  =  2nrh  =  2nr  (1  —  cosa) 
=  A  Sphere  hav  OL. 
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A.2  Conic  Sections 

As  the  name  already  says,  conic  sections  are 
curves  obtained  by  intersecting  circular  cones 
with  planes. 


Ellipse  Equation  in  rectangular  coordinates 


a  =  the  semimajor  axis, 
b  ~  the  semiminor  axis  b  =  a\J  1  —  e 2, 
e  =  eccentricity  0  <  e  <  1 . 

Distance  of  the  foci  from  the  centre  c  =  ea. 
Parameter  (semilatus  rectum)  p  =  a(  1  —  e1). 
Area  A  =  nab. 


Equation  in  polar  coordinates 

P 

r  = - , 

l  -he  cos  f 

where  the  distance  r  is  measured  from  one  focus, 
not  from  the  centre. 

When  e  =  0,  the  curve  becomes  a  circle. 

Hyperbola  Equations  in  rectangular  and  polar 
coordinates 

2  2 

p 

a2  b2  1  +  ecos  / 


Eccentricity  e  >  1 . 

Semi-minor  axis  b  =  a\J e2  —  1. 

Parameter  p  =  a(e2  —  1). 

Asymptotes  y  =  ±^x. 

Parabola  Parabola  is  a  limiting  case  between 
the  previous  ones;  its  eccentricity  is  e  =  1. 
Equations 

2  P 

x  =  —  ay  ,  r= - . 

1  +  cos  / 


Distance  of  the  focus  from  the  apex  h  =  l /4  a. 
Parameter  p  =  1/2 a. 
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A.3  Taylor  Series 

Let  us  consider  a  differentiable  real- valued  func¬ 
tion  of  one  variable  /  :  R  — >  R.  The  tangent  to 
the  graph  of  the  function  at  xo  is 

y  =  f  (*o)  +  f'(x o)(x  -  x0), 

where  f'(x o)  is  the  derivative  of  /  at  xo.  Now, 
if  v  is  close  to  xo,  the  graph  of  the  tangent  at  x 
will  not  be  very  far  from  the  graph  of  the  function 
itself.  Thus,  we  can  approximate  the  function  by 


1 


\  —  x 


—  1  T  x  T  x  2  -|-  x  3  T 


\f  1  +  x  — 


1  1  o  1 

1  H — v - x  H - v 

2  8  16 


\pi  —  x 
1 

VTTx 

l 


=  l  - 


=  l  - 


1  l 

-v - V' 

2  8 

1  3  , 
-v  H — v' 

2  8 


1  3 

- X  — 

16 

5  3 

- X  — 

16 


1  3  9  5  o 

1  +  -X  +  -xz  H - xJ  + 

2  8  16 


f(x)  «  /(x0)  +  f\x o)(x  -  x0). 

The  approximation  becomes  worse,  the  more  the 
derivative  f'  varies  in  the  interval  [xo,x].  The 
rate  of  change  of  /'  is  described  by  the  second 
derivative  /",  and  so  on.  To  improve  accuracy, 
we  have  to  also  include  higher  derivatives.  It  can 
be  shown  that  the  value  of  the  function  /  at  x  is 
(assuming  that  the  derivatives  exist) 

fix)  =  fix  0)  +  //(x0)(x  -  x0) 

+  -f"ixo)ix  —  xo)2  +  •  •  • 

3 — r  /(w)(x o)(x  —  xo)n  +  •  •  •  , 
nl 

where  f^n)  (xo)  is  the  nth  derivative  at  xo  and  nl  is 

the  n -factorial,  nl  =  1-2-3 . n.  This  expansion 

is  called  the  Taylor  series  of  the  function  at  xq. 


The  following  list  gives  some  useful  Taylor  se¬ 
ries  (in  all  these  cases  we  have  xq  =  0): 


- =  1  —x  +x2  —  JC3  H - 

l+x 


converges  if  |x  |  <  1 


ex  =  1  +  x  +  -xz  +  -xJ  +  •  •  •  +  — xj 


1 


1 


.2  ,  „3  .  i 


1 


2!  3! 


n 


+ 


converges  for  all  x 


ln(  1  +  x)  =  x - x2  H — x3 - x4  + 

2  3  4 

x  £  (  1,  1] 


1  3  1  5 

sinx  =  x - x  H - x  — 

3!  5! 


1  9  1  4 

cosx  =  1 - x  H - x  — 

2!  4! 


1  3  2  5 

tanx  =  x  H — x  H - x  + 

3  15 


for  all  x 


for  all  x 


71 

|x|  <  — 
2 


Many  problems  involve  small  perturbations,  in 
which  case  it  is  usually  possible  to  find  expres¬ 
sions  having  very  rapidly  converging  Taylor  ex¬ 
pansions.  The  great  advantage  of  this  is  the  re¬ 
duction  of  complicated  functions  to  simple  poly¬ 
nomials.  Particularly  useful  are  linear  approxima¬ 
tions,  such  as 


1 


s/ 1  +  x  ~  1  +  —x, 


1 


/l+x 


1 

1 - X, 

2 


etc. 


A.4  Vector  Calculus 

A  vector  is  an  entity  with  two  essential  proper¬ 
ties:  magnitude  and  direction.  Vectors  are  usually 
denoted  by  boldface  letters  «,  b ,  A,  B  etc.  The 
sum  of  the  vectors  A  and  B  can  be  determined 
graphically  by  moving  the  origin  of  B  to  the  tip  of 
A  and  connecting  the  origin  of  A  to  the  tip  of  B . 
The  vector  —A  has  the  same  magnitude  as  A,  is 
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parallel  to  A,  but  points  in  the  opposite  direction. 
The  difference  A  —  B  is  defined  as  A  +  (— B). 


is 

A  +  B  =  ( ax  +  bx)i  +  ( ay  +  by)  j  +  (< az  +  bz)k. 

The  magnitude  of  a  vector  r  in  terms  of  its  com¬ 
ponents  is 

+  z2. 

The  scalar  product  of  two  vectors  A  and  B  is 
a  real  number  (scalar) 


r  =  r  = 


Addition  of  vectors  satisfies  the  ordinary  rules 
of  commutativity  and  associativity, 

A  T  B  =  B  T-  A, 

A  +  ( B  +  C)  =  (A  +  B)  +  C. 


A  point  in  a  coordinate  frame  can  be  specified 
by  giving  its  position  or  radius  vector ,  which  ex¬ 
tends  from  the  origin  of  the  frame  to  the  point. 
The  position  vector  r  can  be  expressed  in  terms 
of  basis  vectors ,  which  are  usually  unit  vectors , 
i.e.  have  a  length  of  one  distance  unit.  In  a  rectan¬ 
gular  vyz-frame,  we  denote  the  basis  vectors  par- 

A  A  -/V 

allel  to  the  coordinate  axes  by  i ,  j  and  k.  The  po¬ 
sition  vector  corresponding  to  the  point  (x,y,z) 
is  then 

A  A  A 

r  =xi  +  yj  +  zk. 

The  numbers  x,  y  and  z  are  the  components  of  r. 
Vectors  can  be  added  by  adding  their  compo¬ 
nents.  For  example,  the  sum  of 

/V  A  /V 

A  =  axi  +  ayj  +  azk , 

B  =bxi  +byj  +bzk, 


A  •  B  =  axbx  -\-ayby  +azbz  =  |A||Z?|  cos  (A,  B), 

where  (A,  B)  is  the  angle  between  the  vectors  A 
and  B.  We  can  also  think  of  the  scalar  product 
as  the  projection  of,  say,  A  in  the  direction  of  B 
multiplied  by  the  length  of  B.  If  A  and  B  are 
perpendicular,  their  scalar  product  vanishes.  The 
magnitude  of  a  vector  expressed  as  a  scalar  prod¬ 
uct  is  A  =  | A |  =  \J A  •  A. 

The  vector  product  of  the  vectors  A  and  B  is 
a  vector 

/V  A 

A  x  B  =  ( aybz  —  azby)i  +  (< azbx  —  axbz)  j 

A 

+  (axby  -  aybx)k 

A  A  A 

i  j  k 

—  ax  ay  a  7  • 
bx  by  bz 

This  is  perpendicular  to  both  A  and  B .  Its  length 
gives  the  area  of  the  parallelogram  spanned  by 
A  and  B.  The  vector  product  of  parallel  vec¬ 
tors  is  a  null  vector.  The  vector  product  is  anti- 
commutative: 

A  x  B  =  —B  x  A. 


Scalar  and  vector  products  satisfy  the  laws  of 
distributivity: 

A(B  +  C)  =  AB  +  AC, 
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Ax(Z?  +  C)  =  AxZ?  +  AxC, 
c A  +  B)C  =  AC  +  BC , 

(A  +  B)xC  =  AxC  +  BxC. 


A  scalar  triple  product  is  a  scalar 


Ax  B  C 


ax  ay 

bx  by 

Cx  Cy 


Cz 


Here  the  cross  and  dot  can  be  interchanged  and 
the  factors  permuted  cyclically  without  affecting 
the  value  of  the  product.  For  example  Ax  B  C  = 
BxCA  =  BCxA ,  but  A  x  B  C  =  —B  x 
AC. 

A  vector  triple  product  is  a  vector,  which  can 
be  evaluated  using  one  of  the  expansions 


Ax(B  xC)  =  B(A  C)  -  C(A  B), 
(Ax  B)xC  =  B(A  C)  -  A(B  C). 


When  computing  a  derivative  of  a  vector  product, 
one  must  be  careful  to  retain  the  order  of  the  fac¬ 
tors,  since  the  sign  of  the  vector  product  changes 
if  the  factors  are  interchanged. 


A.5  Matrices 

Assume  we  have  a  vector  x  with  components 
(. x,y,z ).  We  can  calculate  another  vector  x '  = 
(xf,y',zf),  the  components  of  which  are  linear 
combinations  of  the  original  components: 


In  all  these  products,  scalar  factors  can  be 
moved  around  without  affecting  the  product: 

A  kB  =k(A-  B), 

A  x  (B  x  kC)  =  k(A  x  (B  x  C)). 

The  position  vector  of  a  particle  is  usually 

A  /V 

a  function  of  time  r  =  r(t)  =  x(t)i  +  y(t)j  + 

A 

z(t)k.  The  velocity  of  the  particle  is  a  vector, 
tangent  to  the  trajectory,  obtained  by  taking  the 
derivative  of  r  with  respect  to  time: 

(J  A  A  /V 

v  =  —r(t)  =  r  =  xi  +  yj  +  zk. 
at 

The  acceleration  is  the  second  derivative,  r. 

Derivatives  of  the  various  products  obey  the 
same  rules  as  derivatives  of  products  of  real¬ 
valued  function: 

d 

—  ( AB)  =  AB  +  AB , 
d  t 

d 

—  (A  xB)  =  AxB-\-AxB. 

dr 


x'  =  a\\x  +any  +  a\yz, 

/  =  a2ix  +a22y  +  a23Z, 
z  =a2 \x  +  a22y  +  a^z- 

This  is  a  linear  transform  that  maps  the  vector  x 
to  a  vector  jc'. 

We  can  collect  the  coefficients  to  an  array, 
called  a  matrix  A : 

(an  ci  12  #13 

#21  #22  #23 

#31  #32  #33 

A  general  matrix  can  consist  of  an  arbitrary  num¬ 
ber  of  rows  and  columns.  In  this  book  we  need 
only  matrices  operating  on  vectors  of  a  three- 
dimensional  space,  and  they  always  have  three 
rows  and  columns.  Two  subscripts  refer  to  the  dif¬ 
ferent  elements  of  the  matrix,  the  first  one  giving 
the  row  and  the  second  one  the  column. 

When  using  matrix  formalism  it  is  convenient 
to  write  vectors  in  the  form  of  column  vectors: 

*  -  G 
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We  now  define  that  the  product  of  a  matrix  and 
a  column  vector 


the  previous  example  in  the  reverse  order,  we  get 
quite  a  different  result: 


or 


x' 

=  Ax 

an 

<312 

<313 

a2\ 

<322 

<323 

<331 

<332 

<333 

/I  2  0\  /I  1  1\  /I  3  5  \ 

2  1  10  1  2  =  3  5  7  . 

\1  3  2/  \1  2  3/  V3  8  13/ 

A  unit  matrix  I  is  a  matrix,  which  has  ones  on 
its  diagonal  and  zeros  elsewhere: 


means  just 

x  =a\\x  +  a\2y  +<313  z, 
yf  =  a2\x  +  a22y  +  a22z, 
z  =  a2\x  +  a22y  +  a^z. 

Comparing  these  equations  we  see,  for  example, 
that  the  first  component  of  x'  is  obtained  by  tak¬ 
ing  the  first  row  of  the  matrix,  multiplying  the 
components  of  the  vector  x  by  the  correspond¬ 
ing  components  of  that  row,  and  finally  adding 
the  products. 

This  definition  can  easily  be  generalised  to  the 
product  of  two  matrices.  The  elements  of  the  ma¬ 
trix 

C  =  AB 


If  a  vector  or  a  matrix  is  multiplied  by  a  unit  ma¬ 
trix,  it  will  remain  unchanged. 

If  the  product  of  two  matrices  is  a  unit  matrix, 
the  two  matrices  are  inverse  matrices  of  each  oth¬ 
ers.  The  inverse  matrix  of  A  is  denoted  by  A-1. 
It  satisfies  the  equations 

A~1A  =  AA~l  =/. 

In  spherical  astronomy  we  need  mainly  rota¬ 
tion  matrices ,  describing  the  rotation  of  a  coordi¬ 
nate  frame.  The  following  matrices  correspond  to 
rotations  around  the  x,  y  and  z  axes,  respectively: 


are 

C j  =  ^  Jttjkbkj • 
k 

This  is  easy  to  remember  by  noting  that  we  take 
the  row  i  of  the  first  factor  A  and  the  column  j  of 
the  second  factor  B  and  evaluate  the  scalar  prod¬ 
uct  of  the  two  vectors.  For  example 


r 1  v1 2  °\ 

0  1  2  2  1  1 

\1  2  3/  \1  3  2/ 


/1+2+1  2+1+3  0  +  1  +  2\ 

0+2+2  0+1+6  0+1+4 
\1  +  4  +  3  2  +  2  +  9  0  +  2  +  6/ 

/  4  6  3 

4  7  5 

\8  13  8 


When  multiplying  matrices,  we  have  to  be 
careful  with  the  order  of  the  factors,  because  usu¬ 
ally  AB  ^  BA.  If  we  multiply  the  matrices  of 


Rx(a)  = 


Ry(a )  = 


Rz(a)  = 


1 


0 


0 


0  COS  O' 

sin  a 

0  —  sin  a 

COSO' 

COS  O' 

0 

sin  a 

0 

1 

0 

—  sin  a 

0 

COSO' 

COS  O' 

sin  a  0 

—  sin  a 

cos  <x  0 

0 

0  1 

If  the  angle  is  a  =  0,  only  a  unit  matrix  remains. 

The  elements  of  a  rotation  matrix  can  easily 
be  determined.  For  example,  a  rotation  around  the 
x  axis  will  leave  the  x  coordinate  unaffected,  and 
thus  the  first  row  and  column  must  be  zeroes,  ex¬ 
cept  for  the  diagonal  element,  which  must  be  one. 
This  will  leave  four  elements.  When  the  angle  is 
zero,  the  matrix  has  to  reduce  to  a  unit  matrix; 
thus  the  diagonal  elements  must  be  cosines  and 
the  other  ones  sines.  The  only  problem  is  to  de¬ 
cide,  which  of  the  sines  will  get  the  minus  sign. 
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This  is  most  easily  done  by  testing  the  effect  of 
the  matrix  on  some  basis  vector. 

The  inverse  matrix  of  a  rotation  matrix  cor¬ 
responds  to  a  rotation  in  the  opposite  direction. 
Thus  it  is  obtained  from  the  original  matrix  by 
replacing  the  angle  a  by  —a.  The  only  change 
in  the  matrix  is  that  the  signs  of  the  sines  are 
changed. 

For  example,  the  precession  matrix  is  a  prod¬ 
uct  of  three  rotation  matrices.  Since  the  matrix 
product  is  not  commutative,  these  rotations  must 
be  carried  out  in  the  correct  order. 


The  surface  need  not  be  confined  to  a  plane. 
For  example,  the  area  of  a  sphere  is 

A=  I  dS, 

Js 

where  the  integration  is  extended  over  the  sur¬ 
face  S  of  the  sphere.  In  this  case  the  surface  el¬ 
ement  is 

d S  =  R 2  cos  6  dip  d 0, 
and  the  area  is 


A.6  Multiple  Integrals 

An  integral  of  a  function  /  over  a  surface  A 

1=1  fdA 
Ja 

can  be  evaluated  as  a  double  integral  by  express¬ 
ing  the  surface  element  dA  in  terms  of  coordinate 
differentials.  In  rectangular  coordinates, 

dA  =  dx  d y 


and  in  polar  coordinates 

dA  =  r  dr  dip. 


The  integration  limits  of  the  innermost  integral 
may  depend  on  the  other  integration  variable. 
For  example,  the  function  xcy  integrated  over  the 
shaded  area  is 


xcy  d A  =  /  /  xcy  dx  dy 

A  J  x  =  0  J  y  =  0 


-L 

=  f  [|^xvev]dv=  f  (xc2x—x)dx 

Jo  Jo 


1 


0 


-xe2x  —  -e2x  —  -x2  =  I(e2  —  l) 
2  4  2  4V  7 


R  cos  6  dcp 


Similarly,  a  volume  integral 

1=  f  fdV 
Jv 
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can  be  evaluated  as  a  triple  integral.  In  rectangu-  A. 7 
lar  coordinates,  the  volume  element  dV  is 


Numerical  Solution 
of  an  Equation 


dV  =  dx  dydz; 

in  cylindrical  coordinates 

dV  =  r  dr  dip  d z, 

and  in  spherical  coordinates 

dV  =  r 2  cos  6  dr  dcp  d 0 

(i 0  measured  from  the  xy  plane) 


or 


dV  =  r2  sin  6  dr  dcp  d 0 

(i 0  measured  from  the  z  axis) . 


For  example,  the  volume  of  a  sphere  with  ra¬ 
dius  R  is 


V=  dV 


2 JT  fTT  /2 


l 

n  R  pziT  r* 

Jr  =  0  J(p  =  0  J6  =  —  n/2 

n2i r 

\\-n/2r2  S'md]  drd<P 


r  cos  9  dr  dcp  dO 


n/2  2  • 

-x/2r  Sln 


R  r2jt 


= 

Jo  Jo 


2 r2  dr  dcp 


=L 


R 


=  I  4nr2  dr  = 


R 


0 


4jtr' 


=  -7 lR\ 


We  frequently  meet  equations  defying  analyti¬ 
cal  solutions.  Kepler’s  equation  is  a  typical  ex¬ 
ample.  If  we  cannot  do  anything  else,  we  can 
always  apply  some  numerical  method.  Next  we 
shall  present  two  very  simple  methods,  the  first 
of  which  is  particularly  suitable  for  calculators. 

Method  1:  Direct  Iteration  We  shall  write  the 
equation  as  f(x)  =  x.  Next  we  have  to  find  an 
initial  value  vo  for  the  solution.  This  can  be 
done,  for  example,  graphically  or  by  just  guess¬ 
ing.  Then  we  compute  a  succession  of  new  it¬ 
erates  x\  =  f{x o),  X2  =  fix i),  and  so  on,  un¬ 
til  the  difference  of  successive  solutions  becomes 
smaller  than  some  preset  limit.  The  last  iterate  Xi 
is  the  solution.  After  computing  a  few  x;  ’s,  it  is 
easy  to  see  if  they  are  going  to  converge.  If  not, 
we  rewrite  the  equation  as  /_1(x)  =  x  and  try 
again.  (/-1  is  the  inverse  function  of  /.) 

As  an  example,  let  us  solve  the  equation  x  = 
—  lnx.  We  guess  xo  =  0.5  and  find 

x\  =  —  ln0.5  =  0.69,  X2  =  0.37,  X3  =  1.00. 


This  already  shows  that  something  is  wrong. 
Therefore  we  change  our  equation  to  x  =  e-x  and 
start  again: 


xq  =  0.5, 

xj  =  e-0'5  =  0.61, 
X2  =  0.55, 

X3  =  0.58, 

X4  =  0.56, 

X5  =  0.57, 
x6  =  0.57. 
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Thus  the  solution,  accurate  to  two  decimal  places, 
is  0.57. 

Method  2.  Interval  Halving  In  some  patholog¬ 
ical  cases  the  previous  method  may  refuse  to  con¬ 
verge.  In  such  situations  we  can  use  the  foolproof 
method  of  interval  halving.  If  the  function  is  con¬ 
tinuous  (as  most  functions  of  classical  physics 
are)  and  we  manage  to  find  two  points  x\  and  X2 
such  that  f{x i)  >  0  and  f(x2)  <  0,  we  know 
that  somewhere  between  x\  and  X2  there  must  be 
a  point  v  in  which  f(x)  =  0.  Now  we  find  the 
sign  of  /  in  the  midpoint  of  the  interval,  and  se¬ 
lect  the  half  of  the  interval  in  which  /  changes 
sign.  We  repeat  this  procedure  until  the  interval 
containing  the  solution  is  small  enough. 

We  shall  try  also  this  method  on  our  equation 
v  =  —  lnv,  which  is  now  written  as  f(x)  =  0, 
where  f(x)  =  v  +  Inx.  Because  f(x)  — >  —  oo, 
when  v  — >  0  and  /( 1)  >  0,  the  solution  must  be 
in  the  range  (0,  1).  Since  /(0.5)  <  0,  we  know 


that  v  G  (0.5,  1).  We  continue  in  this  way: 


/ (0.75)  >  0 
/ (0.625)  >  0 
/ (0.563)  <  0 
/ (0.594)  >  0 


v  g  (0.5,  0.75), 
v  g  (0.5,  0.625), 
v  g  (0.563,  0.625), 
x  g  (0.563,0.594). 


The  convergence  is  slow  but  certain.  Each  iter¬ 
ation  restricts  the  solution  to  an  interval  which  is 
half  as  large  as  the  previous  one,  thus  improving 
the  solution  by  one  binary  digit. 


Theory  of  Relativity 


B 


Albert  Einstein  published  his  special  theory  of 
relativity  in  1905  and  the  general  theory  of  rel¬ 
ativity  ten  years  later.  Especially  the  general  the¬ 
ory,  which  is  essentially  a  gravitation  theory,  has 
turned  out  to  be  very  important  for  the  theories  of 
the  evolution  of  the  Universe.  Therefore  it  is  ap¬ 
propriate  to  consider  here  some  basic  principles 
of  relativity  theory.  A  more  detailed  discussion 
would  require  sophisticated  mathematics,  and  is 
beyond  the  scope  of  this  very  elementary  book. 


B.1  Basic  Concepts 

Everyone  knows  the  famous  Pythagorean  theo¬ 
rem 

As2  =  Ax2  +  Ay2, 

where  As  is  the  length  of  the  hypotenuse  of 
a  right-angle  triangle,  and  Ax  and  Ay  are  the 
lengths  of  the  other  two  sides.  (For  brevity,  we 
have  denoted  As2  =  (As)2 .)  This  is  easily  gener¬ 
alised  to  three  dimensions: 

As2  =  Ax2  +  Ay2  +  Az2. 

This  equation  describes  the  metric  of  an  ordinary 
rectangular  frame  in  a  Euclidean  space,  i.e.  tells 
us  how  to  measure  distances. 

Generally  the  expression  for  the  distance  be¬ 
tween  two  points  depends  on  the  exact  location 
of  the  points.  In  such  a  case  the  metric  must  be 
expressed  in  terms  of  infinitesimal  distances  in 
order  to  correctly  take  into  account  the  curva¬ 
ture  of  the  coordinate  curves.  (A  coordinate  curve 


is  a  curve  along  which  one  coordinate  changes 
while  ah  the  others  remain  constant.)  An  in¬ 
finitesimal  distance  ds  is  called  the  line  element. 
In  a  rectangular  frame  of  a  Euclidean  space  it  is 

ds2  =  dx2  +  dy2  +  dz2, 

and  in  spherical  coordinates 

ds2  =  dr2  +  r2  (dO2  +  cos2  9  df2) . 
Generally  ds2  can  be  expressed  as 

dV  =  Y:  g,j  dxj  dx  j , 

where  the  x*  ’s  are  arbitrary  coordinates,  and  the 
coefficients  gij  are  components  of  the  metric  ten¬ 
sor.  These  can  be  functions  of  the  coordinates, 
as  in  the  case  of  the  spherical  coordinates.  The 
metric  tensor  of  an  n -dimensional  space  can  be 
expressed  as  an  n  x  n  matrix.  Since  dx*  dxj  = 
dxjdxi ,  the  metric  tensor  is  symmetric,  i.e.  gij  = 
gji .  If  ah  coordinate  curves  intersect  perpendic¬ 
ularly,  the  coordinate  frame  is  orthogonal.  In  an 
orthogonal  frame,  gij  =  0  for  ah  i  /  j .  For  exam¬ 
ple,  the  spherical  coordinates  form  an  orthogonal 
frame,  the  metric  tensor  of  which  is 

(10  0  \ 

(g,j)  =  I  0  r2  0  . 

\0  0  r2  cos2  6 ) 

If  it  is  possible  to  find  a  frame  in  which  ah  the 
components  of  g  are  constant,  the  space  is  flat. 
In  the  rectangular  frame  of  a  Euclidean  space, 
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we  have  gn  =  gn  =  g 33  =  1;  hence  the  space 
is  flat.  The  spherical  coordinates  show  that  even 
in  a  flat  space,  we  can  use  frames  in  which  the 
components  of  the  metric  tensor  are  not  constant. 
The  line  element  of  a  two-dimensional  spherical 
surface  is  obtained  from  the  metric  of  the  spheri¬ 
cal  coordinate  frame  by  assigning  to  r  some  fixed 
value  R : 

ds2  =  R2  (d 02  +  cos2  6  dcp 2) . 


speed  of  light  follows  immediately  from  the  ba¬ 
sic  assumptions  of  the  special  theory  of  relativity. 
This  is  possible  only  if  different  observers  mea¬ 
sure  time  intervals  differently. 

Let  us  send  a  beam  of  light  from  the  origin.  It 
travels  along  a  straight  line  at  the  speed  c.  Thus, 
at  the  moment  t  its  space  and  time  coordinates 
satisfy  the  equation 

x2  +  y2  +  z2  =  c2t2.  (B.l) 


The  metric  tensor  is 


This  cannot  be  transformed  to  a  constant  tensor. 
Thus  the  surface  of  a  sphere  is  a  curved  space . 

If  we  know  the  metric  tensor  in  some  frame, 
we  can  compute  a  fourth-order  tensor,  the  curva¬ 
ture  tensor  Rijki ,  which  tells  us  whether  the  space 
is  curved  or  flat.  Unfortunately  the  calculations 
are  slightly  too  laborious  to  be  presented  here. 

The  metric  tensor  is  needed  for  all  computa¬ 
tions  involving  distances,  magnitudes  of  vectors, 
areas,  and  so  on.  Also,  to  evaluate  a  scalar  prod¬ 
uct,  we  must  know  the  metric.  In  fact  the  com¬ 
ponents  of  the  metric  tensor  can  be  expressed  as 
scalar  products  of  the  basis  vectors: 


Next  we  study  what  the  situation  looks  like  in  an¬ 
other  frame,  x'y'z ' ,  moving  at  a  velocity  v  with 
respect  to  the  vyz  frame.  Let  us  select  the  new 
frame  so  that  it  coincides  with  the  vyz  frame 
at  t  =  0.  Also,  let  the  time  coordinate  tf  of  the 
x'y'z!  frame  be  t'  =  0  when  t  —  0.  And  finally  we 
assume  that  the  x'y'z!  frame  moves  in  the  direc¬ 
tion  of  the  positive  v  axis.  Since  the  beam  of  light 
must  also  travel  at  the  speed  c  in  the  x'y'z'  frame, 
we  must  have 


x'2  +  y'2  +  z2  =  cV2 


If  we  require  that  the  new  (dashed)  coordinates 
be  obtained  from  the  old  ones  by  a  linear  trans¬ 
formation,  and  also  that  the  inverse  transforma¬ 
tion  be  obtained  simply  by  replacing  v  by  —v, 
we  find  that  the  transformation  must  be 


gij  —  ei  •  e  j . 

If  A  and  B  are  two  arbitrary  vectors 

A  =  yVe,:,  B  =  YJbih, 


their  scalar  product  is 


A  B  = 


albJef  •  e j  = 


gijalbJ 


l  J 


l  J 


B.2  Lorentz  Transformation. 
Minkowski  Space 

The  special  theory  of  relativity  abandoned  the  ab¬ 
solute  Newtonian  time,  supposed  to  flow  at  the 
same  rate  for  every  observer.  Instead  it  required 
that  the  speed  of  light  must  have  the  same  value  c 
in  all  coordinate  frames.  The  constancy  of  the 


x  —  vt 
y/l  ~  V2 /C2 


Z 


! 


=  z, 


t  —  vx/c 2 
y/i  —  V2 /c2 


(B.2) 


This  transformation  between  frames  moving  at 
a  constant  speed  with  respect  to  each  other  is 
called  the  Lorentz  transformation. 

Because  the  Lorentz  transformation  is  derived 
assuming  the  invariance  of  (B.l),  it  is  obvious 
that  the  interval 


As2  =  - c2  At 2  +  Ax2  +  Ay2  +  Az2 


of  any  two  events  remains  invariant  in  all  Lorentz 
transformations.  This  interval  defines  a  metric  in 
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four-dimensional  spacetime.  A  space  having  such 
a  metric  is  called  the  Minkowski  space  or  Lorentz 
space.  The  components  of  the  metric  tensor  g  are 


<N 

1 

0 

0 

0\ 

0 

1 

0 

0 

0 

0 

1 

0 

V  o 

0 

0 

1/ 

Since  this  is  constant,  the  space  is  flat.  But  it  is 
no  longer  an  ordinary  Euclidean  space,  because 
the  sign  of  the  time  component  differs  from  the 
sign  of  the  space  components.  In  the  older  litera¬ 
ture,  a  variable  ict  is  often  used  instead  of  time, 
i  being  the  imaginary  unit.  Then  the  metric  looks 
Euclidean,  which  is  misleading;  the  properties  of 
the  space  cannot  be  changed  just  by  changing  no¬ 
tation. 

In  a  Minkowski  space  position,  velocity,  mo¬ 
mentum  and  other  vector  quantities  are  described 
by  four-vectors ,  which  have  one  time  and  three 
space  components.  The  components  of  a  four- 
vector  obey  the  Lorentz  transformation  when  we 
transform  them  from  one  frame  to  another,  both 
moving  along  a  straight  line  at  a  constant  speed. 

According  to  classical  physics  the  distance  of 
two  events  depends  on  the  motion  of  the  observer, 
but  the  time  interval  between  the  events  is  the 
same  for  all  observers.  The  world  of  special  rel¬ 
ativity  is  more  anarchistic:  even  time  intervals 
have  different  values  for  different  observers. 
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physical  reason  to  assume  that  the  two  masses 
should  have  anything  in  common.  However,  al¬ 
ready  the  experiments  made  by  Galilei  showed 
that  evidently  m\  =  rag.  This  has  been  verified 
later  with  very  high  accuracy. 

The  weak  equivalence  principle ,  which  states 
that  m\  =  rag,  can  therefore  be  accepted  as 
a  physical  axiom.  The  strong  equivalence  prin¬ 
ciple  generalises  this:  if  we  restrict  our  observa¬ 
tions  to  a  sufficiently  small  region  of  spacetime, 
there  is  no  way  to  tell  whether  we  are  subject  to 
a  gravitational  field  or  are  in  uniformly  acceler¬ 
ated  motion.  The  strong  equivalence  principle  is 
one  of  the  fundamental  postulates  of  general  rel¬ 
ativity. 

Curvature  of  Space  General  relativity  descri¬ 
bes  gravitation  as  a  geometric  property  of  the 
spacetime.  The  equivalence  principle  is  an  ob¬ 
vious  consequence  of  this  idea.  Particles  mov¬ 
ing  through  spacetime  follow  the  shortest  possi¬ 
ble  paths,  geodesics.  The  projection  of  a  geodesic 
onto  a  three-dimensional  space  need  not  be  the 
shortest  way  between  the  points. 

The  geometry  of  spacetime  is  determined  by 
the  mass  and  energy  distribution.  If  this  distribu¬ 
tion  is  known,  we  can  write  the  field  equations , 
which  are  partial  differential  equations  connect¬ 
ing  the  metric  to  the  mass  and  energy  distribution 
and  describing  the  curvature  of  spacetime. 

In  the  case  of  a  single-point  mass,  the  field 
equations  yield  the  Schwarzschild  metric,  the  line 
element  of  which  is 


The  Equivalence  Principle  Newton’s  laws  re¬ 
late  the  acceleration,  a,  of  a  particle  and  the  ap¬ 
plied  force  F  by 


F  =  m\a, 

where  mi  is  the  inertial  mass  of  the  particle,  re¬ 
sisting  the  force  trying  to  move  the  particle.  The 
gravitational  force  felt  by  the  particle  is 

F  =  mgf, 

where  rag  is  the  gravitational  mass  of  the  particle, 
and  /  is  a  factor  depending  only  on  other  masses. 
The  masses  mx  and  rag  appear  as  coefficients  re¬ 
lated  to  totally  different  phenomena.  There  is  no 


2  GM 
c2r 


c 2  d t2  + 


dr2 

1  -  2 GM/c2r 


+  r2  (d02  +  cos2  0  d</>2) . 


Here  M  is  the  mass  of  the  point;  r,  0  and  0  are  or¬ 
dinary  spherical  coordinates.  It  can  be  shown  that 
the  components  cannot  be  transformed  to  con¬ 
stants  simultaneously:  spacetime  must  be  curved 
due  to  the  mass. 

If  we  study  a  very  small  region  of  spacetime, 
the  curvature  has  little  effect.  Locally  the  space  is 
always  a  Minkowski  space  where  special  relativ¬ 
ity  can  be  used.  Locality  means  not  only  a  limited 
spatial  volume  but  also  a  limited  interval  in  time. 
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B  Theory  of  Relativity 


Unlike  in  the  Newtonian  formalism  in  general 
relativity  there  are  no  equations  of  motion  de¬ 
scribing  the  motion  of  a  particle.  It  is  possible 
to  use  the  positions  of  the  bodies  to  compute  the 
metric  tensor  of  the  space  and  then  the  geodesics 
representing  the  orbits  of  the  bodies.  When  the 
bodies  are  moving,  the  metric  of  the  space  keeps 
changing,  and  thus  this  method  is  very  laborious. 
The  PPN  (Parametrised  Post-Newtonian)  formal¬ 
ism  has  been  developed  for  practical  calculations, 
yielding  approximate  equations  of  motion.  It  con¬ 
tains  ten  constants,  the  values  of  which  are  spe¬ 
cific  for  Einstein’s  theory  of  gravity.  Alternative 
theories  give  different  values  for  these  constants. 
The  formalism  therefore  provides  a  framework 
for  experimental  tests  of  general  relativity. 

The  PPN  formalism  is  an  approximation  that 
can  be  used  if  velocities  are  well  below  the  speed 
of  light  and  the  gravitational  field  is  weak  and 
hence  the  curvature  of  the  space  is  low. 

B.4  Tests  of  General  Relativity 

General  relativity  gives  predictions  different  from 
classical  physics.  Although  the  differences  are 
usually  very  small,  there  are  some  phenomena  in 
which  the  deviation  can  be  measured  and  used  to 
test  the  validity  of  general  relativity.  At  presents 
five  different  astronomical  tests  have  verified  the 
theory. 

First  of  all  the  orbit  of  a  planet  is  no  longer 
a  closed  Keplerian  eclipse.  The  effect  is  strongest 
for  the  innermost  planets,  whose  perihelia  should 
turn  little  by  little.  Most  of  the  motion  of  the  peri¬ 
helion  of  Mercury  is  predicted  by  Newtonian  me¬ 
chanics;  only  a  small  excess  of  43  arc  seconds 
per  century  remains  unexplained.  And  it  so  hap¬ 
pens  that  this  is  exactly  the  correction  suggested 
by  general  relativity. 

Secondly  a  beam  of  light  should  bend  when 
it  travels  close  to  the  Sun.  For  a  beam  grazing 


the  surface,  the  deviation  should  be  about  1.75". 
Such  effects  have  been  observed  during  total  so¬ 
lar  eclipses  and  also  by  observing  pointlike  radio 
sources,  just  before  and  after  occultation  by  the 
Sun. 

The  third  classical  way  of  testing  general  rela¬ 
tivity  is  to  measure  the  redshift  of  a  photon  climb¬ 
ing  in  a  gravitational  field.  We  can  understand 
the  redshift  as  an  energy  loss  while  the  photon 
does  work  against  the  gravitational  potential.  Or 
we  can  think  of  the  redshift  as  caused  by  the 
metric  only:  a  distant  observer  finds  that  near 
a  mass,  time  runs  slower,  and  the  frequency  of 
the  radiation  is  lower.  The  time  dilation  in  the 
Schwarzschild  metric  is  described  by  the  coeffi¬ 
cient  of  d t ;  thus  it  is  not  surprising  that  the  radi¬ 
ation  emitted  at  a  frequency  v  from  a  distance  r 
from  a  mass  M  has  the  frequency 


if  observed  very  far  from  the  source.  This  grav¬ 
itational  redshift  has  been  verified  by  laboratory 
experiments. 

The  fourth  test  employs  the  slowing  down  of 
the  speed  of  light  in  a  gravitational  field  near  the 
Sun.  This  has  been  verified  by  radar  experiments. 

The  previous  tests  concern  the  solar  system. 
Outside  the  solar  system  binary  pulsars  have  been 
used  to  test  general  relativity.  An  asymmetric  sys¬ 
tem  in  accelerated  motion  (like  a  binary  star) 
loses  energy  as  it  radiates  gravitational  waves.  It 
follows  that  the  components  approach  each  other, 
and  the  period  decreases.  Usually  the  gravita¬ 
tional  waves  play  a  minor  role,  but  in  the  case  of 
a  compact  source  the  effect  may  be  strong  enough 
to  be  observed.  The  first  source  to  show  this  pre¬ 
dicted  shortening  of  period  was  the  binary  pulsar 
PSR  1913+16. 
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Table  C.1  SI  basic  units 


Quantity 

Symbol 

Unit 

Abbr. 

Definition 

Length 

l  j  S  j  •  •  • 

Metre 

m 

The  length  of  the  path  travelled  by  light  in  vacuum  during  a  time 
interval  of  1  /299,792,458  of  a  second. 

Mass 

m,  M 

Kilogram 

kg 

Equal  to  the  mass  of  the  international  prototype  of  the  kilogram. 

Time 

t 

Second 

s 

The  duration  of  9,192,631,770  periods  of  the  radiation 
corresponding  to  the  transition  between  two  hyperfine  levels  of 
the  ground  state  of  the  caesium  133-atom. 

Electric  current 

I 

Ampere 

A 

That  constant  current  which,  if  maintained  in  two  straight 
parallel  conductors  of  infinite  length,  of  negligible  circular  cross 
section  and  placed  1  metre  apart  in  a  vacuum,  would  produce  a 
force  equal  to  2  x  10-7  Newton  per  metre  of  length  between 
these  conductors. 

Temperature 

T 

Kelvin 

K 

The  fraction  1/273.16  of  the  thermodynamic  temperature  of  the 
triple  point  of  water. 

Amount  of  substance 

n 

Mole 

mol 

The  amount  of  substance  of  a  system  which  contains  as  many 
elementary  entities  as  there,  are  atoms  in  0.012  kg  of  12C. 

Luminous  intensity 

I 

Candela 

cd 

The  luminous  intensity  in  a  given  direction  of  a  source  which 
emits  monochromatic  radiation  of  frequency  540  x  1012  Hz  and 
of  which  the  radiant  intensity  in  that  direction  is  1  /683  Watt  per 
steradian. 

Table  C.2  Prefixes  for  orders  of  ten 


Prefix 

Symbol 

Multiple 

Prefix 

Symbol 

Multiple 

yocto 

y 

Tf 

(N 

1 

o 

H 

deca 

da 

101 

zepto 

z 

io-21 

hecto 

h 

102 

atto 

a 

10-18 

kilo 

k 

103 

femto 

f 

10-15 

Mega 

M 

106 

pico 

P 

10-12 

Giga 

G 

109 

nano 

n 

icr9 

Tera 

T 

1012 

micro 

P 

icr6 

Peta 

P 

1015 

milli 

m 

icr3 

Exa 

E 

1018 

centi 

c 

icr2 

Zetta 

Z 

1021 

deci 

d 

10-1 

Yotta 

Y 

1024 
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Table  C.3 

units 


Constants  and 


Radian 

1  rad 

=  180°/7T  -  57.2957795°  =  206,264.8" 

Degree 

1° 

=  0.01745329  rad 

Arc  second 

1" 

=  0.000004848  rad 

Velocity  of  light 

c 

=  299,792,458  ms-1 

Gravitational  constant 

G 

=  6.673  x  10-11  m3kg_1s-2 
=  An2  AU3  Mq1  a-2 

=  3,986,005  x  108  m3  M^1  s“2 

Planck  constant 

h 

=  6.6261  x  10“34  Js 

h 

=  h/2n  =  1.0546  x  10“34  Js 

Boltzmann  constant 

k 

=  1.3807  x  10"23  JK-1 

Radiation  density  constant 

a 

=  7.5659  x  10“16  Jm-3K“4 

Stefan-Boltzmann  constant 

(7 

=  a  c/4  =  5.6705  x  10"8  W  m~2  K“4 

Atomic  mass  unit 

amu 

=  1.6605  x  10-27  kg 

Electron  volt 

eV 

=  1.6022  x  10“19  J 

Electron  charge 

e 

=  1.6022  x  10"19  C 

Mass  of  electron 

mt 

=  9.1094  x  10"31  kg  =  0.511  MeV 

Mass  of  proton 

mv 

=  1.6726  x  10“27  kg  =  938.3  MeV 

Mass  of  neutron 

m  n 

=  1.6749  x  10“27  kg  =  939.6  MeV 

Mass  of  1 H  atom 

mH 

=  1.6735  x  10-27  kg  =  1.0078  amu 

Mass  of  2  He  atom 

W  He 

=  6.6465  x  10“27  kg  =  4.0026  amu 

Rydberg  constant  for  1 H 

=  1.0968  x  107  m-1 

Rydberg  constant  for  oo  mass 

tfoc 

=  1.0974  x  107  m_1 

Gas  constant 

R 

=  8.3145  JK-1  mol-1 

Normal  atmospheric  pressure 

atm 

=  101,325  Pa  =  1013  mbar  =  760  mmHg 

Astronomical  unit 

AU 

=  1.49597870  x  1011  m 

Parsec 

pc 

=  3.0857  x  1016  m  =  206,265  AU  =  3.26  ly 

Light-year 

iy 

=  0.9461  x  1016  m  =  0.3066  pc 
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Table  C.4 


Table  C.5 

alphabet 


Units  of  time 


Unit 

Equivalent  to 

Sidereal  year 

365.2564  d  (with  respect  to  fixed  stars) 

Tropical  year 

365.2422  d  (equinox  to  equinox) 

Anomalistic  year 

365.2596  d  (perihelion  to  perihelion) 

Gregorian  calendar  year 

365.2425  d 

Julian  year 

365.25  d 

Julian  century 

36,525  d 

Eclipse  year 

346.6200  d  (with  respect  to  the  ascending  node 
of  the  Moon) 

Lunar  year 

354.367  d  =  12  synodical  months 

Synodical  month 

29.5306  d  (newmoon  to  newmoon) 

Sidereal  month 

27.3217  d  (with  respect  to  fixed  stars) 

Tropical  month 

27.3216  d  (with  respect  to  the  vernal  equinox) 

Anomalistic  month 

27.5546  d  (perigee  to  perigee) 

Draconic  month 

27.2122  d  (node  to  node) 

Mean  solar  day 

24  h  mean  solar  time 

=  24  h  03  min  56.56  s  sidereal  time 
=  1.00273791  sidereal  days 

Sidereal  day 

24  h  sidereal  time 

=  23  h  56  min  04.09  s  mean  solar  time 
=  0.99726957  mean  solar  days 

Rotation  period  of  the  Earth 
(referred  to  fixed  stars) 

1.000000097  sidereal  days 
=  23  h  56  min  04.10  s  mean  solar  time 

The  Greek 


A,  a  B,/3  r,y  A,  8  E,e,s  Z,  £  H,r]  0,0, 1} 

alpha  beta  gamma  delta  epsilon  zeta  eta  theta 


I,i 

K,k 

A,  X 

M,  \± 

N,  v 

s,? 

O,  0 

77,  71,  uj 

iota 

kappa 

lambda 

mu 

nu 

xi 

omicron 

Pi 

P,  p 

Z,  a,  g 

T,  x 

r,  V 

0,  0,  (p 

x,x 

l//,  0 

Q ,  CD 

rho 

sigma 

tau 

upsilon 

phi 

chi 

psi 

omega 
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Table  C.6 


Table  C.7 


Table  C.8 


The  Sun 


Property 

Symbol 

Numerical  value 

Mass 

Me 

1.989  x  1030  kg 

Radius 

Re 

6.960  x  108  m  =  0.00465  AU 

Effective  temperature 

Tq 

5785  K 

Luminosity 

Lq 

3.9  x  1026  W 

Apparent  visual  magnitude 

V 

-26.78 

Colour  indices 

B  -  V 

0.62 

U -B 

0.10 

Absolute  visual  magnitude 

My 

4.79 

Absolute  bolometric  magnitude 

Mx)i 

4.72 

Inclination  of  equator  to  ecliptic 

7°  15' 

Equatorial  horizontal  parallax 

7Tq 

8.794" 

Motion:  direction  of  apex 

a  =  270° 

8  =  30° 

velocity  in  LSR 

19.7  kms-1 

Distance  from  galactic  centre 

8.5  kpc 

The  Earth  „ 

Property 

Symbol 

Numerical  value 

Mass 

M0 

=  Mq/ 332,946  =  5.974  x  1024  kg 

Mass,  Earth  +  Moon 

M0  +  M<i 

=  Mq/ 328,900.5  =  6.048  x  1024  kg 

Equatorial  radius 

Rc 

=  6,378,137  m 

Polar  radius 

*P 

=  6,356,752  m 

Flattening 

/ 

=  (Re  -  RP)/Re  =  1/298.257 

Surface  gravity 

8 

=  9.81  ms-2 

The  Moon 


Property 

Symbol 

Numerical  value 

Mass 

M(j 

=  M®/81.30  =  7.348  x  1022  kg 

Radius 

% 

=  1738  km 

Surface  gravity 

Sd 

=  1.62  ms-2  =  0.17g 

Mean  equatorial  horizontal  parallax 

7 Td 

=  57' 

Semimajor  axis  of  the  orbit 

a 

=  384,  400  km 

Smallest  distance  from  Earth 

r min 

=  356,  400  km 

Greatest  distance  from  Earth 

r  max 

=  406,  700  km 

Mean  inclination  of  orbit  to  ecliptic 

i 

=  5.145° 
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Table  C.9  Planets.  Re  =  equatorial  radius,  p  =  mean 
density,  rsid  =  sidereal  rotation  period  (R  indicates  ret¬ 
rograde  rotation),  s  =  inclination  of  the  equator  with 
respect  to  the  ecliptic  (at  the  beginning  of  2000), 


/  =  flattening,  T  =  surface  temperature,  p  =  geometric 
albedo,  Vo  =  mean  opposition  magnitude.  The  data  are 
based  mostly  on  the  Astronomical  Almanac 


Name 

Re 

[km] 

Mass 

P 

[g/cm3] 

Number  of 

known 

satellites 

Planet 

Planet  +  Satellites 

[kg] 

[M0] _ 

[T/q] 

Mercury 

2440 

3.30  x  1023 

0.0553 

1/6,023,600 

5.4 

— 

Venus 

6052 

4.87  x  1024 

0.8150 

1/408,523.5 

5.2 

Earth 

6378 

5.97  x  1024 

1.0123 

1/328,900.5 

5.5 

1 

Mars 

3397 

6.42  x  1023 

0.1074 

1/3,098,710 

3.9 

2 

Jupiter 

71,492 

1.90  x  1027 

317.89 

1/1047.355 

1.3 

67 

Saturn 

60,268 

5.69  x  1026 

95.17 

1/3498.5 

0.7 

62 

Uranus 

25,559 

8.66  x  1025 

14.56 

1/22,869 

1.3 

27 

Neptune 

24,764 

1.03  x  1026 

17.24 

1/19,314 

1.8 

14 

Name 

Uid 

£ 

[°] _ 

/ 

T 

[K] 

P 

V0 

Mercury 

58.646  d 

0.0 

0 

130-615 

0.106 

— 

Venus 

243.019  dR 

177.4 

0 

750 

0.65 

— 

Earth 

23  h  56  min  04.1  s 

23.4 

0.003353 

300 

0.367 

— 

Mars 

24  h  37  min  22.6  s 

25.2 

0.006476 

220 

0.150 

-2.01 

Jupiter 

9  h  55  min  30  s 

3.1 

0.06487 

140 

0.52 

-2.70 

Saturn 

10  h  39  min  22  s 

26.7 

0.09796 

100 

0.47 

+0.67 

Uranus 

17  h  14  min  24  s  R 

97.8 

0.02293 

65 

0.51 

+5.52 

Neptune 

16  h  06  min  36  s 

28.3 

0.01708 

55 

0.41 

+7.84 

Table  C.10  Magnitudes  of  the  planets.  The  table  gives 
the  expressions  used  in  the  Astronomical  Almanac.  They 
give  the  magnitudes  as  functions  of  the  phase  angle  a.  In¬ 
terior  planets  can  be  observed  at  relatively  large  phase  an¬ 
gles,  and  thus  several  terms  are  needed  to  describe  their 
phase  curves.  Phase  angles  of  the  exterior  planets  are  al¬ 
ways  small,  and  thus  very  simple  expressions  are  suffi¬ 
cient.  The  magnitude  of  Saturn  means  the  magnitude  of 
the  planet  only;  the  total  magnitude  also  depends  on  the 
orientation  of  the  rings 


V(1,0) 

V(1,g0 

Mercury 

-0.36 

V(1,0)+  3.80(^/100°) 

-  2.73(o'/100°)2  +  2.00(o'/100°)3 

Venus 

-4.29 

V(l,  0)  +  0.09(o'/100°) 

+  2.39(o'/100°)2  -  0.65(o'/100°)3 

Mars 

-1.52 

V(l,  0)  +  1.60(a/100°) 

Jupiter 

-9.25 

Vr(l,0)+0.5(a/100°) 

Saturn 

-8.88 

Uranus 

-7.19 

Vr(l,0)+0.28(a/100°) 

Neptune 

-6.87 

V(1,0) 
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Table  C.1 1  Osculating  elements  of  planetary  orbits  ing  node,  m  —  longitude  of  perihelion,  L  =  men  longi- 
on  JD  2,45 1,600.5  (Feb  24,  2000).  a  =  semimajor  axis,  tude,  Psid  =  mean  sidereal  orbital  period  (here  1  a  means 
e  =  eccentricity,  i  =  inclination,  Q  =  longitude  of  ascend-  a  Julian  year,  or  365.25  days),  Psyn  =  mean  synodic  period 


a 

e 

i 

Q 

zu 

L 

Fsid 

Psyn 

[AU] 

[106  km] 

[°] 

[°] 

[°] 

[°] 

[a] 

[d] 

[d] 

Mercury 

0.387 

57.9 

0.2056 

7.01 

48.3 

77.5 

119.4 

0.2408 

87.97 

115.9 

Venus 

0.723 

108.2 

0.0068 

3.39 

76.7 

131.9 

270.9 

0.6152 

224.7 

583.9 

Earth 

1.000 

149.6 

0.0167 

0.00 

143.9 

102.9 

155.2 

1.0000 

365.3 

— 

Mars 

1.524 

228.0 

0.0934 

1.85 

49.6 

336.1 

24.5 

1.8807 

686.9 

779.9 

Jupiter 

5.204 

778.6 

0.0488 

1.30 

100.5 

15.5 

39.0 

11.8565 

4330.6 

398.9 

Saturn 

9.582 

1433.4 

0.0558 

2.49 

113.6 

89.9 

51.9 

29.4235 

10,747 

378.1 

Uranus 

19.224 

2875.8 

0.0447 

0.77 

74.0 

170.3 

314.1 

83.747 

30,589 

369.7 

Neptune 

30.092 

4501.7 

0.0112 

1.77 

131.8 

39.5 

305.5 

163.723 

59,800 

367.5 

Mercury  a  =  0.38709893  +  0.000000667  e  =  0.20563069  +  0.000025277 

i  =7.00487°  -23.5 1"7  Q  =  48.33167°  -  446.30"7 

ru  =  77.45645°  +  573.57"  T  L  =  252.25084°  +  4.09233880°* 
Venus  a  =  0.72333199  +  0.000000927  e  =  0.00677323  -  0.000049387 

i  =  3.39471°  -  2.86 "7  Q  =  76.68069°  -  996.89 "7 

m  =  131.53298°  -  108.80"7  L  =  181.97973°  +  1.60213047°* 
Earth  +  Moon  a  =  1.00000011  -  0.000000057  e  =  0.01671022  -  0.000038047 

i  =  0.00005°  -  46.94" 7  J2  =  -11.26064°  -  18,  228.25"7 

m  =  102.94719°  +  1198.28"T  L  =  100.46435°  +  0.98560910°* 
Mars  a  =  1.52366231  -  0.000072217  e  =  0.09341233  +  0.000119027 

i  =  1.85061°  —  25.47"7  J2  =49.57854°  -  1020.19"7 

m  =  336.04084°  +  1560.78"7  L  =  355.45332°  +  0.52403304°* 
Jupiter  a  =  5.20336301  +  0.000607377  e  =  0.04839266  -  0.000128807 

i  =  1.30530°  -  4A5"T  Q  =  100.55615°  +  1217. 17" 7 

m  =  14.75385°  +  839.93 nT  L  =  34.40438°  +  0.08308676°* 

Saturn  a  =  9.53707032  -  0.003015307  e  =  0.05415060  -  0.000367627 

i  =  2.48446°  +6.1 1"7  J2  =  113.71504°  -  1591.05"7 

m  =  92.43194°  -  1948. 89"7  L  =  49.94432°  +  0.03346063°* 
Uranus  a  =  19.19126393  +  0.001520257  e  =  0.04716771  -  0.000191507 

i  =  0.76986°  -  2.09 "T  Q  =  74.22988°  +  168 1.40" 7 

si  =  170.96424°  +  1312. 56"7  L  =  313.23218°  +  0.01173129°* 
Neptune  a  =  30.06896348  -  0.001251967  e  =  0.00858587  +  0.000025147 

i  =  1.76917°  -  3. 64" 7  Q  =  131.72169°  -  151.25"7 

=  44.97135°  -  844.43" 7  L  =  304.88003°  +  0.00598106°* 


Table  C.1 2  Mean 
elements  of  planets  with 
respect  to  the  equator  and 
equinox  of  J2000.0.  The 
variable  *  is  the  time  in 
days  since  J2000.0  and 
7  is  the  same  time  in 
Julian  centuries: 
t  =  J  —  2,451,545.0, 

7  =  */ 36,525.  L  is  the 
mean  longitude, 

L  =  M  +  zu .  The  elements 
do  not  contain  periodic 
terms,  and  the  accuracy  of 
the  positions  computed 
from  them  is  of  the  order 
of  a  few  minutes  of  arc. 
The  values  are  from  the 
Explanatory  Supplement  to 
the  Astronomical  Almanac. 
The  elements  of  the  Earth 
describe  the  orbit  of  the 
barycentre  of  the 
Earth-Moon  system 
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Table  C.14  Some  well-known  asteroids,  a  =  semimajor  axis,  e  =  eccentricity,  i  =  inclination,  d  =  diameter, 
Tsid  =  sidereal  period  of  rotation,  p  =  geometric  albedo,  Vq  =  mean  opposition  magnitude 


Asteroidi 

Discoverer 

Year  of 
discovery 

a 

[AU] 

e 

i 

[°] 

d 

[km] 

Aid 

[h] 

P 

V0 

Type 

1 

Ceres 

Piazzi 

1801 

2.77 

0.08 

10.6 

946 

9.08 

0.07 

7.9 

c 

2 

Pallas 

Olbers 

1802 

2.77 

0.23 

34.8 

583 

7.88 

0.09 

8.5 

u 

3 

Juno 

Harding 

1804 

2.67 

0.26 

13.0 

249 

7.21 

0.16 

9.8 

s 

4 

Vesta 

Olbers 

1807 

2.36 

0.09 

7.1 

555 

5.34 

0.26 

6.8 

u 

5 

Astraea 

Hencke 

1845 

2.58 

0.19 

5.3 

116 

16.81 

0.13 

11.2 

s 

6 

Hebe 

Hencke 

1847 

2.42 

0.20 

14.8 

206 

7.27 

0.16 

9.7 

s 

7 

Iris 

Hind 

1847 

2.39 

0.23 

5.5 

222 

7.14 

0.20 

9.4 

s 

8 

Flora 

Hind 

1847 

2.20 

0.16 

5.9 

160 

13.60 

0.13 

9.8 

s 

9 

Metis 

Graham 

1848 

2.39 

0.12 

5.6 

168 

5.06 

0.12 

10.4 

s 

10 

Hygiea 

DeGasparis 

1849 

3.14 

0.12 

3.8 

443 

18.00 

0.05 

10.6 

c 

243 

Ida 

Palis  a 

1884 

2.86 

0.04 

1.1 

32 

4.63 

0.16 

433 

Eros 

Witt 

1898 

1.46 

0.22 

10.8 

20 

5.27 

0.18 

11.5 

s 

588 

Achilles 

Wolf 

1906 

5.18 

0.15 

10.3 

70 

? 

? 

16.4 

u 

624 

Hektor 

Kopff 

1907 

5.16 

0.03 

18.3 

230 

6.92 

0.03 

15.3 

u 

944 

Hidalgo 

Baade 

1920 

5.85 

0.66 

42.4 

30 

10.06 

? 

19.2 

MEU 

951 

Gaspra 

Neujmin 

1916 

2.21 

0.17 

4.1 

19 

7.04 

0.15 

1221 

Amor 

Delporte 

1932 

1.92 

0.43 

11.9 

5 

? 

? 

20.4 

? 

1566 

Icarus 

Baade 

1949 

1.08 

0.83 

22.9 

2 

2.27 

? 

12.3 

u 

1862 

Apollo 

Reinmuth 

1932 

1.47 

0.56 

6.4 

2 

? 

? 

16.3 

? 

2060 

Chiron 

Kowal 

1977 

13.64 

0.38 

6.9 

320 

? 

? 

17.3 

? 

5145 

Pholus 

Rabinowitz 

1992 

20.46 

0.58 

24.7 

190 

Table  C.1 5  Principal  meteor  showers 


Shower 

Period  of 

Maximum 

Radiant 

Meteors 

Comet 

visibility 

a 

5 

per  hour 

Quadrantids 

Jan. 1-5 

Jan.  3-4 

15.5  h 

+50° 

30-40 

Lyrids 

Apr.  19-25 

Apr.  22 

18.2  h 

+34° 

10 

Thatcher 

7]  Aquarids 

May  1-12 

May  5 

22.4  h 

-1° 

5-10 

Halley 

Perseids 

Jul.  20- Aug.  18 

Aug.  12 

3.1  h 

+58° 

40-50 

Swift-Tuttle 

k  Cygnids 

Aug.  17-24 

Aug.  20 

19.1  h 

+59° 

5 

Orionids 

Oct.  17-26 

Oct.  21 

6.3  h 

+  16° 

10-15 

Halley 

Taurids 

Oct.  10-Dec.  5 

Nov.  1 

3.8  h 

+  14°,  +22° 

5 

Encke 

Leonids 

Nov.  14-20 

Nov.  17 

10.2  h 

+22° 

10 

Tempel-Tuttle 

Geminids 

Dec.  7-15 

Dec.  13-14 

7.5  h 

+33° 

40-50 

Ursids 

Dec.  17-24 

Dec.  22 

13.5  h 

+78° 

5 
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Table  C.16  Periodic  comets  with  several  perihelion 
passages  observed.  N  =  number  of  passages  observed, 
r  =  time  of  perihelion  passage,  P  =  sidereal  period, 
q  —  perihelion  distance,  e  =  eccentricity,  co  =  argument 
of  perihelion  (1950.0),  Q  =  longitude  of  ascending  node 
(1950.0),  i  =  inclination,  /  =  longitude  of  perihelion,  de¬ 


fined  here  as  /  =  £2  +  arctan(tan&>cos  i),  b  =  latitude  of 
perihelion  (sin  b  =  sin  co  sin/),  Q  =  aphelion  distance.  The 
elements  are  affected  by  planetary  perturbations  as  well 
as  reaction  forces  due  to  evaporating  material  the  amount 
of  which  is  difficult  to  predict 


Comet 

N 

T 

P 

[a] 

q 

[AU] 

e 

co 

[°] 

J2 

[°] 

i 

[°] 

/ 

[°] 

b 

[°] _ 

Q 

[AU] 

Encke 

56 

Feb.  9,  1994 

3.28 

0.331 

0.850 

186.3 

334.0 

11.9 

160.2 

-1.3 

4.09 

Grigg-Skjellerup 

16 

Jul.  24,  1992 

5.10 

0.995 

0.664 

359.3 

212.6 

21.1 

212.0 

-0.3 

4.93 

Honda-Mrkos-Pajdusakova 

8 

Sep.  12,  1990 

5.30 

0.541 

0.822 

325.8 

88.6 

4.2 

54.5 

-2.4 

5.54 

Tuttle-Giacobini-Kresak 

8 

Feb.  8,  1990 

5.46 

1.068 

0.656 

61.6 

140.9 

9.2 

202.1 

8.1 

5.14 

Tempel  2 

19 

Mar.  16,  1994 

5.48 

1.484 

0.522 

194.9 

117.6 

12.0 

312.1 

-3.1 

4.73 

Wirtanen 

8 

Sep.  21,  1991 

5.50 

1.083 

0.652 

356.2 

81.6 

11.7 

77.9 

-0.8 

5.15 

Clark 

8 

Nov.  28,  1989 

5.51 

1.556 

0.501 

208.9 

59.1 

9.5 

267.7 

-4.6 

4.68 

Forbes 

8 

Mar.  15,  1993 

6.14 

1.450 

0.568 

310.6 

333.6 

7.2 

284.5 

-5.4 

5.25 

Pons-Winnecke 

20 

Aug.  19,  1989 

6.38 

1.261 

0.634 

172.3 

92.8 

22.3 

265.6 

2.9 

5.62 

d’ Arrest 

15 

Feb.  4,  1989 

6.39 

1.292 

0.625 

177.1 

138.8 

19.4 

316.0 

1.0 

5.59 

Schwassmann-Wachmann  2 

11 

Jan.  24,  1994 

6.39 

2.070 

0.399 

358.2 

125.6 

3.8 

123.8 

-0.1 

4.82 

Wolf-Harrington 

8 

Apr.  4,  1991 

6.51 

1.608 

0.539 

187.0 

254.2 

18.5 

80.8 

-2.2 

5.37 

Ciacobini-Zinner 

12 

Apr.  14,  1992 

6.61 

1.034 

0.707 

172.5 

194.7 

31.8 

8.3 

3.9 

6.01 

Reinmuth  2 

8 

Jun.  29,  1994 

6.64 

1.893 

0.464 

45.9 

295.4 

7.0 

341.1 

5.0 

5.17 

Perrine-Mrkos 

8 

Mar.  1,  1989 

6.78 

1.298 

0.638 

166.6 

239.4 

17.8 

46.6 

4.1 

5.87 

Arend-Rigaux 

7 

Oct.  3,  1991 

6.82 

1.438 

0.600 

329.1 

121.4 

17.9 

91.7 

-9.1 

5.75 

Borrelly 

11 

Dec.  18,  1987 

6.86 

1.357 

0.624 

353.3 

74.8 

30.3 

69.0 

-3.4 

5.86 

Brooks  2 

14 

Sep.  1,  1994 

6.89 

1.843 

0.491 

198.0 

176.2 

5.5 

14.1 

-1.7 

5.40 

Finlay 

11 

Jun.  5,  1988 

6.95 

1.094 

0.700 

322.2 

41.7 

3.6 

4.0 

-2.2 

6.19 

Johnson 

7 

Nov.  19,  1990 

6.97 

2.313 

0.366 

208.3 

116.7 

13.7 

324.3 

-6.4 

4.98 

Daniel 

8 

Aug.  31,  1992 

7.06 

1.650 

0.552 

11.0 

68.4 

20.1 

78.7 

3.8 

5.71 

Holmes 

8 

Apr.  10,  1993 

7.09 

2.177 

0.410 

23.2 

327.3 

19.2 

349.4 

7.4 

5.21 

Reinmuth  1 

8 

May  10,  1988 

7.29 

1.869 

0.503 

13.0 

119.2 

8.1 

132.0 

1.8 

5.65 

Faye 

19 

Nov.  15,  1991 

7.34 

1.593 

0.578 

204.0 

198.9 

9.1 

42.6 

-3.7 

5.96 

Ashbrook-Jackson 

7 

Jul.  13,  1993 

7.49 

2.316 

0.395 

348.7 

2.0 

12.5 

350.9 

-2.4 

5.34 

Schaumasse 

10 

Mar.  5,  1993 

8.22 

1.202 

0.705 

57.5 

80.4 

11.8 

137.3 

10.0 

6.94 

Wolf 

14 

Aug.  28,  1992 

8.25 

2.428 

0.406 

162.3 

203.4 

27.5 

7.6 

8.1 

5.74 

Whipple 

9 

Dec.  22,  1994 

8.53 

3.094 

0.259 

201.9 

181.8 

9.9 

23.4 

-3.7 

5.25 

Comas  Sola 

8 

Aug.  18,  1987 

8.78 

1.830 

0.570 

45.5 

60.4 

13.0 

105.2 

9.2 

6.68 

Vais  ala  1 

6 

Apr.  29,  1993 

10.8 

1.783 

0.635 

47.4 

134.4 

11.6 

181.2 

8.5 

7.98 

Tuttle 

11 

Jun.  27,  1994 

13.5 

0.998 

0.824 

206.7 

269.8 

54.7 

106.1 

-21.5 

10.3 

Halley 

30 

Feb.  9,  1986 

76.0 

0.587 

0.967 

111.8 

58.1 

162.2 

305.3 

16.4 

35.3 
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Table  C.1 7  Nearest  stars.  V  =  apparent  visual  magnitude,  B  —  V  =  colour  index,  r  =  distance,  fi  =  proper  motion, 
vr  =  radial  velocity  (positive  for  receding  objects) 


Name 

<*2000 

<*>2000 

V 

B-V 

Spectrum 

r 

[pc] 

[7a] 

ur 

[km/s] 

[h] 

[min] 

[°] 

n 

Sun 

-26.8 

0.6 

G2V 

a  Cen  C  (Proxima) 

14 

29.7 

-62 

41 

11.0 

2.0 

M5eV 

3.9 

-16 

a  Cen  A 

14 

39.6 

-60 

50 

-0.0 

0.7 

G2V 

1.33 

3.7 

-22 

a  CenB 

14 

39.6 

-60 

50 

1.3 

0.9 

K1V 

1.33 

3.7 

-22 

Barnard’s  star 

17 

57.8 

4 

42 

9.5 

1.7 

M5V 

1.83 

10.3 

-108 

Wolf  359 

10 

56.5 

7 

01 

13.5 

2.0 

M6eV 

2.39 

4.7 

+  13 

BD+36°2147 

11 

03.3 

35 

58 

7.5 

1.5 

M2V 

2.54 

4.8 

-86 

a  CMa  (Sirius)  A 

6 

45.1 

-16 

43 

-1.5 

0.0 

A1V 

2.66 

1.3 

-8 

a  CMa  (Sirius)  B 

6 

45.1 

-16 

43 

8.4 

wdA 

2.66 

1.3 

-8 

Luyten  726-8  A 

1 

39.0 

-17 

57 

12.5 

M6eV 

2.66 

3.3 

+29 

Luyten  726-8  B  (UV  Cet) 

1 

39.0 

-17 

57 

13.0 

M6eV 

2.66 

3.3 

+32 

Ross  154 

18 

49.8 

-23 

50 

10.4 

M4eV 

2.92 

0.7 

-4 

Ross  248 

23 

41.9 

44 

11 

12.2 

1.9 

M5eV 

3.13 

1.6 

-81 

s  Eri 

3 

32.9 

-9 

27 

3.7 

0.9 

K2V 

3.26 

1.0 

+  16 

Ross  128 

11 

47.7 

0 

48 

11.1 

1.8 

M5V 

3.31 

1.4 

-13 

Luyten  789-6  A 

22 

38.6 

-15 

17 

12.8 

2.0 

M5eV 

3.40 

3.3 

-60 

Luyten  789-6  B 

22 

38.6 

-15 

17 

13.3 

3.40 

3.3 

-60 

BD+43°44  A 

0 

18.4 

44 

01 

8.1 

1.6 

M3V 

3.44 

2.9 

+  13 

BD+43°44B 

0 

18.4 

44 

01 

11.0 

1.8 

M6V 

3.44 

2.9 

+20 

e  Ind 

22 

03.4 

-56 

47 

4.7 

1.1 

K5V 

3.45 

4.7 

-40 

BD+59°1915  A 

18 

42.8 

59 

38 

8.9 

1.5 

M4V 

3.45 

2.3 

0 

BD+59°1915B 

18 

42.8 

59 

38 

9.7 

1.6 

M4V 

3.45 

2.3 

+  10 

6 1  Cyg  A 

21 

06.9 

38 

45 

5.2 

1.2 

K5V 

3.46 

5.2 

-64 

6 1  Cyg  B 

21 

06.9 

38 

45 

6.0 

1.4 

K7V 

3.46 

5.2 

-64 

r  Cet 

1 

43.1 

-15 

56 

3.5 

0.7 

G8V 

3.48 

1.9 

-16 

CD— 36°15693 

23 

05.9 

-35 

51 

7.4 

1.5 

M2V 

3.51 

6.9 

+  10 

a  CMi  (Procyon)  A 

7 

39.3 

14 

0.4 

0.4 

L5IV 

3.51 

1.3 

-3 

a  CMi  (Procyon)  B 

7 

39.3 

14 

10.7 

wdL 

3.51 

1.3 

G51-15 

8 

29.8 

26 

47 

14.8 

M7V 

3.62 

1.3 

BD+5°1668 

7 

27.4 

5 

13 

9.8 

1.6 

M4V 

3.76 

3.8 

+26 

Luyten  725-32 

1 

12.6 

-17 

00 

11.8 

M6eV 

3.77 

1.4 

Kapteyn’s  star 

5 

11.7 

-45 

01 

8.8 

1.6 

M1VI 

3.85 

8.8 

+245 

CD— 39°14192 

21 

17.2 

-38 

52 

6.7 

1.4 

MOeV 

3.85 

3.5 

+21 

Kruger  60  A 

22 

28.0 

57 

42 

9.9 

1.6 

M3V 

3.95 

0.9 

-26 

Kruger  60  B 

22 

28.0 

57 

42 

11.5 

1.8 

M4eV 

3.95 

0.9 

-26 

Ross  614  A 

6 

29.4 

-2 

49 

11.2 

1.7 

M4eV 

4.13 

1.0 

+24 

Ross  614  B 

6 

29.4 

-2 

49 

14.8 

4.13 

1.0 

+24 

BD— 12°4523 

16 

30.3 

-12 

40 

10.2 

1.6 

M5V 

4.15 

1.2 

-13 

Wolf 424  A 

12 

33.3 

9 

01 

13.2 

1.8 

M6V 

4.29 

1.8 

-5 

Wolf 424  B 

12 

33.3 

9 

01 

13.2 

4.29 

1.8 

-5 
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Table  C.1 7  (Continued) 


Name 

<*2000 

<*>2000 

V 

B-V 

Spectrum 

r 

[pc] 

[7a] 

vr 

[km/s] 

[h] 

[min] 

n 

n 

van  Maanen’s  star 

0 

49.2 

5 

23 

12.4 

0.6 

wdG 

4.33 

3.0 

+54 

Luyten  1159-16 

2 

00.2 

13 

03 

12.2 

M5eV 

4.48 

2.1 

CD— 37°15492 

0 

05.4 

-37 

21 

8.6 

1.5 

M3V 

4.48 

6.1 

+23 

Luyten  143-23 

10 

44.5 

-61 

12 

13.9 

dM 

4.48 

1.7 

CD-46°  11540 

17 

28.7 

-46 

54 

9.4 

1.5 

M3 

4.52 

1.1 

LP 73 1-58 

10 

48.2 

-11 

20 

15.6 

M7V 

4.55 

1.6 

Luyten  145-141 

11 

45.7 

-64 

50 

11.4 

0.2 

wdA 

4.57 

2.7 

BD+68°946 

17 

36.4 

68 

20 

9.1 

1.5 

M3V 

4.63 

1.3 

-22 

CD— 49°13515 

21 

33.6 

-49 

01 

8.7 

1.5 

M2V 

4.63 

0.8 

+8 

BD+50°1725 

10 

11.3 

49 

27 

6.6 

1.4 

K2V 

4.67 

1.5 

-26 

G 158-27 

0 

06.7 

-07 

32 

13.7 

M5V 

4.67 

2.1 

BD— 15°6290 

22 

53.3 

-14 

18 

10.2 

1.6 

M4V 

4.69 

1.1 

+9 

CD-44°  11909 

17 

37.1 

-44 

19 

11.0 

M5V 

4.72 

1.1 

G  208-44/45  A 

19 

53.9 

44 

25 

13.4 

M6eV 

4.72 

0.7 

G 208-44/45 B 

19 

53.9 

44 

25 

14.3 

dM 

4.72 

G 208-44/45 C 

19 

53.9 

44 

25 

15.5 

dM 

4.72 

o2  Eri  A 

4 

15.3 

-7 

39 

4.4 

0.8 

K0V 

4.76 

-43 

o2  Eri  B 

4 

15.3 

-7 

39 

9.5 

0.0 

wdA 

4.76 

-21 

o2  Eri  C 

4 

15.3 

-7 

39 

11.2 

1.7 

M4eV 

4.76 

-45 

BD+20°2465 

10 

19.6 

19 

52 

9.4 

1.5 

M4V 

4.88 

0.5 

+  11 

70  Oph  A 

18 

05.5 

2 

30 

4.2 

0.9 

K0V 

4.98 

1.1 

-7 

70  OphB 

18 

05.5 

2 

30 

6.0 

K5V 

4.98 

1.1 

-10 

BD+44°2051  A 

11 

05.5 

43 

32 

8.7 

M2V 

5.00 

4.5 

+65 

BD+44°2051B 

11 

05.5 

43 

32 

14.4 

M5eV 

5.00 

4.5 

+65 

a  Aql  (Altair) 

19 

50.8 

8 

52 

0.8 

0.2 

A7V 

5.08 

0.7 

-26 

Table  C.1 8  Brightest  stars  (V  <  2).  V  =  apparent  visual  magnitude,  B  —  V  =  colour  index,  r  =  distance.  Remarks: 
b  =  binary,  sb  =  spectroscopic  binary,  v  =  variable 


Name 

<*2000 

7ooo 

V 

B-V 

Spectrum 

r 

Remarks 

[h]H 

[min] 

[°]  1 

r]  i 

[pc] 

a  CMa 

Sirius 

6 

45.2 

-16 

43 

-1.5 

0.0 

A IV,  wdA 

2.7 

b 

a  Car 

Canopus 

6 

24.0 

-52 

42 

-0.7 

0.2 

A9II 

60 

a  Cen 

Rigil 

14 

39.6 

-60 

50 

-0.3 

0.7 

G2V,K1V 

1.3 

b,  Proxima 

Kentaurus 

2.2°  apart 

a  Boo 

Arcturus 

14 

15.7 

19 

11 

-0.0 

1.2 

K2IIIp 

11 

a  Lyr 

Vega 

18 

36.9 

38 

47 

0.0 

0.0 

A0V 

8 

a  Aur 

Capella 

5 

16.7 

46 

00 

0.1 

EH 

G2III,G6III 

14 

b 

yd  Ori 

Rigel 

5 

14.5 

-8 

12 

0.1 

B8Ia 

90 

b 

a  CMi 

Procyon 

7 

39.3 

5 

14 

0.4 

F5IV,wdF 

3.5 

b 

a  Eri 

Achernar 

1 

37.7 

-57 

14 

0.5 

EH 

B3Vp 

40 
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Table  C.1 8  (Continued) 


Name 

<*2000 

<*>2000 

V 

B  -V 

Spectrum 

r 

[pc] 

Remarks 

[h] 

[min] 

[°] 

n 

a  Ori 

Betelgeuze 

5 

55.2 

7 

24 

0.5 

1.9 

M2I 

200 

v  0.4  —  1.3,  sb 

ydCen 

Hadar 

14 

03.8 

-60 

22 

0.6 

-0.2 

B1III 

60 

b 

a  Aql 

Altair 

19 

50.8 

8 

52 

0.8 

0.2 

A7V 

5.1 

a  Cru 

Acrux 

12 

26.6 

-63 

06 

0.8 

-0.3 

B0.5IV,B1V 

120 

b  1.6  +  2.1 

a  Tau 

Aldebaran 

4 

35.9 

16 

31 

0.9 

1.5 

K5III 

20 

b,  v 

a  Vir 

Spica 

13 

25.2 

-11 

10 

1.0 

-0.2 

B1IV 

50 

sb,  several  comp. 

a  Sco 

Antares 

16 

29.4 

-26 

26 

1.0 

1.8 

M1.5I,B2.5V 

50 

< 

o 

oo 

P  Gem 

Pollux 

7 

45.3 

28 

02 

1.2 

1.1 

K0III 

11 

a  PsA 

Fomalhaut 

22 

57.6 

-29 

37 

1.2 

0.1 

A3V 

7.0 

<*Cyg 

Deneb 

20 

41.4 

45 

17 

1.3 

0.1 

A2Ia 

500 

P  Cru 

Mimosa 

12 

47.7 

-59 

41 

1.3 

-0.2 

B0.5III 

150 

v,  sb 

a  Leo 

Regulus 

10 

08.4 

11 

58 

1.4 

-0.1 

B7V 

26 

b 

e  CMa 

Adhara 

6 

58.6 

-28 

58 

1.5 

-0.2 

B2II 

170 

b 

a  Gem 

Castor 

7 

34.6 

31 

53 

1.6 

0.0 

A1V,A2V 

14 

b 

y  Cru 

Gacrux 

12 

31.2 

-57 

07 

1.6 

1.6 

M3.5III 

40 

V 

X  Sco 

Shaula 

17 

33.6 

-37 

06 

1.6 

-0.2 

B1.5IV 

V 

Y  Ori 

Bellatrix 

5 

25.1 

6 

21 

1.6 

-0.2 

B2III 

P  Tau 

Elnath 

5 

26.3 

28 

36 

1.7 

-0.1 

B7III 

55 

P  Car 

Miaplacidus 

9 

13.2 

-69 

43 

1.7 

0.0 

A1III 

e  Ori 

Alnilam 

5 

36.2 

-1 

12 

1.7 

-0.2 

BOIa 

a  Gru 

A1  Na’ir 

22 

08.2 

-46 

58 

1.7 

-0.1 

B7IV 

20 

e  UMa 

Alioth 

12 

54.0 

55 

58 

1.8 

-0.0 

AOIVp 

120 

V 

y  Vel 

Regor 

8 

09.5 

-47 

20 

1.8 

-0.2 

WC8JB1IV 

b  1.8  +  4.3,  each  sb 

a  Per 

Mirfak 

3 

24.3 

49 

52 

1.8 

0.5 

F5Ib 

35 

a  UMa 

Dubhe 

11 

03.7 

61 

45 

1.8 

1.1 

K0III 

30 

b 

eSgr 

Kaus  Australis 

18 

24.2 

-34 

23 

1.9 

-0.0 

A0II 

70 

8  CMa 

Wezen 

7 

08.4 

-26 

23 

1.9 

0.7 

F8Ia 

e  Car 

Avior 

8 

22.5 

-59 

31 

1.9 

1.3 

K3III,B2V 

25 

sb 

r/  UMa 

Alkaid 

13 

47.5 

49 

19 

1.9 

-0.2 

B3V 

0  Sco 

Girtab 

17 

37.3 

-43 

00 

1.9 

0.4 

F0II 

50 

P  Aur 

Menkalinan 

5 

59.6 

44 

57 

1.9 

0.0 

A1IV 

30 

£  Ori 

Alnitak 

5 

40.8 

-1 

57 

1.9 

-0.2 

O9.5Ib,B0III 

45 

b  2.1  +  4.2 

a  TrA 

Atria 

16 

48.7 

-69 

02 

1.9 

1.4 

K2II-III 

40 

y  Gem 

Alhena 

6 

37.7 

16 

24 

1.9 

0.0 

A1IV 

30 

a  Pav 

Peacock 

20 

25.7 

-56 

44 

1.9 

-0.2 

B3V 

5  Vel 

8 

44.7 

-54 

43 

2.0 

0.0 

A1V 

P  CMa 

Mirzam 

22.7 

-17 

57 

2.0 

-0.2 

B 1 II— III 

70 

V 

a  Hya 

Alphard 

9 

27.6 

-8 

40 

2.0 

1.4 

K3II-III 

a  Ari 

Hamal 

2 

07.2 

23 

28 

2.0 

1.2 

K2III 

25 
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Table  C.19  Some  double  stars.  Magnitudes  of  the  components  are  m\  and  m2,  and  the  angular  separation  d;  r  is  the 
distance  of  the  star 


Name 

<*2000 

<*>2000 

m  1 

m2 

Spectrum 

d 

["] 

r 

[pc] 

[h] 

[min] 

[°] 

n 

rj  Cas 

Achird 

0 

49.1 

57 

49 

3.7 

7.5 

GOV 

M0 

12 

6 

y  Ari 

Mesarthim 

1 

53.5 

19 

18 

4.8 

4.9 

Alp 

B9V 

8 

a  Psc 

Alrescha 

2 

02.0 

2 

46 

4.3 

5.3 

AOp 

A3m 

2 

y  And 

Alamak 

2 

03.9 

42 

20 

2.4 

5.1 

K3IIb 

B8V,A0V 

10 

100 

8  Ori 

Mintaka 

5 

32.0 

-0 

18 

2.5 

7.0 

B0III,O9V 

B2V 

52 

70 

X  Ori 

Meissa 

5 

35.1 

56 

3.7 

5.7 

08e 

B0.5V 

4 

£  Ori 

Alnitak 

5 

40.8 

-1 

56 

2.1 

4.2 

09.5  lbe 

B0III 

2 

a  Gem 

Castor 

7 

34.6 

31 

53 

2.0 

3.0 

A1V 

A5Vm 

15 

y  Leo 

Algieba 

10 

20.0 

19 

50 

2.6 

3.8 

Kim 

G7III 

4 

£  UMa 

Alula  Australis 

11 

18.2 

31 

32 

4.4 

4.9 

GOV 

GOV 

1 

7 

a  Cru 

Acrux 

12 

26.6 

-63 

06 

1.6 

2.1 

B0.5IV 

B1V 

4 

y  Vir 

Porrima 

12 

41.7 

-1 

27 

3.7 

3.7 

F0V 

F0V 

a  CVn 

Cor  Caroli 

12 

56.1 

38 

18 

2.9 

5.5 

AOp 

F0V 

20 

£  UMa 

Mizar 

13 

23.9 

54 

56 

2.4 

4.1 

AlVp 

Aim 

14 

21 

a  Cen 

Rigil  Kentaurus 

14 

39.6 

-60 

50 

0.0 

1.3 

G2V 

K1V 

21 

1.3 

6  Boo 

Izar 

14 

45.0 

27 

04 

2.7 

5.3 

K0II-III 

A2V 

3 

60 

8  Ser 

15 

34.8 

10 

32 

4.2 

5.3 

F0IV 

F0IV 

4 

50 

P  Sco 

Graffias 

16 

05.4 

-19 

48 

2.6 

4.9 

B1V 

B2V 

14 

110 

a  Her 

Rasalgethi 

17 

14.6 

14 

23 

3. 0-4.0 

5.7 

M5Ib-II 

G5III,F2V 

120 

p  Her 

17 

23.7 

37 

08 

4.5 

5.5 

B9.5III 

A0V 

4 

70  Oph 

18 

05.5 

2 

30 

4.3 

6.1 

K0V 

K5V 

2 

5 

e  Lyr 

18 

44.3 

39 

40 

4.8 

4.4 

A4VJF1V 

A8V,F0V 

208 

50 

e1  Lyr 

18 

44.3 

39 

40 

5.1 

6.2 

A4V 

F1V 

3 

50 

e2  Lyr 

18 

44.4 

39 

37 

5.1 

5.3 

A8V 

F0V 

2 

50 

(  Lyr 

18 

44.8 

37 

36 

4.3 

5.7 

Am 

F0IV 

44 

30 

0  Ser 

Alya 

18 

56.2 

4 

12 

4.5 

4.9 

A5V 

A5V 

22 

30 

y  Del 

46.7 

16 

07 

4.5 

5.4 

K1IV 

F7V 

10 

40 

(  Aqr 

28.8 

-0 

01 

4.4 

4.6 

F3V 

F6IV 

2 

30 

8  Cep 

29.1 

58 

24 

3. 5-4. 3 

7.5 

F5Ib-G2Ib 

B7IV 

41 

90 

Table  C.20  Milky  Way  Galaxy 


Property 

Value 

Mass 

>  2  x  1011  Mq 

Disc  diameter 

30  kpc 

Disc  thickness  (stars) 

1  kpc 

Disc  thickness  (gas  and  dust) 

200  pc 

Halo  diameter 

50  kpc 

Sun’s  distance  from  the  centre 

8.5  kpc 

Sun’s  orbital  velocity 

220  kms-1 

Sun’s  period 

240  x  106  a 

Direction  of  the  centre  (2000.0) 

a  =  17  h  45.7  min 

5  =  — 29°00/ 

Direction  of  the  north  pole  (2000.0) 

a  =  12  h  51.4  min 

5  =  +27°08/ 

Galactic  coordinates  of  the 

/  =  123°00' 

celestial  north  pole 

b  =  +27°08/ 
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Table  C.21  Members  of  the  Local  Group  of  Galaxies.  V  =  apparent  visual  magnitude,  My  =  absolute  visual  magni¬ 
tude,  r  =  distance 


<*2000 

<*>2000 

Type 

V 

My 

r 

[kpc] 

[h] 

[min] 

[°] 

['] 

Milky  Way 

17 

45.7 

-29 

00 

Sbc 

8 

NGC  224  =  M3 1 

00 

42.7 

41 

16 

Sb 

3.2 

-21.2 

760 

NGC598  =  M33 

01 

33.8 

30 

30 

Sc 

5.6 

-18.9 

790 

Large  Magellanic  Cloud 

05 

19.6 

-69 

27 

Irr 

-18.5 

50 

Small  Magellanic  Cloud 

00 

52.6 

-72 

48 

Irr 

1.8 

-17.1 

60 

NGC  221=M32 

00 

42.7 

40 

52 

E2 

7.9 

-16.5 

760 

NGC  205 

00 

40.4 

41 

41 

dE5 

-16.4 

760 

IC10 

00 

20.4 

59 

17 

Irr 

7.8 

-16.3 

660 

NGC  6822 

19 

44.9 

-14 

48 

Irr 

7.5 

NGC  185 

00 

39.0 

48 

20 

dE3 

8.5 

-15.6 

660 

IC  1613 

01 

04.8 

02 

08 

Irr 

-15.3 

720 

NGC  147 

00 

33.2 

48 

30 

dE4 

9.0 

-15.1 

660 

WLM 

00 

02.0 

-15 

28 

Irr 

10.4 

-14.4 

930 

Sagittarius 

18 

55.1 

-30 

29 

dE7 

3.1 

-13.8 

24 

Fornax 

02 

39.9 

-34 

30 

dE3 

7.6 

-13.1 

140 

Pegasus 

23 

28.6 

14 

45 

Irr 

12.1 

-12.3 

760 

Leo  I 

10 

08.4 

12 

18 

dE3 

-11.9 

250 

And  II 

01 

16.5 

33 

26 

dE3 

12.4 

-11.8 

700 

And  I 

00 

45.7 

38 

00 

dEO 

12.7 

-11.8 

810 

Leo  A 

09 

59.4 

30 

45 

Irr 

12.7 

-11.5 

690 

Aquarius 

20 

46.9 

-12 

51 

Irr 

13.7 

-11.3 

SagDIG 

19 

30.0 

-17 

41 

Irr 

15.0 

-10.7 

Pegasus  II  =  And  VI 

23 

51.7 

24 

36 

dE 

14.0 

-10.6 

830 

Pisces  =  LGS  3 

01 

03.9 

21 

54 

Irr 

14.1 

-10.4 

810 

And  III 

00 

35.3 

36 

30 

dE6 

14.2 

-10.2 

760 

And  V 

01 

10.3 

47 

38 

dE 

14.3 

810 

Leo  II 

11 

13.5 

22 

10 

dEO 

11.5 

210 

Cetus 

00 

26.1 

-11 

02 

dE 

14.3 

-10.1 

780 

Sculptor 

01 

00.1 

-33 

43 

dE3 

10.0 

-9.8 

90 

Phoenix 

01 

51.1 

-44 

27 

Irr 

13.2 

-9.8 

Tucana 

22 

41.8 

-64 

25 

dE5 

15.1 

-9.6 

870 

Sextans 

10 

13.0 

-01 

37 

dE4 

10.3 

-9.5 

90 

Cassiopeia  =  And  VII 

23 

26.5 

50 

42 

dE 

14.7 

-9.5 

690 

Carina 

06 

41.6 

-50 

58 

dE4 

10.6 

-9.4 

100 

Ursa  Minor 

15 

08.8 

67 

07 

dE5 

10.0 

-8.9 

60 

Draco 

17 

20.3 

57 

55 

dE3 

10.9 

-8.6 

80 

Ursa  Major 

158 

43.2 

51 

55 

dE 

13.2 

-6.8 

100 

Canes  Venatici 

13 

28.0 

33 

33 

dE 

-7.9 

220 

Bootes 

14 

00.0 

14 

30 

dE 

13.3 

-5.7 

60 

Ursa  Major  II 

08 

51.5 

63 

08 

dE 

14.3 

-3.8 

30 
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Table  C.21  (Continued) 


<*2000 

<*>2000 

Type 

V 

My 

r 

[kpc] 

[h]_ _ 1 

[min] 

[°] _ 

n 

Coma  Berenices 

12 

27.0 

23 

54 

dE 

14.5 

-3.7 

44 

Canes  Venatici  II 

12 

57.2 

34 

19 

dE 

15.1 

-4.8 

150 

Hercules 

16 

31.0 

12 

47.5 

dE 

14.7 

-6.0 

140 

Leo  IV 

11 

33.0 

-0 

32 

dE 

15.9 

-5.1 

160 

And  IX 

00 

52.9 

43 

12 

dE 

16.2 

-8.3 

760 

And  X 

01 

06.5 

44 

48 

dE 

16.1 

-8.1 

710 

And  XI 

00 

46.3 

33 

48 

dE 

17.2 

-7.3 

760 

And  XII 

00 

47.5 

34 

22 

dE 

18.1 

-6.4 

760 

And  XIII 

00 

51.8 

33 

00 

dE 

17.6 

-6.9 

760 

Table  C.22  Optically  brightest  galaxies.  B  =  apparent  blue  magnitude,  d  =  apparent  diameter,  r  =  distance 


Name 

<*2000 

<*>2000 

Type 

B 

d 

r 

\h\ 

[min] 

[°] 

H | 

n 

[Mpc] 

NGC55 

0 

15.1 

-39 

13 

Sc/Irr 

7.9 

30 

X 

5 

2.3 

NGC  205 

0 

40.4 

41 

41 

E6 

8.9 

12 

X 

6 

0.7 

NGC  221  =  M32 

0 

42.7 

40 

52 

E2 

9.1 

3.4 

X 

2.9 

0.7 

NGC  224  =  M3 1 

0 

42.8 

41 

16 

Sb 

4.3 

163 

X 

42 

0.7 

NGC  247 

0 

47.2 

-20 

46 

S 

9.5 

21 

X 

8 

2.3 

NGC  253 

0 

47.6 

-25 

17 

Sc 

7.0 

22 

X 

5 

2.3 

Small  Magellanic  Cloud 

0 

52.6 

-72 

48 

I  IT 

2.9 

216 

X 

216 

0.06 

NGC  300 

0 

54.9 

-37 

41 

Sc 

8.7 

22 

X 

16 

2.3 

NGC  598  =  M33 

1 

33.9 

30 

39 

Sc 

6.2 

61 

X 

42 

0.7 

Fornax 

2 

39.9 

-34 

32 

dE 

9.1 

50 

X 

35 

0.2 

Large  Magellanic  Cloud 

5 

23.6 

-69 

45 

Irr/Sc 

0.9 

432 

X 

432 

0.05 

NGC  2403 

7 

36.9 

65 

36 

Sc 

8.8 

22 

X 

12 

2.0 

NGC  2903 

9 

32.1 

21 

30 

Sb 

9.5 

16 

X 

7 

5.8 

NGC  3031  =  M81 

9 

55.6 

69 

04 

Sb 

7.8 

25 

X 

12 

2.0 

NGC  3034  =  M82 

9 

55.9 

69 

41 

Sc 

9.2 

10 

X 

1.5 

2.0 

NGC  4258  =  M106 

12 

19.0 

47 

18 

Sb 

8.9 

19 

X 

7 

4.3 

NGC  4472  =  M49 

12 

29.8 

8 

00 

E4 

9.3 

10 

X 

7 

11 

NGC  4594  =  Ml  04 

12 

40.0 

-11 

37 

Sb 

9.2 

8 

X 

5 

11 

NGC  4736  =  M94 

12 

50.9 

41 

07 

Sb 

8.9 

13 

X 

12 

4.3 

NGC  4826  =  M64 

12 

56.8 

21 

41 

9.3 

10 

X 

4 

3.7 

NGC  4945 

13 

05.4 

-49 

28 

Sb 

8.0 

20 

X 

4 

3.9 

NGC  5055  =  M63 

13 

15.8 

42 

02 

Sb 

9.3 

8 

X 

3 

4.3 

NGC  5128  =  Cen  A 

13 

25.5 

-43 

01 

E0 

7.9 

23 

X 

20 

3.9 

NGC  5194  =  M51 

13 

29.9 

47 

12 

Sc 

8.9 

11 

X 

6 

4.3 

NGC  5236  =  M83 

13 

37.0 

-29 

52 

Sc 

7.0 

13 

X 

12 

2.4 

NGC  5457  =  M101 

14 

03.2 

54 

21 

Sc 

8.2 

23 

X 

21 

4.3 

NGC  6822 

19 

45.0 

-14 

48 

I  IT 

9.2 

20 

X 

10 

0.7 
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Table  C.23 

Constellations.  The  first 
column  gives  the 
abbreviation  of  the  Latin 
name  used  to  form  star 
names 


Abbreviation 

Latin  name 

Genitive 

English  name 

And 

Andromeda 

Andromedae 

Andromeda 

Ant 

Antlia 

Antliae 

Air  Pump 

Aps 

Apus 

Apodis 

Bird  of  Paradise 

Aql 

Aquila 

Aquilae 

Eagle 

Aqr 

Aquarius 

Aquarii 

Water-bearer 

Ara 

Ara 

Arae 

Altar 

Ari 

Aries 

Arietis 

Ram 

Aur 

Auriga 

Aurigae 

Charioteer 

Boo 

Bootes 

Bootis 

Herdsman 

Cae 

Caelum 

Caeli 

Chisel 

Cam 

Camelopardalis 

Camelopardalis 

Giraffe 

Cnc 

Cancer 

Cancri 

Crab 

CMa 

Canis  Major 

Canis  Majoris 

Great  Dog 

CMi 

Canis  Minor 

Canis  Minoris 

Little  Dog 

Cap 

Capricomus 

Capricomi 

Sea-goat 

Car 

Carina 

Carinae 

Keel 

Cas 

Cassiopeia 

Cassiopeiae 

Cassiopeia 

Cen 

Centaurus 

Centauri 

Centaurus 

Cep 

Cepheus 

Cephei 

Cepheus 

Cet 

Cetus 

Ceti 

Whale 

Cha 

Chamaeleon 

Chamaeleontis 

Chameleon 

Cir 

Circinus 

Circini 

Compasses 

Col 

Columba 

Columbae 

Dove 

Com 

Coma  Berenices 

Comae  Berenices 

Berenice’s  Hair 

CrA 

Corona  Austrina 

Coronae  Austrinae 

Southern  Crown 

CrB 

Corona  Borealis 

Coronae  Borealis 

Northern  Crown 

Crv 

Corvus 

Corvi 

Crow 

Crt 

Crater 

Crateris 

Cup 

Cru 

Crux 

Crucis 

Southern  Cross 

CVn 

Canes  Venatici 

Canum  Venaticorum 

Hunting  Dogs 

Cyg 

Cygnus 

Cygni 

Swan 

Del 

Delphinus 

Delphini 

Dolphin 

Dor 

Dorado 

Doradus 

Swordfish 

Dra 

Draco 

Draconis 

Dragon 

Equ 

Equuleus 

Equulei 

Little  Horse 

Eri 

Eridanus 

Eridani 

Eridanus 

For 

Fornax 

Fornacis 

Furnace 

Gem 

Gemini 

Geminorum 

Twins 

Gru 

Grus 

Gruis 

Crane 

Her 

Hercules 

Herculis 

Hercules 

Hor 

Horologium 

Horologii 

Clock 

Hya 

Hydra 

Hydrae 

Water  Serpent 

Hyi 

Hydrus 

Hydri 

Water  Snake 

Ind 

Indus 

Indi 

Indian 
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Table  C.23  (Continued) 

Abbreviation 

Latin  name 

Genitive 

English  name 

Lac 

Lacerta 

Lacertae 

Lizard 

Leo 

Leo 

Leonis 

Lion 

Lep 

Lepus 

Leporis 

Hare 

Lib 

Libra 

Librae 

Scales 

LMi 

Leo  Minor 

Leonis  Minoris 

Little  Lion 

Lup 

Lupus 

Lupi 

Wolf 

Lyn 

Lynx 

Lyncis 

Lynx 

Lyr 

Lyra 

Lyrae 

Lyre 

Men 

Mensa 

Mensae 

Table  Mountain 

Mic 

Micro  scopium 

Micro  scopii 

Microscpe 

Mon 

Monoceros 

Monocerotis 

Unicorn 

Mus 

Musca 

Muscae 

Fly 

Nor 

Norma 

Normae 

Square 

Per 

Perseus 

Persei 

Perseus 

Phe 

Phoenix 

Phoenicis 

Phoenix 

Pic 

Pictor 

Pictoris 

Painter 

PsA 

Piscis  Austrinus 

Piscis  Austrini 

Southern  Fish 

Psc 

Pisces 

Piscium 

Fishes 

Pup 

Puppis 

Puppis 

Poop 

Pyx 

Pyxis 

Pyxidis 

Compass 

Ret 

Reticulum 

Reticuli 

Net 

Scl 

Sculptor 

Sculptoris 

Sculptor 

Sco 

Scorpius 

Scorpii 

Scorpion 

Set 

Scutum 

Scuti 

Sobieski’s  Shield 

Ser 

Serpens 

Serpentis 

Serpent 

Sex 

Sextans 

Sextantis 

Sextant 

Sge 

Sagitta 

Sagittae 

Arrow 

Sgr 

Sagittarius 

Sagittarii 

Archer 

Tau 

Taurus 

Tauri 

Bull 

Tel 

Telescopium 

Telescopii 

Telescope 

TrA 

Triangulum  Australe 

Trianguli  Australis 

Southern  Triangle 

Tri 

Triangulum 

Trianguli 

Triangle 

Tuc 

Tucana 

Tucanae 

Toucan 

UMa 

Ursa  Major 

Ursae  Majoris 

Great  Bear 

UMi 

Ursa  Minor 

Ursae  Minoris 

Little  Bear 

Vel 

Vela 

Velorum 

Sails 

Vir 

Virgo 

Virginis 

Virgin 

Vol 

Volans 

Volantis 

Flying  Fish 

Vul 

Vulpecula 

Vulpeculae 

Fox 

Oct 

Octans 

Octantis 

Octant 

Oph 

Ophiuchus 

Ophiuchi 

Serpent-bearer 

Ori 

Orion 

Orionis 

Orion 

Pav 

Pavo 

Pavonis 

Peacock 

Peg 

Pegasus 

Pegasi 

Pegasus 
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Table  C.24  Largest 
optical  telescopes. 

D  —  diameter  of  the  mirror 


Telescope 

Location 

Completion 

year 

D 

[m] 

GranTeCan 

La  Palma 

2009 

10.4 

William  M.  Keck  Telescope  I 

Mauna  Kea,  Hawaii 

1992 

10 

William  M.  Keck  Telescope  II 

Mauna  Kea,  Hawaii 

1996 

10 

Southern  African  Large  Telescope 

Sutherland,  South  Africa 

2005 

10 

Subaru  Telescope 

Mauna  Kea,  Hawaii 

1999 

8.3 

Large  Binocular  Telescope  1 

Mt.  Graham,  Arizona 

2005 

8.4 

Kueyen  Telescope  (VLT  2) 

Cerro  Paranal,  Chile 

1999 

8.2 

Melipal  Telescope  (VLT  3) 

Cerro  Paranal,  Chile 

2000 

8.2 

Yepun  Telescope  (VLT  4) 

Cerro  Paranal,  Chile 

2000 

8.2 

Gemini  North  Telescope 

Mauna  Kea,  Hawaii 

1999 

8.1 

Gemini  South  Telescope 

Cerro  Pachon,  Chile 

2000 

8.1 

Multi-Mirror  Telescope 

Mt.  Hopkins,  Arizona 

1999 

6.5 

Walter  Baade  (Magellan  1  Telescope) 

Las  Campanas,  Chile 

2000 

6.5 

Landon  Clay  (Magellan  2  Telescope) 

Las  Campanas,  Chile 

2002 

6.5 

Table  C.25  Largest  parabolic  radio  telescopes.  D  =  diameter  of  the  antenna,  Xm{n  =  shortest  wavelength 


Completion 

year 

D 

[m] 

7- min 

[cm] 

Remarks 

Arecibo 

Puerto  Rico,  USA 

1963 

305 

5 

Fixed  disk;  limited  tracking 

Green  Bank 

West  Virginia,  USA 

2001 

100  x  110 

0.3 

The  largest  fully  steerable  telescope 

Effelsberg 

Bonn,  Germany 

1973 

100 

0.8 

Jodrell  Bank 

Macclesfield,  Great  Britain 

1957 

76.2 

10-20 

First  large  paraboloid  antenna 

Jevpatoria 

Crimea 

1979 

70 

1.5 

Parkes 

Australia 

1961 

64 

2.5 

Innermost  17  m  of  dish  can  be  used 
down  to  3  mm  wavelengths 

Goldstone 

California,  USA 

64 

1.5 

Belongs  to  NASA  deep  space  network 

Tidbinbilla 

Australia 

64 

1.3 

NASA 

Madrid 

Spain 

64 

1.3 

NASA 
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Table  C.26  Millimetre  and  submillimetre  telescopes  and  interferometers,  h  =  altitude  above  sea  level,  D  =  diameter 
of  the  antenna,  Xm[n  =  shortest  wavelength 


Institute 

Location 

h 

[m] 

D 

[m] 

^■min 

[mm] 

Remarks; 
operational  since 

NR  AO,  VLA 

New  Mexico,  USA 

2124 

25 

7 

27  antennas 
<imax  —  36.6  km 
1976 

NR  AO,  VLBA 

USA 

16-3720 

25 

13 

10  antennas 
1988-1993 

Max-Planck-lnstitut  fur  Radioastronomie  & 
University  of  Arizona 

Mt.  Graham,  USA 

3250 

10 

0.3 

1994 

California  Institute  of  Technology 

Mauna  Kea,  Hawaii 

4100 

10.4 

0.3 

1986 

Science  Research  Council  England  &  Holland 

Mauna  Kea,  Hawaii 

4100 

15.0 

0.5 

The  James 

Clerk  Maxwell 
Telescope 

1986 

California  Institute  of  Technology 

Owens  Valley,  USA 

1220 

10.4 

0.5 

3  antenna 

interferometer 

1980 

Sweden-ESO  Southern  Hemisphere  Millimeter 
Antenna  (SEST) 

La  Silla,  Chile 

2400 

15.0 

0.6 

1987 

Institut  de  Radioastronomie  Millimetrique 
(IRAM),  France  &  Germany 

Plateau  de  Bure,  France 

2550 

15.0 

0.6 

3  antenna 

interferometer 

1990; 

fourth  antenna 
1993 

IRAM 

Pico  Veleta,  Spain 

2850 

30.0 

0.9 

1984 

National  Radio  Astronomy  Observatory 
(NRAO) 

Kitt  Peak,  USA 

1940 

12.0 

0.9 

1983  (1969) 

University  of  Massachusetts 

New  Salem,  USA 

300 

13.7 

1.9 

radom 

1978 

University  of  California,  Berkeley 

Hat  Creek  Observatory 

1040 

6.1 

2 

3  antenna 

interferometer 

1968 

Purple  Mountain  Observatory 

Nanjing,  China 

3000 

13.7 

2 

radom 

1987 

Daeduk  Radio  Astronomy  Observatory 

Soul,  South-Korea 

300 

13.7 

2 

radom 

1987 

University  of  Tokyo 

Nobey ama,  Japan 

1350 

45.0 

2.6 

1982 

University  of  Tokyo 

Nobey ama,  Japan 

1350 

10.0 

2.6 

5  antenna 

interferometer 

1984 

Chalmers  University  of  Technology 

Onsala,  Sweden 

10 

20.0 

2.6 

radom 

1976 
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Table  C.27  Some  important  astronomical  satellites  and  space  probes  1980-2002 


Satellite 

Launch  date 

Target 

Solar  Max 

USA 

Feb.  14,  1980 

Sun 

Venera  13 

SU 

Oct.  30,  1981 

Venus 

Venera  14 

SU 

Nov.  4,  1981 

Venus 

IRAS 

USA 

Jan.  25,  1983 

infrared 

Astron 

SU 

Mar.  23,  1983 

ultraviolet 

Venera  15 

SU 

Jun.  2,  1983 

Venus 

Venera  16 

SU 

Jun.  7,  1983 

Venus 

Exosat 

ESA/USA 

May  26,  1983 

X-ray 

Vega  1 

SU 

Dec.  15  1984 

Venus/Halley 

Vega  2 

SU 

Dec.  21,  1984 

Venus/Halley 

Giotto 

ESA 

Jul.  2,  1985 

Halley 

Suisei 

Japan 

Aug.  18,  1985 

Halley 

Ginga 

Japan 

Feb.  5,  1987 

X-ray 

Magellan 

USA 

May  4,  1989 

Venus 

Hipparcos 

ESA 

Aug.  8,  1989 

astrometry 

COBE 

USA 

Nov.  18,  1989 

cosmic  background  radiation 

Galileo 

USA 

Oct.  18,  1989 

Jupiter  etc. 

Granat 

SU 

Dec.  1,  1989 

gamma  ray 

Hubble 

USA/ESA 

Apr.  24,  1990 

UV,  visible 

Rosat 

Germany 

Jun.  1,  1990 

X-ray 

Gamma 

SU 

Jul.  11,  1990 

gamma  ray 

Ulysses 

ESA 

Oct.  6,  1990 

Sun 

Compton 

USA 

Apr.  5,  1991 

gamma  ray 

EUVE 

USA 

Jun.  7,  1992 

extreme  UV 

Asuka 

Japan 

Feb.  20,  1993 

X-ray 

Clementine 

USA 

Jan.  25, 1994 

Moon 

ISO 

ESA 

Nov.  17,  1995 

infrared 

SOHO 

ESA 

Dec.  2,  1995 

Sun 

Near-Shoemaker 

USA 

Feb.  17,  1996 

Mathilde,  Eros 

BeppoSAX 

Italy 

Apr.  30,  1996 

X-ray 

Mars  Global  Surveyor 

USA 

Nov.  7,  1996 

Mars 

Cassini/Huygens 

USA/ESA 

Oct.  15,  1997 

Saturn,  Titan 

Mars  Pathfinder/S ojoumer 

USA 

Dec.  4,  1996 

Mars 

Lunar  Prospector 

USA 

Jan.  6,  1998 

Moon 

Nozomi 

Japan 

Jul.  4,  1998 

Mars 

Deep  Space  1 

USA 

Oct.  24,  1998 

Braille,  Borrelly 

Stardust 

USA 

Feb.  7,  1999 

Wild  2 

Chandra 

USA 

Jul.  23,  1999 

X-ray 

XMM-Newton 

ESA 

Dec.  10,  1999 

X-ray 

Hete  2 

USA 

Oct.  9,  2000 

gamma  ray 

Mars  Odyssey 

USA 

Apr.  7,  2001 

Mars 

MAP 

USA 

Jun.  30,  2001 

cosmic  background  radiation 
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Table  C.27  (Continued) 


Satellite 

Launch  date 

Target 

Genesis 

USA 

Aug.  8,  2001 

solar  particles 

RHESSI 

USA 

Feb.  5,  2002 

Sun 

Grace 

Gemany-USA 

Mar.  17,  2002 

Earth’s  gravity 

Integral 

ESA 

Oct.  17,  2002 

gamma  ray 

Galex 

USA 

Apr.  28,  2003 

galaxies 

Hayabusa 

Japan 

May  9,  2003 

Itokawa 

Mars  Express 

ESA 

Jun.  2,  2003 

Mars 

Spirit 

USA 

Jun. 10, 2003 

Mars 

Opportunity 

USA 

Jul.  8,  2003 

Mars 

Spitzer 

USA 

Jun. 10, 2003 

infrared 

Smart- 1 

ESA 

Sep.  28,  2003 

Moon 

Rosetta 

ESA 

Mar.  2,  2004 

Churyumov-Gerasimenko 

Gravity  Probe  B 

USA 

Apr.  20,  2004 

relativity 

Messenger 

USA 

Aug.  3,  2004 

Mercury 

Swift 

USA 

Nov.  20,  2004 

gamma  ray  bursts 

Deep  Impact 

USA 

Jan.  12,  2005 

Tempel  1 

Mars  Recon  Orbiter 

USA 

Aug.  12,  2005 

Mars 

Venus  Express 

ESA 

Nov.  9,  2005 

Venus 

New  Horizons 

USA 

Jan.  19,  2006 

Pluto 

Akari 

Japan 

Feb.  22,  2006 

infrared 
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Chapter  2 

2.1  The  distance  is  ~  7640  km,  the  northern¬ 
most  point  is  79°N,  45 °W,  in  North  Greenland, 
1250  km  from  the  North  Pole. 

2.2  The  star  can  culminate  south  or  north  of 
zenith.  In  the  former  case  we  get  8  =  65°,  0  = 
70°,  and  in  the  latter  8  =  70°,  0  =  65°. 

2.3  (a)  tp  >  58°7/.  If  refraction  is  taken  into  ac¬ 
count,  the  limit  is  57° 24'.  (b)  0  =  8  =  7° 24'. 
(c)  -59°  10'  <0  <  -0°50'. 

2.4  Pretty  bad. 

2.5  A0  =  70°22\  /30  =  0°0',  A0  =  250°22/, 
£©  =  0°0'. 

2.6  (c)  (90  =  18  h. 

2.8  a  =  6  h  45  min  9  s  ,8  =  -16°  43'. 

2.9  vt  =  16.7  kms-1,  v  =  18.5  kms-1,  after  ~ 
61,000  years,  ji  =  1.62"  per  year,  parallax  0.42". 

Chapter  3 

3.1  (a)  The  flux  density  in  the  focal  plane  as  well 
as  the  exposure  time  are  proportional  to  (D//)2. 
Thus  the  required  exposure  is  3.2  s.  (b)  1.35  cm 
and  1.80  cm.  (c)  60  and  80. 


3.2  (a)  0.001"  (note  that  the  aperture  is  a  line 
rather  than  circular;  therefore  the  coefficient  1 .22 
should  not  be  used),  (b)  140  m. 

Chapter  4 

4.1  0.9. 

4.2  The  absolute  magnitude  will  be  —17.5  and 
apparent  6.7. 

4.3  N(m  +  l)/N(m)  =  IO3/5  =  3.98. 

4.4  r  =  2.1  kpc,  Eb-v  =  0-7,  and  ( B  —  V)o  = 
0.9. 

4.5  (a)  Am  =  1.06  mag,  m  =  2.42.  (b)  r  = 

—  In0.856  0.98. 

Chapter  5 

5.2  n  =  166,  which  corresponds  to  X  =  21 .04  cm. 
Such  transitions  would  keep  the  population  of  the 
state  n  =  166  very  high,  resulting  in  downward 
transitions  also  from  this  state.  Such  transitions 
have  not  been  detected.  Hence,  the  line  is  pro¬ 
duced  by  some  other  process. 

5.3  If  we  express  the  intensity  as  something  per 
unit  wavelength,  we  get  Amax  =  1.1  mm.  If  the 
intensity  is  given  per  unit  frequency,  we  have 
Amax  =  1.9  mm.  The  total  intensity  is  2.6  x 
1013  W  m-2  sterad-1.  At  550  nm  the  intensity  is 
practically  zero. 
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5.4  (a)  L  =  1.35  x  1029  W.  The  flux  in  the  given 
interval  can  be  found  by  integrating  Planck’s 
law  numerically.  Using  the  Wien  approximation 
a  rather  complicated  expression  can  be  derived. 
Both  methods  give  the  result  that  3.3  %  of  the  ra¬ 
diation  is  in  the  visual  range,  thus  Ly  =  4.45  x 
1027  W.  (b)  At  a  distance  of  10  pc  the  observed 
flux  density  is  3.7  x  10-9  Wm~2.  (b)  10.3  km. 

5.5  Mboi  =  0.87,  whence  R  =  2.0  Rq. 

5.6  T  =  1380  K.  There  are  several  strong  ab¬ 
sorption  lines  in  this  spectral  region,  reducing  the 
brightness  temperature. 

5.8  nrms  ~  6700  km  s“ 1 . 

Chapter  6 

6.1  Va/Vp  =  (1  —  e)/{\  +  e).  For  the  Earth  this 
is  0.97. 

6.2  a  —  1.4581  AU,  v  ~  23.6  kms-1. 

6.3  The  period  must  equal  the  sidereal  rotation 
period  of  the  Earth,  r  =  42,339  km  =  6.64  R®. 
Areas  within  8.6°  from  the  poles  cannot  be  seen 
from  geostationary  satellites.  The  hidden  area  is 

1.1  %  of  the  total  surface  area. 

6.4  p  =  3tt / (G P2 (a /2)3)  ~  1400  kgm-3. 

6.5  M  =  90°,  E  =  90.96°,  /  =  91.91°. 

6.6  The  orbit  is  hyperbolic,  <2  =  3. 55  x  107  AU, 
e  =  1  +  3.97  x  10-16,  rp  =  2.1  km.  The  comet 
will  hit  the  Sun. 

6.7  The  orbital  elements  of  the  Earth  calculated 
from  Table  C.12  are  a  =  1.0000,  e  =  0.0167, 
i  =  0.0004°,  Q  =  -11.13°,  or  =  102.9°,  L  = 
219.5°.  The  geocentric  radius  vector  of  the  Sun 
in  the  ecliptic  coordinates  is 

/0.7583\ 
r  =  0.6673  . 

V  o.o  ) 


The  corresponding  equatorial  radius  vector  is 

/0.7583\ 
r  =  0.6089  , 

\0.2640/ 

which  gives  a  ~  2  h  35  min  3  s,  <5  ~  15.19°. 
The  exact  direction  is  a  =  2  h  34  min  53  s,  8  = 
15.17°. 

Chapter  7 

7.1  Assuming  the  orbits  are  circular,  the  greatest 
elongation  is  arcsin(s/l  AU).  For  Mercury  this 
is  23°  and  for  Venus  46°.  The  elongation  of  a  su¬ 
perior  planet  can  be  anything  up  to  1 80° .  The  sky 
revolves  about  15°  per  hour,  and  thus  correspond¬ 
ing  times  for  Mercury  and  Venus  are  1  h  30  min 
and  3  h  5  min,  respectively.  In  opposition  Mars  is 
visible  the  whole  night.  These  values,  however, 
depend  on  the  actual  declinations  of  the  planets. 

7.2  (a)  8.7°.  (b)  The  Earth  must  be  90°  from 
the  ascending  node  of  Venus,  which  is  the  situ¬ 
ation  about  13  days  before  vernal  and  autumnal 
equinoxes,  around  March  8  and  September  10. 

7.3  (a)  If  the  radii  of  the  orbits  are  <21  and  <22,  the 
angular  velocity  of  the  retrograde  motion  is 

dA  _  VGM 
df  yaia2(yal  +  -n/»D  ' 

(b)  In  six  days  Pluto  moves  about  0.128°  corre¬ 
sponding  to  4  mm.  For  a  main  belt  asteroid  the 
displacement  is  almost  4  cm. 

7.4  If  the  orbital  velocity  of  the  planet  is  v  the 
deviation  in  radians  is 

v  1  Igmq 

a  —  —  w  . 

c  c  V  a 

This  is  greatest  for  Mercury,  a  =  0.00016  rad  = 
33".  This  planetary  aberration  must  be  taken  into 
account  when  computing  accurate  ephemerides. 
The  deviation  is  largest  when  the  planet  is  in  con¬ 
junction  or  opposition  and  moves  almost  perpen¬ 
dicularly  to  the  line  of  sight. 
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7.5  p  =  0.1 1,  q  =  2,  and  A  =  0.2.  In  reality  the 
Moon  reflects  most  of  the  light  directly  back¬ 
wards  (opposition  effect),  and  thus  q  and  A  are 
much  smaller. 

7.6  Am  =  0.9.  The  surface  brightness  remains 
constant. 

7.7  The  absolute  magnitude  is  V(1,0)  =  23. 
(a)  m  =  18.7.  (b)  m  =  14.2.  At  least  a  15  cm  tele¬ 
scope  is  needed  to  detect  the  asteroid  even  one 
day  before  the  collision. 

7.8  Using  the  values  of  Appendix  C,  the  gravita¬ 
tional  acceleration  is  9.865  m/s2  at  the  poles  and 
9.799  m/s2  on  the  equator.  On  the  equator  then 
rotational  velocity  is  464  m/s  and  the  centrifugal 
acceleration  0.034  m/s2;  thus  the  total  accelera¬ 
tion  is  on  9.765  m/s2. 

Chapter  8 

8.1  Psid  =  11.9  a,  a  =  5.20  AU,  d  =  144,000  km. 
Obviously  the  planet  is  Jupiter. 

8.2  (a)  Hint:  If  there  is  a  synodic  period  P  there 
must  be  integers  p  and  q  such  that  ( n2  —n\)P  = 
2np  and  (n2  —  n\)  P  =  2tz  q .  Sometimes  one  can 
see  claims  that  the  configuration  of  the  whole 
planetary  system  will  recur  after  a  certain  period. 
Such  claims  are  obviously  nonsense,  (b)  7.06  d. 

8.3  Apply  (7.1),  where  now  P\  =  24.62  and 
P2  =  30.30.  Hence  Ph2  =  131.34h  =  5d  11  h 
20  min.  One  revolution  as  seen  from  the  surface 
of  Mars  takes  131.34  hours.  Thus  Deimos  seems 
to  move  360/131.34  =  2.74  degrees  an  hour. 

8.4  According  to  Example  8.1  the  distance  of 
the  Roche  limit  is  R  ~  2.5  x  TZ,  where  1Z  is  the 
radius  of  the  planet.  The  following  table  gives  the 
distances  of  the  nearest  moon  and  the  inner  edge 
of  the  brightest  ring  of  each  planet. 


Planet 

Nearest 

moon 

Radius 

of  the 
planet 

Radius  of 

moon’s 
orbit  [km] 

Ratio 

Earth 

Moon 

6378 

384,400 

60 

Mars 

Phobos 

3396 

9377 

2.8 

Jupiter 

Metis 

71,492 

127,690 

1.8 

— 

rengas 

— 

122,500 

1.7 

Saturn 

Pan 

60,268 

133,584 

2.2 

— 

ring 

— 

92,000 

1.5 

Uranus 

Cordelia 

25,559 

49,751 

1.9 

— 

ring 

— 

44,718 

1.7 

Neptune 

Naiad 

24,764 

48,227 

1.9 

— 

ring 

— 

63,000 

2.5 

The  nearest  satellites  and  inner  edges  of  rings 
are  either  close  to  the  Roche  limit  or  inside  it. 
The  Earth’s  Moon  is  an  exception  in  this  re¬ 
spect. 

8.5  The  densest  hexagonal  packing  gives  a  vol¬ 
ume  density  of 

n  2r  it 

- 7=  x  7  =  - 7=  ^  0.74. 

3\/3  VW3 r  3V2 

The  mass  of  hydrogen  atoms  that  could  be  packed 
in  one  cubic  metre  is  (1/5.5  x  10-11)3  x  1.67  x 
10~27  x  0.74  =  7428  kg.  The  density  of  Jupiter 
calculated  this  way  would  be  7428  kg/m3.  The 
actual  density  is  only  1326  kg/m3. 

Chapter  9 

9.1  cgadfeb;  the  actual  spectral  classes  from 
top  to  bottom  are  A0,  M5,  06,  F2,  K5,  G2,  B3. 

Chapter  10 

10.1  The  period  is  P  =  1  /\/2  years  and  the  rela¬ 
tive  velocity  42,100  ms-1.  The  maximum  sepa¬ 
ration  of  the  lines  is  0.061  nm. 

10.3  Substituting  the  values  to  the  equation  of 
Example  10.1  we  get  an  equation  for  a .  The  so¬ 
lution  is  a  =4.4  AU.  The  mass  of  the  planet  is 
0.0015  Mq. 
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Chapter  11 

11.1  10.5. 

11.2  (a)  9.5  x  1037  s-1.  (b)  The  neutrino  pro¬ 
duction  rate  is  1.9  x  1038  s-1,  and  each  second 
9  x  1028  neutrinos  hit  the  Earth. 

11.3  The  mean  free  path  is  1  //cp  ~  42,000  AU. 

Chapter  12 

12.1  tff  =  6.7  x  105  a.  Stars  are  born  at  the  rate 
0.75  Mq  per  year. 

12.2  ti  ~  400,000  a,  tn  ~  3  x  108  a. 

12.3  About  900  million  years. 

Chapter  13 

13.1  (a)  6.3  x  107  Wm“2.  (b)  16  m2. 

13.2  807  Wm“2. 

Chapter  14 

14.1  dr/r  =  -0.46,  dM  =  0.14. 

14.2  (a)  T  =  3570  K.  (b)  tfmin/tfmax  =  0.63. 

14.3  (a)  1300  pc.  (b)  860  years  ago;  due  to 
inaccuracies  a  safe  estimate  would  be  860  ± 
100  years.  Actually,  the  explosion  was  observed 
in  1054.  (c)  -7.4. 

Chapter  15 

15.1  L  =  2.3  x  1040  kgm2  s_1,  dR  =  45  m. 

15.2  (a)  M  =  0.5 M©,  a  =  0.49  x  109  ms-2  ~ 
4  x  107g.  (b)  A  standing  astronaut  will  be  subject 
to  a  stretching  tidal  force.  The  gravitational  ac¬ 
celeration  felt  by  the  feet  is  3479  ms-2  %  355 g 
larger  than  that  felt  by  the  head.  If  the  astronaut 
lies  tangentially,  (s)he  will  experience  a  com¬ 
pressing  force  of  177  g. 


15.3  v  =  ve(l  —  GM/(Rc2)).  If  Av/v  is  small, 
we  have  also  AA  =  (GM  /  Rc2)XQ.  A  photon 
emitted  from  the  Sun  is  reddened  by  2.1  x 
10_6Ae.  In  yellow  light  (550  nm)  the  change  is 
0.0012  nm. 

Chapter  16 

16.1  2.6  kpc  and  0.9  kpc,  a  =  1.5  mag/kpc. 

16.2  7  kms-1. 

16.3  The  velocity  of  the  proton  is  v  =  0.0462c  = 
1.38  x  107  ms  1 .  The  radius  of  the  orbit  is  r  = 
mv/qB  =  0.01  AU. 

Chapter  17 

17.1  7.3. 

17.2  The  potential  energy  is  approximately  U  = 
—  G(m2n2/(2R)),  where  m  is  the  mass  of  one 
star,  n  the  number  of  stars  (there  are  n(n  — 
l)/2  ~  n2 /2  pairs),  and  R  the  radius  of  the  clus¬ 
ter.  The  average  velocity  is  ~  V Gmn / (2R)  = 
0.5  kms-1. 

Chapter  18 

18.1  /i  =  0.0055"  a-1. 

18.2  (a)  5.7  kpc.  (b)  11  kpc.  Possible  reasons 
for  the  discrepancy  include:  (1)  The  distance  is 
so  large  that  the  approximations  used  for  deriv¬ 
ing  Oort’s  formulae  are  not  very  good.  (2)  Taking 
into  account  the  interstellar  extinction  will  reduce 
the  distance  in  (b).  (3)  The  peculiar  velocity  of  the 
star  was  neglected. 

18.3  (a)  3  (and  the  Sun),  (b)  The  number  is  of  the 
order  of  100,000.  This  is  a  typical  selection  ef¬ 
fect:  bright  stars  are  rare  but  they  are  visible  overt 
long  distances. 

18.4  (a)  If  the  thickness  of  the  disk  is  //,  the  light 
has  to  travel  a  distance  s  =  min{r,  (///2)sec/?}  in 
the  interstellar  medium.  Thus  the  magnitude  will 
be  m  =  M  +  5 lg(r/10  pc)  +  as.  (b)  s  =  200  pc, 
m  =  10.2. 
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Chapter  19 

19.1  (a)  26,  (b)  25. 

19.2  The  diameter  must  be  of  the  order  one  light- 
week  ~  1200  AU.  M  ~  —23.5.  If  this  is  the 
bolometric  magnitude,  the  luminosity  is  L  ~  2  x 
1011  L©,  corresponding  to  210L©AU-3. 

Chapter  20 

20.1  r  =  v/H  =  93  Mpc  (if  H  =  75),  diameter 
is  35  kpc,  M  =  —  20.4.  Potential  sources  of  error 
include:  (1)  inaccuracy  of  the  Hubble  constant, 
(2)  peculiar  velocity  of  the  galaxy,  (3)  intergalac- 
tic  extinction,  (4)  only  2-dimensional  projection 
is  observed,  and  the  edge  depends  on  the  limiting 
magnitude  used. 

20.2  If  H  =  50  kms-1  Mpc-1,  r  =  51  Mpc,  or 
74  times  the  distance  of  M31.  If  H  is  doubled, 
the  distance  is  reduced  to  half  of  that,  but  is  still 
higher  than  the  value  obtained  in  Exercise  19.1. 
A  possible  explanation  is  the  peculiar  velocity  of 
the  galaxy. 

20.3  mv  =  1.5  x  10-35  kg,  or  0.00002  times  the 
mass  of  the  electron. 

Chapter  21 

21.1  If  the  greenhouse  effect  is  neglected,  the 
distances  corresponding  to  temperatures  373  K 
and  273  K  are  0.47  AU  and  0.87  AU,  respec¬ 
tively.  The  effective  temperature  of  the  young  Sun 
was  T  =  i/OJ  x  5785  K  =  5291  K.  The  corre¬ 


sponding  limits  then  were  0.39  AU  and  0.73  AU. 
Thus  the  continuously  habitable  zone  would  ex¬ 
tend  from  0.47  to  0.73  AU. 

21.2  If  there  are  n  stars  in  a  volume  V,  the  dis¬ 
tance  between  the  nearest  neighbours  is  of  the  or¬ 
der  y/V/n.  If  binary  and  triple  stars  in  Table  C.17 
are  counted  as  single  objects,  there  are  47  stars 
in  a  volume  of  520  cubic  parsecs,  (a)  If  a  frac¬ 
tion  0.01  of  them  is  habited,  the  average  distance 
between  neighbouring  civilisations  is  10  parsecs; 
(b)  100  parsecs. 

21.3  Since  comets  are  mostly  dirty  ice,  the  den¬ 
sity  is  close  to  that  of  water.  Thus  the  mass  of  the 
comet  is  about  5  x  108  kg.  A  freely  falling  ob¬ 
ject  will  reach  a  velocity  that  is  the  same  as  the 
escape  velocity  from  the  Earth,  1 1  km/s.  The  ki¬ 
netic  energy  of  the  comet  is  at  least  3  x  1016  J, 
corresponding  to  7.6  megatons  of  TNT  or  500  Hi¬ 
roshima  bombs. 

Chapter  22 

22.1  The  centre  of  gravity  of  the  Sun  and  Jupiter 
is  about  743,000  km  from  the  centre  of  the  Sun, 
i.e.  somewhat  outside  the  solar  surface.  From  a 
distance  of  10  pc  this  subtends  an  angle  of  0.0005 
arcsecond.  Thus  the  total  amount  of  oscillation  is 
one  milliarcsecond. 

22.2  Jupiter  covers  a  fraction  0.0106  of  the  so¬ 
lar  disc,  corresponding  to  0.01 15  magnitudes.  For 
the  Earth  the  change  is  0.0001  magnitudes. 
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Chapter  2.  Spherical  Astronomy 

Plate  1  A  long  exposure  shows  the  apparent 
motion  of  the  celestial  sphere.  Pekka  Parviainen 
has  caught  in  his  image  the  star  trails,  a  trail  of 
the  crescent  moon,  and  a  trail  of  the  rising  Sun, 
taken  through  a  24  stop  ND-filter.  (Photo  www. 
polarimage.fi) 

Plate  2  Atmospheric  refraction  has  distorted 
and  discoloured  the  image  of  the  setting  Sun. 
A  rare  green  segment  can  be  seen  above  the  Sun. 
(Photo  Pekka  Parviainen) 

Chapter  3.  Observations  and  Instruments 

Plate  3  Astronomical  observatories  are  built  in 
high,  dry  places  with  a  clear  sky.  The  picture 
shows  the  summit  of  Mauna  Kea  in  Hawaii  with 
the  twin  domes  of  the  world’s  largest  telescopes, 
the  William  M.  Keck  Telescopes.  Each  of  them 
has  a  mosaic  mirror  with  a  diameter  of  10  meters. 
(Photo  W.M.  Keck  Observatory) 

Plate  4  The  next  generation  space  telescope, 
James  Webb,  is  under  construction  in  the  U.S.  It 
will  be  launched  about  2013.  It  will  have  a  6.5 
meter  mirror  made  of  18  segments.  The  telescope 
will  be  situated  at  the  L2  point  of  the  Sun-Earth 
system,  about  1.5  million  kilometers  from  the 
Earth.  (Illustration  NASA) 


Chapter  7.  The  Solar  System 

Plate  5  The  transits  of  Mercury  over  the  disk 
of  the  Sun  were  widely  observed  in  1999  and 

2003.  The  planet  is  nearing  the  limb  of  the  Sun  at 
the  end  of  the  latter  transit.  The  photograph  was 
taken  with  the  1  meter  Swedish  solar  telescope 
on  La  Palma.  (Photo  Royal  Swedish  Academy  of 
Sciences) 

Plate  6  Eclipses  are  astronomical  phenomena 
that  tend  to  attract  the  attention  of  the  general 
public.  A  composite  made  from  five  different  ex¬ 
posures  shows  the  extent  of  the  Earth’s  shadow 
and  the  movement  of  the  Moon  from  lower  right 
to  upper  left.  The  eclipse  occurred  in  October 

2004.  (Photo  Pekka  Parviainen) 

Plate  7  The  first  and  so  far  only  colour  pic¬ 
tures  from  the  surface  of  Venus  were  obtained  in 
1982.  Panoramic  views  from  Venera  13  (above) 
and  Venera  14  (below).  At  the  edges,  the  cam¬ 
era  is  looking  at  the  horizon  and  in  the  centre, 
at  the  ground  around  the  base  of  the  space  probe. 
Parts  of  the  base,  the  protective  covers  of  the  cam¬ 
era  and  a  colour  map  can  be  seen.  (Photo  Soviet 
Academy  of  Science) 

Plate  8  One  of  the  largest  impact  craters  on 
the  surface  of  the  Earth,  Lake  Manicouagan  in 
northern  Canada.  The  ancient  impact  structure 
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is  about  70  kilometers  in  diameter.  This  natural- 
colour  image  of  the  crater  was  acquired  by  Nasa’s 
Terra  satellite  in  June  2001 .  The  impact  is  thought 
to  have  occurred  about  212  million  years  ago. 
(Photo  NASA/JPL/MISR) 

Plate  9  The  last  people  to  have  walked  on  the 
Moon  were  geologist  Harrison  Schmitt  (in  the 
picture)  and  Eugene  Ceman,  who  took  the  photo. 
Schmitt  is  studying  a  big  boulder  at  the  Taurus- 
Littrow  landing  site  of  Apollo  17  spacecraft  in 
December  1972,  with  the  lunar  rover  in  the  fore¬ 
ground.  (Photo  NASA) 

Plate  10  The  ‘‘Burns  Cliff”  in  the  “Endurance 
Crater”  on  Mars.  The  photo  was  taken  by  the  Op¬ 
portunity  Mars  rover  in  November  2004.  Oppor¬ 
tunity  and  its  twin  Spirit  have  studied  the  surface 
of  Mars  in  unprecedented  detail  since  their  land¬ 
ing  in  early  2002.  The  mosaic  spans  about  180 
degrees  from  side  to  side.  (Photo  NASA) 

Plate  11  Colour-enhanced  picture  of  the  surface 
of  Europa.  The  surface  is  criss-crossed  by  linea¬ 
ments,  and  the  colour  differences  show  that  they 
are  younger  than  the  surrounding  areas.  The  sur¬ 
face  is  mainly  water  ice.  (Photo  Galileo/NASA) 

Plate  12  A  true  colour  mosaic  of  Jupiter  was 
taken  by  Cassini  spacecraft  in  2000,  when  Cassini 
made  its  closest  approach  to  Jupiter  on  its  way 
to  Saturn.  The  clouds  of  the  giant  planet  move 
in  reddish-brown  and  white  bands,  with  the  Great 
Red  Spot  and  several  smaller  white  ovals  in  view. 
The  smallest  visible  features  are  about  60  kilome¬ 
ters  across.  (Photo  NASA) 

Plate  13  A  simulated  image  of  the  rings  of  Sat¬ 
urn,  based  on  radio  occultation  measurements 
by  Cassini  spacecraft  in  May  2005.  Green  and 
blue  shades  indicate  regions  where  the  particles 
are  smaller  than  5  centimeters  and  1  centimeter 
across.  In  purple  regions ,  the  particles  are  mostly 
larger  than  5  centimeters.  The  white  area  is  the 
thickest  part  of  B-ring,  where  radio  signals  were 
blocked.  (Photo  NASA) 

Plate  14  The  first  view  from  the  surface  of  Ti¬ 
tan,  the  largest  moon  of  Saturn,  was  obtained  in 


January  2005,  when  the  European  Huygens  probe 
made  a  soft  landing.  The  boulders  are  made  of 
water  ice,  and  their  size  is  mostly  10-20  centime¬ 
ters.  The  surface  itself  is  methane-filled  mud  and 
sand.  (Photo  ESA/NASA) 

Plate  15  Details  of  the  icy  surface  of  Triton,  the 
largest  satellite  of  Neptune.  The  plain  in  the  cen¬ 
ter  is  called  Tuonela.  The  albedo  of  Triton  is  very 
similar  to  that  of  Pluto,  and  thus  this  kind  of  fea¬ 
ture  may  be  seen  by  the  New  Horizons  spacecraft 
in  July  2015  when  it  flies  by  Pluto.  (Photo  Voy¬ 
ager  2/NASA) 

Plate  16  Surface  and  jets  of  Comet  Wild  2.  The 
Stardust  spacecraft  flew  by  the  head  of  comet 
Wild  2  in  January  2004.  The  best  black-and-white 
photo  of  the  comet’s  surface  has  been  merged 
with  a  long-exposure  photo  showing  the  faint  dust 
jets  emanating  from  the  surface.  The  diameter  of 
the  comet’s  head  is  about  5  km.  (Photo  NASA) 

Chapter  9.  Stellar  Spectra 

Plate  17  Spectra  of  stars  of  different  spectral 
classes.  The  wavelength  range  is  in  the  visual 
part,  from  400  to  700  nanometers.  The  O  stars  at 
the  top  are  hottest,  and  the  M  stars  at  the  bottom, 
the  coolest.  (Photo  N OAO/AURA/NSF) 

Chapter  13.  The  Sun 

Plate  18  Three-colour  composite  of  the  solar 
surface,  photographed  by  the  Extreme  Ultravio¬ 
let  Imaging  Telescope  of  SOHO  satellite  in  May 
1998.  Three  ultraviolet  images  taken  at  differ¬ 
ent  wavelengths  (17.1  nm,  19.5  nm  and  28.4  nm) 
were  coded  red ,  yellow  and  blue  and  merged  into 
one  picture.  (Photo  SOHO/EIT/NASA/Goddard 
Space  Flight  Center) 

Chapter  16.  The  Interstellar  Medium 

Plate  19  One  of  the  most  detailed  visual-wave¬ 
length  pictures  of  the  Orion  Nebula  was  pub¬ 
lished  in  2006.  It  was  taken  by  the  Hubble  Space 
Telescope,  which  used  105  orbits  to  photograph 
the  nebula.  The  original  picture  has  1000  million 
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pixels.  Thousands  of  previously  unknown  stars 
were  found  in  the  picture,  and  also  many  new 
brown  dwarfs.  Orion  Nebula  is  a  large  dust  and 
gas  region,  giving  rise  to  new  stars.  The  width  of 
the  picture  is  about  half  a  degree  (the  same  as  full 
Moon).  (Photo  NASA/ESA/M.  Robberto) 

Plate  20  Orion  Nebula  photographed  by  the 
Spitzer  Telescope  in  infrared  light  in  2006.  While 
the  Hubble  picture  (Plate  19)  shows  bright  gas, 
with  dark  dust  lanes,  Spitzer  sees  the  dust  as 
bright  regions.  The  wavelength  of  8  microme¬ 
ters  has  been  coded  red ,  and  5.8  microns,  orange. 
They  represent  dust  heated  by  the  stars.  Light 
of  4.5  microns  (green)  shows  hot  gas  and  dust, 
and  light  of  3.6  microns  (blue)  is  from  starlight. 
(Photo  NASA) 

Plate  21  The  dark  cloud  Barnard  68  in  the  con¬ 
stellation  of  Ophiuchus.  B68  is  a  dense  molecu¬ 
lar  cloud,  which  is  just  starting  to  contract  and 
later  form  new  stars.  The  diameter  of  the  cloud 
is  7  light-months  (0.2  parsecs),  and  its  distance  is 
about  500  light-years.  It  was  photographed  by  the 
Antu  (VLT  1)  Telescope  in  1999.  (Photo  ESO) 

Plate  22  The  star  V838  Monocerotis  brightened 
suddenly  in  early  2002.  The  outburst  lasted  sev¬ 
eral  weeks  and  sent  a  sphere  of  light  spread¬ 
ing  into  the  surrounding  interstellar  medium.  The 
light-echo  has  illuminated  the  dust  clouds  around 
the  star  and  unveiled  never-before-seen  patterns. 
This  photo  was  taken  by  the  Hubble  Space  Tele¬ 
scope  at  the  end  of  2004,  nearly  three  years  after 
the  outburst.  (Photo  NASA/ESA) 

Plate  23  The  Rosette  Nebula  in  Monoceros  be¬ 
longs  to  the  same  extensive  dust  and  gas  complex 
as  the  Orion  Nebula.  Within  the  Rosette  Nebula 
there  is  a  young  star  cluster  called  NGC  2244, 
which  gives  energy  to  the  cloud.  The  ultraviolet 
light  from  the  stars  makes  the  hydrogen  glow  red , 
and  the  oxygen,  greenish.  The  picture  is  about  1 
degree  wide,  and  it  was  taken  by  the  MegaPrime 
Camera  at  the  Canada-France-Hawaii  Telescope 
on  Mauna  Kea  in  2003.  (Photo  CFHT/J.-C.  Cuil- 
landre) 


Plate  24  The  Helix  Nebula  is  one  of  the  nearest 
planetary  nebulae  and  also  one  of  the  apparently 
widest.  Its  diameter  is  nearly  one  degree  (two  full 
Moons  side  by  side),  but  its  surface  brightness  is 
so  low  it  cannot  be  seen  by  the  naked  eye.  It  is  lo¬ 
cated  in  the  constellation  of  Aquarius.  This  com¬ 
posite  photograph  combines  an  ultra- sharp  mo¬ 
saic  by  the  Hubble  Space  Telescope  with  a  wide 
view  by  the  WIYN  0.9  meter  Telescope  on  Kitt 
Peak.  (Photo  NASA/NOAO/ESA) 

Plate  25  Radio  image  of  the  Cassiopeia  A  su¬ 
pernova  remnant.  Cas  A  is  one  of  the  bright¬ 
est  radio  sources  in  the  sky.  This  image  was 
made  at  three  different  frequencies:  1.4  GHz, 
5.0  GHz  and  8.4  GHz,  with  the  Very  Large  Ar¬ 
ray  radio  telescope  in  New  Mexico.  The  radiation 
comes  mainly  from  relativistic  electrons.  (Image 
NRAO/AUI) 

Plate  26  The  remnant  of  the  Tycho  supernova 
in  X-rays.  The  supernova  exploded  in  1572  and 
was  observed  by  Tycho  Brahe.  During  the  last 
years,  the  Chandra  X-Ray  Observatory  has  made 
observations  of  this  object.  The  image  shows  an 
expanding  bubble  of  extremely  hot  gas  (in  green 
and  red)  inside  a  shell  of  rapidly  moving  elec¬ 
trons  (blue).  (Image  NASA/Chandra) 

Chapter  19.  Galaxies 

Plate  27  Part  of  the  Andromeda  Galaxy  M31, 
photographed  with  the  Japanese  Subaru  8.4  m 
telescope  on  Mauna  Kea,  Hawaii.  The  nucleus  of 
the  galaxy  is  outside  the  picture  on  the  upper  left. 
The  stars  with  the  vertical  lines  are  foreground 
stars  in  our  Milky  Way.  M31  is  a  sister  galaxy 
to  our  Milky  Way,  with  similar  star  clusters,  dust 
and  gas  clouds  and  spiral  arms.  (Photo  Subaru 
Telescope,  NOAJ) 

Plate  28  The  most  detailed  infrared  view  of 
the  Andromeda  Galaxy  so  far  was  obtained  by 
the  Spitzer  Telescope  in  2004.  While  the  Sub¬ 
aru  picture  (Plate  27)  shows  the  starlight  from 
the  galaxy,  the  Spitzer  picture  records  the  infrared 
radiation  from  the  dust  in  the  spiral  arms  at  the 
wavelength  of  24  micrometers.  The  spiral  arms 
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with  their  warm  dust  and  star-forming  clouds  can 
be  seen  much  more  clearly  than  at  visual  wave¬ 
lengths.  (Photo  Spitzer/NASA) 

Plate  29  The  central  part  of  the  active  galaxy 
Centaurus  A,  NGC  5128,  seen  by  the  Hubble 
Space  Telescope.  The  whole  galaxy  is  crossed  by 
a  thick  dust  lane  with  intricate  wisps  of  warm 
dust  and  gas.  At  the  centre  of  the  galaxy,  behind 
the  dust  shroud,  lies  a  black  hole,  which  sends 
powerful  jets  in  two  directions.  The  jets  can  be 
seen  only  in  X-rays  or  at  radio  wavelengths.  For 
a  wider  view  of  Centaurus  A  see  the  next  Plate. 
(Photo  NASA) 

Plate  30  New  pictures  of  the  outer  parts  of 
Centaurus  A  have  revealed  several  ring-like  fea¬ 
tures,  which  contain  faint  blue  stars.  The  rings 
may  have  been  formed  in  interactions  with  other, 
smaller  galaxies,  which  have  collided  with  Cen¬ 
taurus  A  and  lost  part  of  their  stars  to  this  giant 
galaxy.  (Photo  Cerro  Tololo  International  Obser¬ 
vatory,  NOAO/NSF) 

Plate  31  A  well-known  galaxy-quasar  pair:  in 
the  center  the  galaxy  NGC  4319,  and  on  the  up¬ 
per  right,  the  quasar  Markarian  215.  Although  the 
pair  seems  to  belong  together,  in  reality  this  is 
a  case  of  chance  alignment.  The  distance  to  the 
quasar  is  14  times  larger  than  the  distance  to  the 
galaxy.  (Photo  Hubble  Space  Telescope/NASA) 


there  seem  to  be  six  galaxies  engaging  in  a  cos¬ 
mic  dance.  The  number  of  interacting  galaxies  is 
however  only  four.  The  small  well-defined  spiral 
galaxy  to  the  right  of  the  center,  lies  nearly  five 
times  further  away  than  the  others.  And  the  right¬ 
most  “galaxy”  isn’t  actually  a  galaxy  at  all,  but 
a  long  “tidal  tail”  of  stars  torn  off  from  the  other 
galaxies.  (Photo  Hubble  Space  Telescope/NASA) 

Chapter  20.  Cosmology 

Plate  33  The  Hubble  Space  Telescope  has  pho¬ 
tographed  several  deep  sky  fields  to  get  represen¬ 
tative  samples  of  the  Universe.  This  photograph  is 
the  Hubble  Deep  Field  South  in  the  constellation 
Tucana,  and  it  contains  thousands  of  never-before 
seen  galaxies  up  to  the  distance  of  12  Ga.  The 
small  red  dot  in  the  middle  of  the  photograph  is 
a  distant  quasar,  whose  light  was  studied  with  the 
Space  Telescope  Imaging  Spectrograph,  to  map 
invisible  clouds  of  hydrogen  gas  between  us  and 
the  quasar.  (Photo  NASA) 

Plate  34  Temperature  fluctuations  in  the  cosmic 
microwave  background.  The  observations  were 
made  with  the  Wilkinson  Microwave  Anisotropy 
Probe  (WMAP)  in  2002.  From  the  scale  depen¬ 
dence  of  the  fluctuations  we  can  estimate  the  av¬ 
erage  density  of  the  Universe  (very  near  the  crit¬ 
ical  density),  and  the  age  of  the  Universe  (about 
13.7  Ga).  (Photo  WMAP/NASA) 


Plate  32  A  small  group  of  interacting  galax¬ 
ies  called  Seyfert’s  Sextet.  As  the  name  implies, 
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radius,  15,  24 
rotation,  34,  38 
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Extreme  ultraviolet,  82 
Eye,  56 
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Galactic  habitable  zone,  448 
Galactic  north  pole,  22 
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Giant,  236 
Giant  branch,  277 
Giant  phase,  269,  274,  276 
Giant  planets,  142 
atmosphere,  153 
Giant  star,  232 

Globular  cluster,  270,  277,  303 
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GRS-80,  15 


Index 


539 


H 

h  and  x  Persei,  360 
Ha  line,  287,  288,  293 
Habitable  zone,  448 
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fragmentation,  349 
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density,  356 
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540 


Index 


element  abundances,  340 
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Longitude  of  the  ascending  node,  127,  128 
Lorentz  transformation,  474 
Love  number,  175 
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Orbital  resonance,  202 
Orbital  velocity,  134,  137 
Orion,  336 

Orion  nebula,  336,  342,  344,  346-348 
Osculating  elements,  128,  482 
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Period-luminosity  relation,  276,  302 

Permanent  tide,  175 

Perseids,  223 

Perseus  arm,  374 

Perturbations,  130 

Perturbograph,  131 

Phase,  145 

Phase  angle,  145,  161,  163,  481 
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flat,  428 
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natural  width,  107 
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Starquake,  317 
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motion,  374 
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variable,  see  Variables 
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Stromgren’s  four-colour  system,  95 
Subduction  zone,  190 
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Sudbury  neutrino  detector,  87 
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spectral  class,  283 
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Sunspot  number,  290,  29 1 
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548 


Index 
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core  collapse,  311 
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Symbiotic  stars,  307 
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Synodic  month,  147 
Synodic  period,  145 
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T 
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optical,  497 
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Temperature,  114 
antenna,  114,  341 
brightness,  114,  115,  341 
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effective,  113,  114,  119,  232,  235 
excitation,  110,  116,  342 
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kinetic,  115,  136 
spin,  342 

Temperature  gradient  in  stars,  250 
Terrestrial  planets,  142 
core,  152 
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Tethys,  207 
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Thermal  radiation,  112,  166 
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Thermosphere,  191 
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Threshold  temperature,  433 
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mean  sidereal,  35,  38 
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solar,  19,  35,  38 
terrestrial,  39 
terrestrial  dynamical,  39 
units,  479 
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dynamical,  264,  271 
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Titan,  207,  208,  454 
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TM,  see  Time,  mean  solar 
TNO,  215 
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Trojan  asteroids,  131,  217 
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Tropical  year,  146 
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TTauri  star,  169,  266,  276,  300,  304 
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Umbriel,  210 
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isotropic,  426,  427,  433 
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Uranometria,  3 1 
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magnetic  field,  160,  210 
rings,  209,  210 
rotation,  208 
URCA  process,  315 
Urey,  Harold,  450 
USNO-Bl.O  Catalog,  34 
UT,  see  Time,  universal 
UT0,  38 
UT1,  38 

UTC,  see  Time,  coordinated  universal 


UV  Ceti  stars,  304 
Uvby  system,  95 

V 

VI 057  Cygni,  304,  305 
V1357  Cyg,  322 
V1500  Cygni,  308 
Vaisala,  Yrjo,  60 
Valles  Marineris,  196,  197 
Van  Allen’s  belts,  158 
Van  de  Hulst,  Hendrik,  341 
Van  den  Bergh,  Sidney,  333,  393 
Van  der  Waals  force,  170 
Variable  stars,  see  Variables 
Variables,  299 

classification,  299 
eclipsing,  300 
eruptive,  300,  303,  304 
irregular,  303 
names,  299 
pulsating,  300,  301 
rotating,  301 
semiregular,  303 
Vector,  465 
Vector  product,  466 
Vector  triple  product,  467 
Vega,  23,  96,  230,  369 
Veil  nebula,  351,  352 
Vela  pulsar,  317,  355 
Velocity 
radial,  362 
tangential,  362 
Velocity  of  light,  478 
Venera,  186 
Venus,  142,  185 

atmosphere,  185,  188 
atmospheric  pressure,  185 
clouds,  188 
greenhouse  effect,  167 
impact  craters,  187 
lava  flows,  187 
phases,  185 
rotation,  185 
surface,  186,  187 
temperature,  167,  185 
transit,  151 
volcanoes,  186 
Venus  Express,  186 

Vernal  equinox,  17,  21,  23,  34,  35,  126,  146 
Vertical,  17 

Vibrational  transitions,  347 
Vidicon  camera,  70 
Viking,  199 

Virgo  Cluster,  407,  408,  424 
Virial,  134 

Virial  theorem,  134,  394 
VLBI,  80 

Vogt-Russell  theorem,  252 
Voigt  profile,  108 


550 


Index 


Voyager  1,  206 
Voyager  2,  206 

W 

Warrawoona  Group,  45 1 

Wavelength,  28,  428 

Weber  cylinder,  87 

Weird  Terrain,  184 

Weizsacker,  Carl  Friedrich  von,  254 

Whipple,  F.,  220 

White  dwarf,  235,  236,  269-271,  274,  313 
radius,  313,  319 
Wien  approximation,  113,  115 
Wien  displacement  constant,  113 
Wien  displacement  law,  113,  118 
Wilson,  Robert,  425 
Winter  solstice,  146 
Wolf,  Charles,  234 
Wolf  diagram,  332,  333 

Wolf-Rayet  star,  229,  234,  270,  273,  275-277,  310 
W  UMa  stars,  245 
W  Virginis  star,  300,  302,  375 

X 

X-ray  burster,  323,  324 
X-ray  pulsar,  322,  324 


X-rays,  81,  82,  408 
solar,  295,  296 
Xena,  214 
XUV,  82 

Y 

Year,  40,  479 

anomalistic,  146 
sidereal,  36,  130,  146 
tropical,  36,  146 
Yerkes  Observatory,  57 

Z 

£  Persei,  360,  361 
ZAMS,  258 

Zeeman  effect,  109,  234,  291,  356 

Zenith,  16 

Zenith  angle,  174 

Zenith  distance,  17,  26,  99 

Zero  age  main  sequence,  see  ZAMS 

Zero-point  energy,  433 

Zodiacal  light,  224 

Zone  catalogue,  32 

Zone  of  avoidance,  331 


